OpenAI's LifeSciBench evaluates whether frontier AI can handle real life-science research across 750 expert-authored tasks, seven workflows, and seven biological domains. Built by 173 PhD scientists with 19,020 rubric criteria, it grades reasoning and decisions, not just recall. The best model, GPT-Rosalind, passes 36.1%, leaving large headroom on artifacts, exact outputs, and operational calls.
The post OpenAI Releases LifeSciBench, a 750-Task Benchmark Grading AI Models on Real Life-Science Research With Expert-Written Rubric appeared first on MarkTechPost.
The EU's pursuit of Anthropic's AI model highlights the complex interplay between cybersecurity needs and international tech regulations.
The post EU Commission confirms meeting with Anthropic on cybersecurity appeared first on Crypto Briefing.
The AI standoff highlights the growing appeal of decentralized AI solutions, potentially reshaping investment strategies and global tech dynamics.
The post Trump says negotiations with Anthropic are progressing well amid months-long AI standoff appeared first on Crypto Briefing.
For generations, technology export controls referred to the transfer of source code to other countries. But that no longer works, as the latest Anthropic fight with the US Commerce Department makes clear.
On Friday, Anthropic announced that it had received instructions from Commerce “to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees. The net effect of this order is that we must abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance. Access to all other Anthropic models will not be affected.”
Technically, the Commerce letter doesn’t explicitly say that, but lawyers and consultants argue that, combined with an earlier executive order declaring Anthropic a supply chain risk, that may well be what it means.
What the Commerce letter says is that Anthropic needs a license to export Fable 5 and Mythos 5 (a “deemed export”), listing four circu
Shazeer's move to OpenAI could significantly shift AI innovation dynamics, impacting competitive strategies and technological advancements in the field.
The post Noam Shazeer joins OpenAI after leaving Google appeared first on Crypto Briefing.
LifeSciBench's rigorous evaluation of AI in life sciences could redefine research methodologies, enhancing AI's role in scientific innovation.
The post OpenAI launches LifeSciBench to evaluate AI in life sciences appeared first on Crypto Briefing.
Microsoft's AI expansion in China highlights the tension between cost-effective innovation and geopolitical tech security concerns.
The post Microsoft builds AI model business in China amid US concerns from OpenAI and Anthropic appeared first on Crypto Briefing.
Days before Anthropic took its most advanced AI models offline, the White House ordered the company to revoke SK Telecom’s access to Claude Mythos over alleged ties to China.