Microsoft has open-sourced an AI evaluation framework that converts natural-language requirements into executable tests, expanding its push into enterprise AI governance as organizations struggle to validate agent behavior before production deployments systematically.
The framework, called ASSERT (Adaptive Spec-driven Scoring for Evaluation and Regression Testing), generates evaluation scenarios, datasets, metrics, and scorecards from written specifications, product requirements, and governance documents, Microsoft said in a blog post announcing the release.
“Agents fail in ways that are hard to see,” Microsoft wrote in the blog post. “They drift from policy, produce unsafe outputs in edge cases, and behave differently in production than they did in testing. Generic benchmarks do not catch these failures because they are not built around your policies, your agent, or your use case.”
Rather than requiring developers to manually create evaluation suites, ASSERT translates written intent
Microsoft's AI expansion in China could face regulatory challenges, impacting global AI market dynamics and U.S.-China tech relations.
The post Microsoft expands AI model business in China amid OpenAI, Anthropic concerns appeared first on Crypto Briefing.
Insider Brief XDOF emerged from stealth with $70 million in funding to develop the infrastructure for robot foundation models. The company is developing datasets, robotic systems and software tools designed to help robotics companies and research labs build more capable physical AI systems. Backers include Thrive Capital, Andreessen Horowitz, Spark Capital, Lux and WnderCo, according […]
Microsoft's AI expansion in China highlights the tension between cost-effective innovation and geopolitical tech security concerns.
The post Microsoft builds AI model business in China amid US concerns from OpenAI and Anthropic appeared first on Crypto Briefing.
A Microsoft and Huazhong University benchmark tested GPT-4o, GPT-5, Grok-3, and others on realistic enterprise data scenarios. Privacy violation rates hit 50.9%. More capable models made it worse, and the fix has nothing to do with model selection...
Microsoft has introduced usage-based billing for Copilot Cowork, which is now generally available.
Microsoft unveiled Copilot Cowork in March, pitching it as an AI agent that’s capable of independently performing long-running, multi-step tasks — even when a user’s computer is off.
It’s built on the same technology that underpins Anthropic’s Claude Cowork. Unlike Claude Cowork, which can interact directly with files and applications on a user’s computer, Copilot Cowork runs in Microsoft’s cloud environment and acts on documents held in a customer’s Microsoft 365 tenant.
Copilot Cowork now comes with usage-based billing.
Microsoft
On Tuesday, Microsoft unveiled pricing details for Copilot Cowork, which involves usage-based billing in addition to a Microsoft 365 Copilot license ($30 per user each month for large enterprises before discounts, and $20 for Microsoft 365 Copilot for Business).
The usage-based pricing is calculated from four components, according to Microsoft: “model
Despite best efforts by defenders, malicious emails continue to slip through the cybersecurity cracks, leading some enterprises to implement a layered “defense in depth” strategy that incorporates multiple tools.
Microsoft seems to be challenging this idea, revealing that there are only nominal returns from adding integrated pre- and post-send partners to Defender for Office 365’s protections.
According to its new quarterly benchmarking data, the tech giant catches the vast majority of malicious and spam emails before delivery, misses the fewest compared to competitors by a wide margin, and removes nearly 100% of dangerous emails that do reach the inbox. Collectively, its integrated partners improve that catch rate by less than .05%.
While these numbers seem to tip the scales towards a one-vendor email security stack, experts urge enterprises to be skeptical and cautious of such vendor claims.
Seva Ioussoufovitch, senior research analyst at Info-Tech Research Group, pointed out, “perce
The failed leasing talks may slow infrastructure expansion, impacting both companies' growth strategies and competitive positioning in cloud services.
The post Microsoft and Oracle cloud infrastructure leasing talks reportedly fall through appeared first on Crypto Briefing.