New Microsoft tool lets devs spin up AI behavior tests using text descriptions

TechCrunch
Microsoft has released ASSERT, an open-source tool that generates and runs AI behavior tests based on natural-language descriptions of intended goals and policies.

Summary

Microsoft has introduced ASSERT (Adaptive Spec-driven Scoring for Evaluation and Regression Testing), an open-source framework designed to help developers create application-specific AI evaluations. By converting natural-language descriptions of desired behaviors, policies, and constraints into structured test cases, the tool allows developers to verify that AI systems function as intended within specific products. ASSERT generates test scenarios, runs them against the target system, and provides detailed scoring and audit trails of the AI's decision-making paths, making it useful for both initial development and continuous post-deployment monitoring.

(Source:TechCrunch)