Flawed AI Safety Tests: Experts Uncover Critical Gaps in Hundreds of Benchmarks
Security experts found critical weaknesses in more than 440 AI safety benchmarks used by major tech companies. As a result, scores from these benchmarks may be misleading or irrelevant for evaluating AI model safety.