Battling bots face off in cybersecurity arena

Fri Feb 13 2026

Zero-day

www.csoonline.com

AI agents are increasingly seen as a way to reinforce the capabilities of cybersecurity teams — but which can do the best job? Wiz has developed a benchmark suite of 257 real-world challenges spanning five offensive domains: zero-day discovery, CVE (code vulnerability) detection, API security, web security, and cloud security to find out. Wiz tests different combinations of AI agents and their underlying AI models against the test suite to see which score the highest in each of the five categories. Scoring is deterministic and programmatic using several factors: multi-dimensional rubrics for zero-day and CVE detection; endpoint-and-severity matching for API security and lag capture for web a...