Open-source benchmark EVMbench tests how well AI agents handle smart contract exploits