Building AI Capability Benchmarks for Cyber Offense by Andrey Anurin
Explore the theory and practice behind AI capability benchmarks, focusing on cyber offense. Learn why these benchmarks are crucial and how they're constructed. Essential for AI researchers, security professionals, and those interested in AI evaluation methodologies.
See the hackathon and sign up here: https://www.apartresearch.com/event/agent-security-hackathon Join us online for the live Q&A and project presentations at https://discord.gg/E5DweGEGjJ
Join future hackathons at https://apartresearch.com/sprints. Our moderator and organizer is Esben Kran and Apart Research.
Explore the theory and practice behind AI capability benchmarks, focusing on cyber offense. Learn why these benchmarks are crucial and how they’re constructed. Essential for AI researchers, security professionals, and those interested in AI evaluation methodologies.
See the hackathon and sign up here: https://www.apartresearch.com/event/agent-security-hackathon Join us online for the live Q&A and project presentations at https://discord.gg/E5DweGEGjJ
Join future hackathons at https://apartresearch.com/sprints. Our moderator and organizer is Esben Kran and Apart Research.