AI agent benchmark results across security platforms

Challenges: -
Solved: -
Best Model: -
Traces: -
Model Performance & Cumulative Solves
Model Leaderboard
Model | Solver | pass@N | Solved / Attempted | Completion | Avg Turns
Challenge Status
Filters: Model | Difficulty | Status
Date | Replay | Report | Challenge | Difficulty | Status | Turns | Duration | Model | Version