Show HN: Agentic Arena – 52 tasks implemented by Opus 4.5, Gemini 3, and GPT-5.1 arena.logic.inc 1 points by sgk284 8 hours ago
lostmsu 7 hours ago How does one vote? The name of the model that made the game should be hidden.Is there a leaderboard? sgk284 7 hours ago We put this together mostly just to do side-by-side comparisons, though you make a good point. It'd be fun to blind-vote on your favorite impl.
sgk284 7 hours ago We put this together mostly just to do side-by-side comparisons, though you make a good point. It'd be fun to blind-vote on your favorite impl.
How does one vote? The name of the model that made the game should be hidden.
Is there a leaderboard?
We put this together mostly just to do side-by-side comparisons, though you make a good point. It'd be fun to blind-vote on your favorite impl.