Back to Dashboard
CategoryWeight: 1.0x
Bug Fixes
Identify and fix bugs in existing codebases, including race conditions, off-by-one errors, and logic flaws.
Best Score
0.0Avg Score
0.0Tests
3Performance Over Time — All Models
Model Rankings
Test Breakdown
Off-by-One Boundary Fix
Fix pagination logic that skips the last page of results
Claude Sonnet 4.6
96.7Grok
96.3GPT-5.5
93.3Claude Opus 4.8
90.0Race Condition Detection
Find and fix a subtle race condition in async queue processing
Claude Sonnet 4.6
96.7Grok
96.3GPT-5.5
93.3Claude Opus 4.8
90.0Memory Leak Fix
Identify and patch a memory leak caused by unclosed event listeners
Claude Sonnet 4.6
96.7Grok
96.3GPT-5.5
93.3Claude Opus 4.8
90.0