Category Weight: 1.0x

Coding Tasks

General programming challenges including algorithm implementation, data structure design, and system architecture tasks.

Best Score: 0.0
Avg Score: 0.0
Tests: 3

Performance Over Time — All Models

Model Rankings

Rank  Model              Category score  vs. best  Tokens  Total
1     Claude Opus 4.8    99.0            BEST      8.6k    8.6k
2     Grok               98.7            -0.3 pts  67.8k   67.8k
3     Claude Sonnet 4.6  97.3            -1.7 pts  9.1k    9.1k
4     GPT-5.5            96.7            -2.3 pts  37.4k   37.4k

Test Breakdown

Graph Algorithm Implementation

Implement Dijkstra's algorithm with a priority queue and handle edge cases

Claude Opus 4.8: 99.0
Grok: 98.7
Claude Sonnet 4.6: 97.3
GPT-5.5: 96.7

REST API Design

Design and implement a paginated REST API with filtering

Claude Opus 4.8: 99.0
Grok: 98.7
Claude Sonnet 4.6: 97.3
GPT-5.5: 96.7

Concurrent Data Pipeline

Build a producer-consumer pipeline with backpressure handling

Claude Opus 4.8: 99.0
Grok: 98.7
Claude Sonnet 4.6: 97.3
GPT-5.5: 96.7