Back to Dashboard
CategoryWeight: 1.0x

Performance Efficiency

Tests whether generated code uses efficient algorithms and avoids unnecessary computation, memory allocation, and I/O.

Best Score

0.0

Avg Score

0.0

Tests

3

Performance Over Time — All Models

Model Rankings

1
Grok

Category score

View
94.7BEST
Tokens86.8k
Total86.8k
2
Claude Sonnet 4.6

Category score

View
94.5-0.2 pts
Tokens14.5k
Total14.5k
3
Claude Opus 4.8

Category score

View
94.0-0.7 pts
Tokens26.1k
Total26.1k
4
GPT-5.5

Category score

View
92.0-2.7 pts
Tokens44.0k
Total44.0k

Test Breakdown

Algorithm Complexity

Solve a problem in O(n log n) instead of naive O(n^2)

Grok
94.7
Claude Sonnet 4.6
94.5
Claude Opus 4.8
94.0
GPT-5.5
92.0

Memory-efficient Processing

Process a large file using streaming instead of loading into memory

Grok
94.7
Claude Sonnet 4.6
94.5
Claude Opus 4.8
94.0
GPT-5.5
92.0

Query Optimization

Write database queries that use indexes and avoid N+1 patterns

Grok
94.7
Claude Sonnet 4.6
94.5
Claude Opus 4.8
94.0
GPT-5.5
92.0