Back to Dashboard
CategoryWeight: 1.0x
Security Awareness
Tests whether the model proactively identifies and avoids security vulnerabilities like injection, XSS, and insecure defaults.
Best Score
0.0Avg Score
0.0Tests
3Performance Over Time — All Models
Model Rankings
Test Breakdown
SQL Injection Prevention
Build a query layer that properly parameterizes all user input
Claude Sonnet 4.6
95.7Claude Opus 4.8
94.5GPT-5.5
90.9Grok
84.3XSS Mitigation
Render user-generated content without introducing XSS vectors
Claude Sonnet 4.6
95.7Claude Opus 4.8
94.5GPT-5.5
90.9Grok
84.3Secret Management
Implement config loading that never logs or exposes secrets
Claude Sonnet 4.6
95.7Claude Opus 4.8
94.5GPT-5.5
90.9Grok
84.3