Swirlsmith
#11

Amazon Q Developer

🔒 Closed

AWS-integrated AI developer agent with security scanning, multi-file editing, and cloud-native task automation.

💰 Included with AWS subscription · vscode, jetbrains, cli, web, api

64.7 Overall Score
Non-Gameable Scoring

Scores are derived from established benchmarks, adjusted for harness-specific performance across four dimensions: Coding, Reasoning, Tool Use, and Autonomy.

Each dimension starts from public benchmark data and applies harness-specific modifiers based on tool integration, context handling, and orchestration quality. The overall score is a weighted composite that penalizes narrow optimization.
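The page does not publish the actual formula, so the following is only a minimal sketch of how such a composite could work. The weights, the multiplicative modifier, and the spread-based penalty are all assumptions chosen for illustration, as are the names `adjusted` and `overall`.

```python
from statistics import pstdev

# Hypothetical values: the page does not state the real weights or penalty.
WEIGHTS = {"coding": 0.25, "reasoning": 0.25, "tool_use": 0.25, "autonomy": 0.25}
SPREAD_PENALTY = 0.5  # assumed strength of the penalty on uneven profiles


def adjusted(benchmark: float, modifier: float) -> float:
    """Scale a 0-100 benchmark score by a harness modifier, clamped to 0-100."""
    return max(0.0, min(100.0, benchmark * modifier))


def overall(dimensions: dict[str, float]) -> float:
    """Weighted mean of the four dimensions minus a penalty on their spread."""
    weighted = sum(WEIGHTS[name] * score for name, score in dimensions.items())
    spread = pstdev(list(dimensions.values()))  # zero when all dimensions match
    return max(0.0, weighted - SPREAD_PENALTY * spread)


balanced = {"coding": 60.0, "reasoning": 60.0, "tool_use": 60.0, "autonomy": 60.0}
spiky = {"coding": 90.0, "reasoning": 50.0, "tool_use": 50.0, "autonomy": 50.0}
```

Under these assumptions, `balanced` and `spiky` have the same weighted mean, but `spiky` scores lower overall because its strength is concentrated in one dimension. That is one way a composite can penalize narrow optimization.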

Model               Overall
Claude Opus 4.6     64.7
Claude Sonnet 4.6   48.7
Claude Sonnet 4.5   30.7
Claude Opus 4.5     29.9
Claude Opus 4.1     22.5
Claude Opus 4       17.0