Swirlsmith
Back to leaderboard
#9

Augment Code

๐Ÿ”“ Open

Enterprise-grade agentic coding platform with deep codebase understanding, multi-repo context, and persistent workspace memory.

๐Ÿ’ฐ Open-source ยท vscode, jetbrains, web

70.0Overall Score
Non-Gameable Scoring

Scores are derived from established benchmarks, adjusted for harness-specific performance across four dimensions: Coding, Reasoning, Tool Use, and Autonomy.

Each dimension starts from public benchmark data and applies harness-specific modifiers based on tool integration, context handling, and orchestration quality. The overall score is a weighted composite that penalizes narrow optimization.

ModelOverall
Claude Opus 4.670.0
Gemini 3.1 Pro59.4
Kimi K2.559.1
GPT-5.452.8
Claude Sonnet 4.652.7
MiniMax-M2.751.0
DeepSeek R146.4
Qwen 3.545.1
Gemini 3 Pro40.9
MiMo-V2-Flash40.1