Swirlsmith
Back to leaderboard
#2

Roo Code

๐Ÿ”“ Open

Fork of Cline with custom modes (Architect, Code, Debug), boomerang orchestration, and enhanced context management.

๐Ÿ’ฐ Open-source VS Code extension ยท vscode, jetbrains

76.1Overall Score
Non-Gameable Scoring

Scores are derived from established benchmarks, adjusted for harness-specific performance across four dimensions: Coding, Reasoning, Tool Use, and Autonomy.

Each dimension starts from public benchmark data and applies harness-specific modifiers based on tool integration, context handling, and orchestration quality. The overall score is a weighted composite that penalizes narrow optimization.

ModelOverall
Claude Opus 4.676.1
Gemini 3.1 Pro65.3
Kimi K2.564.6
GPT-5.459.4
Claude Sonnet 4.659.1
MiniMax-M2.757.5
DeepSeek R150.4
Qwen 3.548.9
Gemini 3 Pro45.9
MiMo-V2-Flash44.0