Swirlsmith
Back to leaderboard
#12

Windsurf

๐Ÿ”‘ Semi-Open

Flow-based agentic IDE with Cascade technology for multi-step autonomous coding and contextual awareness.

๐Ÿ’ฐ $15/mo Pro, $30/mo Enterprise ยท vscode, web

64.4Overall Score
Non-Gameable Scoring

Scores are derived from established benchmarks, adjusted for harness-specific performance across four dimensions: Coding, Reasoning, Tool Use, and Autonomy.

Each dimension starts from public benchmark data and applies harness-specific modifiers based on tool integration, context handling, and orchestration quality. The overall score is a weighted composite that penalizes narrow optimization.

ModelOverall
Claude Opus 4.664.4
Gemini 3.1 Pro54.9
Kimi K2.554.4
Claude Sonnet 4.648.5
GPT-5.447.9
MiniMax-M2.747.0
DeepSeek R142.5
Qwen 3.540.8
Gemini 3 Pro37.6
MiMo-V2-Flash37.2