Back to leaderboard
#5
Hermes Agent
๐ OpenMulti-modal agentic harness with persistent memory, MCP tool orchestration, sub-agent delegation, and autonomous task execution.
๐ฐ Open-source, self-hostable ยท cli, api, web
72.6Overall Score
| Model | Overall |
|---|---|
| Claude Opus 4.6 | 72.6 |
| Gemini 3.1 Pro | 64.6 |
| Kimi K2.5 | 62.1 |
| GPT-5.4 | 58.1 |
| Claude Sonnet 4.6 | 57.3 |
| MiniMax-M2.7 | 55.0 |
| DeepSeek R1 | 47.9 |
| Qwen 3.5 | 47.4 |
| Gemini 3 Pro | 45.9 |
| MiMo-V2-Flash | 43.0 |