#1
Hermes Agent
Multi-modal agentic harness with persistent memory, tool orchestration, and autonomous task execution. Built for production-grade AI workflows.
94.2 (+1.3% over 24h)
| Model | Overall |
|---|---|
| Claude Sonnet 4.6 | 96.1 |
| GPT-5.4 | 93.8 |
| Grok 3 | 92.5 |
| Gemini 2.5 Pro | 91.2 |