Swirlsmith

AI Models Leaderboard

Real-time benchmark rankings of 464 AI models across40 benchmarks from 7 sources.

Total Models
464
Benchmarks
40
Data Sources
7
#ModelProviderBest HarnessSWE-benchGPQAPrice ($/1M)Speed (tok/s)# Benches
1Claude Opus 4.6Anthropic
76.9Claude Code
80.8091.30$10.004135
2Gemini 3.1 ProGoogle
66.7Devin
80.6094.30$2.508733
3Kimi K2.5Kimi
64.6Roo Code
76.8087.60$1.204327
4Claude Sonnet 4.6Anthropic
60.1Claude Code
79.6089.90$6.004330
5GPT-5.4OpenAI
59.6Devin
79.5092.80$5.636821
6MiniMax-M2.7MiniMax
57.5Roo Code
77.20-$0.53457
7DeepSeek R1DeepSeek
50.4Roo Code
49.2071.50$0.28-18
8Qwen 3.5Alibaba
49.1Devin
76.4088.40--14
9Gemini 3 ProGoogle
47.5Devin
76.2091.90-3016
10MiMo-V2-FlashXiaomi
44.5Devin
64.50-$0.151178
11GPT-5.2OpenAI
42.6Devin
80.0092.40$4.817115
12GLM-5Z AI
42.3Devin
77.80-$1.555910
13DeepSeek V3.2DeepSeek
41.5Devin
67.8079.90$0.323413
14Claude Sonnet 4.5Anthropic
41.2Claude Code
-83.40$3.00978
15Gemini 3 FlashGoogle
40.8Devin
78.0090.40$1.1317115
16GPT-5 (high)OpenAI
40.0Devin
-87.30$3.441014
17Qwen3.5 397B A17BAlibaba
39.9Devin
76.4088.40$1.358414
18DeepSeek V3DeepSeek
38.9Devin
38.8068.40$0.28-15
19Claude Opus 4.5Anthropic
38.8Devin
80.9087.00$10.00469
20MiniMax-M2.5MiniMax
38.8Roo Code
80.2085.20$0.535216
21GPT-5 miniOpenAI
38.0OpenClaw
68.5082.30$0.252212
22GPT-oss 120BOpenAI
37.1Devin
62.4080.90--14
23Qwen3.5 122B A10BAlibaba
37.0Devin
72.0086.60$1.1012017
24GLM-4.7Z AI
36.4Devin
73.8085.70$0.947411
25Grok 3xAI
35.3Roo Code
49.0084.60$6.006511
26Grok 4.20xAI
35.3Devin
75.80---5
27Kimi K2.5 ThinkingMoonshotAI
34.7Devin
73.80---5
28Llama 4 MaverickMeta
34.7Devin
70.4069.80$0.4911415
29Llama 4 BehemothMeta
33.4Devin
71.50---5
30GPT-5 (medium)OpenAI
33.3Devin
-88.10$3.44894
31Gemini 3.1 Flash-LiteGoogle
33.3Devin
66.8086.90$0.257113
32Grok 4.1xAI
32.9Devin
71.20---5
33gemma-3-27b-itGoogle
32.5Devin
65.30---9
34o3OpenAI
32.5Devin
70.10-$3.501016
35command-r-plus-08-2024Cohere
31.8Devin
66.40---9
36Nova PremierAmazon
31.4Devin
69.40-$5.00286
37Mistral Large 3Mistral
31.1Devin
69.70-$0.75456
38Llama 4 ScoutMeta
31.0Devin
68.90-$0.291296
39Command ACohere
30.9Devin
68.80-$4.38376
40Command A ReasoningCohere
30.9Devin
68.50---5
41Claude Opus 4.1Anthropic
29.2Claude Code
74.5080.90$15.00307
42GPT-5.1OpenAI
29.0OpenClaw
76.3088.10$3.44777
43Gemini 2.5 ProGoogle
29.0Devin
67.30-$3.441288
44Llama 3.3 70B InstructMeta
28.3Devin
66.70---5
45Kimi K2-Thinking-0905MoonshotAI
28.2Devin
71.3084.50$0.471028
46MiniMax-M2.1MiniMax
27.5Devin
67.0081.00$0.535611
47gpt-oss-20B (high)OpenAI
27.3Devin
64.80-$0.093016
48Jamba 1.5AI21 Labs
26.9Devin
64.70---5
49Mistral Small 4Mistral
26.9Devin
64.90-$0.261336
50Nova ProAmazon
26.8Devin
64.30-$1.40-6
51Jamba 2 LargeAI21 Labs
26.6Devin
64.10---5
52Arctic 2 LatestSnowflake
26.5Devin
64.20---5
53Yi-2 Large01.AI
26.4Devin
63.80---5
54GLM-4.6Z AI
26.1OpenClaw
68.0081.00$1.00279
55Jamba 2 MiniAI21 Labs
26.0Devin
63.50---5
56Arctic 2Snowflake
25.9Devin
63.90---5
57Granite 3.2IBM
25.9Devin
63.10---5
58InternLM3 20BShanghai AI Lab
25.9Devin
63.20---5
59Granite 3.3IBM
25.6Devin
62.40---5
60Qwen 3.5 14BAlibaba
25.4Devin
62.70---5
61Nemotron Cascade 2NVIDIA
24.5Devin
62.10---5
62GPT-5.1 ThinkingOpenAI
24.3OpenClaw
76.3088.10$1.25576
63InternLM3 8BShanghai AI Lab
24.2Devin
61.90---5
64Ministral 3 14BMistral
24.0Devin
61.50-$0.201146
65Phi-4 14BMicrosoft
23.6Devin
60.90---5
66Baichuan 4 13BBaichuan Intelligent
23.3Devin
60.30---5
67GPT-5.1 (high)OpenAI
22.9Roo Code
-88.10$3.44834
68Yi-2 34B01.AI
22.9Devin
59.40---5
69StableLM LatestStability AI
22.6Devin
59.10---5
70StableLM 2 12BStability AI
22.2Devin
58.70---5
71Claude Opus 4Anthropic
21.9Claude Code
72.5079.60$15.00528
72Nemotron Ultra 253BNvidiaNVIDIA
21.9Roo Code
-76.00--6
73Baichuan 3 13BBaichuan Intelligent
21.8Devin
56.80---5
74Phi-4 MiniMicrosoft Azure
21.8Devin
57.20--436
75Nemotron 3 NanoNVIDIA
20.9Devin
55.90---5
76Gemma 3 12BGoogle
20.7Devin
55.40--306
77Phi-4 7BMicrosoft
19.8Devin
54.30---5
78Mistral LargeMistral
19.6Roo Code
-43.90--9
79Ministral 3 3BMistral
19.1Devin
52.10-$0.102546
80Qwen3.5 9BAlibaba
17.6Devin
50.20-$0.081886
81Amazon Nova Premier 1.0 (2025-04-30)Amazon
0.0
42.40---1
82Amazon Q Developer Agent (v20240430-dev)Amazon
0.0
25.60---2
83Amazon Q Developer Agent (v20240719-dev)Amazon
0.0
38.80---2
84Amazon Q Developer Agent (v20241202-dev)Amazon
0.0
55.00---2
85Amazon Q Developer Agent (v20250405-dev)Amazon
0.0
65.40---2
86Apertus 70B InstructSwiss AI Initiative
0.0
--$1.34621
87Apertus 8B InstructSwiss AI Initiative
0.0
--$0.131311
88Apriel-v1.6-15B-ThinkerServiceNow
0.0
---741
89Claude 3 HaikuAnthropic
0.0
--$0.501321
90Claude 3.5 HaikuAnthropic
0.0
--$1.60-1
91Claude 3.7 SonnetAnthropic
0.0
--$6.00-1
92Claude 4 SonnetAnthropic
0.0
--$6.00431
93Claude 4.5 HaikuAnthropic
0.0
--$2.00911
94Claude 4.5 SonnetAnthropic
0.0
--$6.00421
95Claude Opus 4.6 (max)Anthropic
0.0
--$10.00431
96Claude Sonnet 4.6 (Non-reasoning, Low Effort)Anthropic
0.0
--$6.00431
97Claude Sonnet 4.6 (max)Anthropic
0.0
--$6.00681
98DeepHermes 3 - Llama-3.1 8BNous Research
0.0
----1
99DeepHermes 3 - Mistral 24BNous Research
0.0
----1
100DeepSeek R1 (Jan)DeepSeek
0.0
--$2.36-1

Scores updated daily · Benchmark-derived · Not affiliated with any model vendor