Leaderboard Overview

See how leading models stack up across text, image, vision, and beyond. This page gives a snapshot of each Arena; deeper insights are available in each Arena's dedicated tab.

Arena Overview

gemini-3-pro | 1 | 3 | 1 | 3 | 3 | 1 | 3 | 3
grok-4.1-thinking | 2 | 9 | 4 | 6 | 10 | 6 | 12 | 14
gemini-3-flash | 3 | 6 | 5 | 9 | 2 | 2 | 5 | 6
Anthropic claude-opus-4-5-20251101-thinking-32k | 4 | 2 | 2 | 1 | 6 | 5 | 1 | 2
Anthropic claude-opus-4-5-20251101 | 5 | 1 | 3 | 4 | 5 | 3 | 2 | 1
grok-4.1 | 6 | 18 | 8 | 13 | 18 | 14 | 16 | 13
gemini-3-flash (thinking-minimal) | 7 | 15 | 9 | 11 | 7 | 9 | 11 | 10
gpt-5.1-high | 8 | 8 | 11 | 14 | 4 | 10 | 9 | 9
gemini-2.5-pro | 9 | 12 | 13 | 24 | 11 | 4 | 10 | 11
Anthropic claude-sonnet-4-5-20250929-thinking-32k | 10 | 4 | 6 | 2 | 8 | 11 | 4 | 4
Anthropic claude-opus-4-1-20250805-thinking-16k | 11 | 10 | 7 | 5 | 12 | 8 | 6 | 5
Anthropic claude-sonnet-4-5-20250929 | 12 | 7 | 10 | 7 | 21 | 7 | 7 | 7
gpt-4.5-preview-2025-02-27 | 13 | 38 | 34 | 41 | 44 | 13 | 15 | 19
Anthropic claude-opus-4-1-20250805 | 14 | 17 | 12 | 8 | 17 | 12 | 8 | 8
gpt-5.2-high | 15 | 5 | 14 | 12 | 1 | 41 | 14 | 20
chatgpt-4o-latest-20250326 | 16 | 43 | 18 | 32 | 55 | 16 | 19 | 26
gpt-5.1 | 17 | 13 | 16 | 17 | 37 | 18 | 18 | 17
gpt-5-high | 18 | 16 | 20 | 23 | 14 | 44 | 30 | 46
o3-2025-04-16 | 19 | 21 | 29 | 39 | 9 | 42 | 45 | 52
Qwen qwen3-max-preview | 20 | 11 | 15 | 15 | 13 | 30 | 17 | 15
grok-4-1-fast-reasoning | 21 | 27 | 33 | 43 | 49 | 21 | 46 | 48
Baidu ernie-5.0-preview-1103 | 22 | 44 | 31 | 34 | 46 | 20 | 52 | 35
MoonshotAI kimi-k2-thinking-turbo | 23 | 20 | 21 | 16 | 23 | 27 | 22 | 32
glm-4.6 | 24 | 23 | 25 | 30 | 19 | 25 | 20 | 28
gpt-5-chat | 25 | 25 | 22 | 36 | 41 | 40 | 27 | 27
Qwen qwen3-max-2025-09-23 | 26 | 40 | 24 | 19 | 16 | 32 | 26 | 29
deepseek-v3.2-exp | 27 | 39 | 23 | 28 | 35 | 24 | 29 | 21
Anthropic claude-opus-4-20250514-thinking-16k | 28 | 24 | 17 | 10 | 30 | 15 | 13 | 12
Qwen qwen3-235b-a22b-instruct-2507 | 29 | 19 | 19 | 25 | 20 | 49 | 23 | 24
deepseek-v3.2-thinking | 30 | 47 | 30 | 26 | 27 | 19 | 25 | 33
deepseek-v3.2-exp-thinking | 31 | 30 | 26 | 21 | 25 | 33 | 24 | 31
grok-4-fast-chat | 32 | 41 | 45 | 44 | 28 | 43 | 44 | 38
deepseek-v3.2 | 33 | 42 | 27 | 40 | 24 | 36 | 34 | 25
deepseek-r1-0528 | 34 | 49 | 40 | 33 | 64 | 37 | 53 | 55
MoonshotAI kimi-k2-0905-preview | 35 | 46 | 37 | 29 | 38 | 46 | 59 | 60
deepseek-v3.1 | 36 | 35 | 42 | 51 | 36 | 34 | 42 | 36
MoonshotAI kimi-k2-0711-preview | 37 | 51 | 44 | 38 | 70 | 58 | 69 | 65
mistral-large-3 | 38 | 50 | 35 | 22 | 32 | 61 | 39 | 44
deepseek-v3.1-thinking | 39 | 34 | 36 | 42 | 29 | 23 | 21 | 16
deepseek-v3.1-terminus | 40 | - | 50 | 60 | 63 | 22 | 51 | 49
Qwen qwen3-vl-235b-a22b-instruct | 41 | 22 | 32 | 31 | 43 | 67 | 31 | 40
deepseek-v3.1-terminus-thinking | 42 | - | 28 | 37 | 39 | 51 | 28 | 22
gpt-4.1-2025-04-14 | 43 | 59 | 47 | 46 | 83 | 26 | 48 | 39
Anthropic claude-opus-4-20250514 | 44 | 36 | 39 | 35 | 58 | 17 | 33 | 18
mistral-medium-2508 | 45 | 54 | 43 | 49 | 54 | 50 | 47 | 50
grok-3-preview-02-24 | 46 | 57 | 49 | 56 | 80 | 29 | 38 | 34
grok-4-0709 | 47 | 33 | 54 | 63 | 15 | 31 | 49 | 47
glm-4.5 | 48 | 28 | 41 | 45 | 33 | 55 | 36 | 43
gemini-2.5-flash | 49 | 37 | 61 | 77 | 45 | 28 | 40 | 42
gemini-2.5-flash-preview-09-2025 | 50 | 26 | 52 | 75 | 31 | 47 | 43 | 45
Anthropic claude-haiku-4-5-20251001 | 51 | 31 | 38 | 18 | 66 | 52 | 35 | 30
grok-4-fast-reasoning | 52 | 48 | 64 | 58 | 48 | 53 | 58 | 54
Qwen qwen3-next-80b-a3b-instruct | 53 | 60 | 51 | 53 | 26 | 100 | 60 | 62
o1-2024-12-17 | 54 | 62 | 60 | 67 | 50 | 48 | 41 | 51
longcat-flash-chat | 55 | 45 | 48 | 20 | 22 | 87 | 54 | 69
Qwen qwen3-235b-a22b-no-thinking | 56 | 66 | 55 | 55 | 65 | 63 | 64 | 56
Anthropic claude-sonnet-4-20250514-thinking-32k | 57 | 32 | 46 | 27 | 52 | 35 | 32 | 23
Qwen qwen3-235b-a22b-thinking-2507 | 58 | 14 | 53 | 57 | 57 | 59 | 56 | 59
deepseek-r1 | 59 | 67 | 57 | 54 | 47 | 57 | 50 | 61
Qwen qwen3-vl-235b-a22b-thinking | 60 | 29 | 56 | 48 | 42 | 79 | 62 | 58
gpt-5-mini-high | 61 | 55 | 66 | 70 | 40 | 90 | 67 | 82
deepseek-v3-0324 | 62 | 69 | 67 | 74 | 82 | 38 | 68 | 67
Tencent hunyuan-vision-1.5-thinking | 63 | - | 59 | 61 | - | 65 | 57 | 63
o4-mini-2025-04-16 | 64 | 52 | 68 | 68 | 34 | 81 | 74 | 89
mai-1-preview | 65 | 58 | 69 | 71 | 62 | 71 | 70 | 64
Anthropic claude-sonnet-4-20250514 | 66 | 64 | 58 | 52 | 72 | 45 | 55 | 41
o1-preview | 67 | 78 | 76 | 79 | 73 | 64 | 65 | 77
Anthropic claude-3-7-sonnet-20250219-thinking-32k | 68 | 53 | 62 | 50 | 76 | 39 | 37 | 37
Qwen qwen3-coder-480b-a35b-instruct | 69 | 79 | 63 | 47 | 81 | 68 | 61 | 57
Tencent hunyuan-t1-20250711 | 70 | 63 | 72 | 91 | 56 | 54 | 66 | 72
mistral-medium-2505 | 71 | 77 | 73 | 69 | 100 | 70 | 78 | 68
Qwen qwen3-30b-a3b-instruct-2507 | 72 | 71 | 65 | 59 | 74 | 94 | 75 | 75
gpt-4.1-mini-2025-04-14 | 73 | 76 | 70 | 62 | 97 | 77 | 72 | 71
Tencent hunyuan-turbos-20250416 | 74 | 97 | 80 | 95 | 102 | 66 | 85 | 79
gemini-2.5-flash-lite-preview-09-2025-no-thinking | 75 | 73 | 78 | 94 | 86 | 69 | 77 | 70
gemini-2.5-flash-lite-preview-06-17-thinking | 76 | 84 | 84 | 108 | 85 | 60 | 71 | 74
Qwen qwen3-235b-a22b | 77 | 75 | 77 | 65 | 60 | 91 | 82 | 76
Qwen qwen2.5-max | 78 | 87 | 82 | 89 | 88 | 73 | 83 | 73
Anthropic claude-3-5-sonnet-20241022 | 79 | 85 | 74 | 66 | 103 | 62 | 73 | 66
Anthropic claude-3-7-sonnet-20250219 | 80 | 74 | 75 | 72 | 93 | 56 | 63 | 53
glm-4.5-air | 81 | 70 | 79 | 73 | 67 | 92 | 79 | 78
Qwen qwen3-next-80b-a3b-thinking | 82 | 72 | 81 | 76 | 61 | 99 | 81 | 85
Minimax minimax-m1 | 83 | 81 | 83 | 78 | 59 | 96 | 88 | 86
gemma-3-27b-it | 84 | 106 | 98 | 132 | 111 | 75 | 92 | 90
o3-mini-high | 85 | 68 | 71 | 64 | 51 | 104 | 76 | 84
grok-3-mini-high | 86 | 61 | 86 | 100 | 68 | 89 | 80 | 80
gemini-2.0-flash-001 | 87 | 98 | 102 | 128 | 96 | 76 | 89 | 91
deepseek-v3 | 88 | 100 | 113 | 102 | 118 | 74 | 94 | 83
grok-3-mini-beta | 89 | 80 | 92 | 105 | 84 | 82 | 84 | 88
mistral-small-2506 | 90 | 107 | 88 | 82 | 104 | 95 | 98 | 93
gpt-oss-120b | 91 | 92 | 101 | 98 | 75 | 142 | 107 | 134
gemini-2.0-flash-lite-preview-02-05 | 92 | 109 | 114 | 150 | 109 | 78 | 105 | 106
glm-4.5v | 93 | 56 | 90 | 87 | 89 | 106 | 96 | 115
Cohere command-a-03-2025 | 94 | 103 | 97 | 99 | 120 | 85 | 95 | 87
PrimeIntellect intellect-3 | 95 | 86 | 91 | 83 | 78 | 126 | 101 | 101
gemini-1.5-pro-002 | 96 | 105 | 112 | 138 | 106 | 72 | 97 | 97
amazon-nova-experimental-chat-10-20 | 97 | 91 | 87 | 84 | 69 | 143 | 86 | 96
o3-mini | 98 | 88 | 93 | 80 | 77 | 114 | 93 | 94
AntGroup ling-flash-2.0 | 99 | 90 | 96 | 81 | 95 | 152 | 115 | 129
Tencent hunyuan-turbos-20250226 | 100 | - | 100 | 90 | 129 | 115 | 90 | 98
Minimax minimax-m2 | 101 | 104 | 95 | 112 | 92 | 132 | 99 | 105
Stepfun step-3 | 102 | 96 | 85 | 86 | 87 | 111 | 91 | 100
Nvidia llama-3.1-nemotron-ultra-253b-v1 | 103 | - | 89 | 97 | 79 | 86 | 87 | 109
amazon-nova-experimental-chat-10-09 | 104 | - | 106 | 101 | - | 123 | 114 | 117
Qwen qwen3-32b | 105 | 65 | 99 | 85 | 53 | 110 | 106 | 95
gpt-4o-2024-05-13 | 106 | 124 | 122 | 122 | 125 | 83 | 111 | 128
Qwen qwen-plus-0125 | 107 | 82 | 111 | 111 | 110 | 103 | 103 | 92
glm-4-plus-0111 | 108 | 123 | 134 | 158 | 131 | 101 | 118 | 116
Anthropic claude-3-5-sonnet-20240620 | 109 | 102 | 105 | 92 | 105 | 113 | 100 | 99
gemma-3-12b-it | 110 | 154 | 126 | 164 | 113 | 88 | 113 | 103
Nvidia nvidia-llama-3.3-nemotron-super-49b-v1.5 | 111 | 83 | 103 | 93 | 71 | 109 | 112 | 113
nova-2-lite | 112 | 89 | 94 | 88 | 101 | 146 | 102 | 108
gpt-5-nano-high | 113 | 94 | 109 | 110 | 98 | 170 | 108 | 126
Tencent hunyuan-turbo-0110 | 114 | - | 115 | 117 | 151 | 118 | 124 | 110
o1-mini | 115 | 99 | 107 | 103 | 91 | 144 | 104 | 107
Meta llama-3.1-405b-instruct-bf16 | 116 | 126 | 118 | 114 | 115 | 117 | 126 | 135
gpt-4o-2024-08-06 | 117 | 125 | 136 | 133 | 122 | 93 | 117 | 118
Qwen qwq-32b | 118 | 95 | 108 | 106 | 90 | 125 | 109 | 119
grok-2-2024-08-13 | 119 | 119 | 138 | 134 | 138 | 102 | 130 | 124
gemini-advanced-0514 | 120 | 142 | 139 | 154 | 128 | 80 | 122 | 141
Meta llama-3.1-405b-instruct-fp8 | 121 | 114 | 123 | 125 | 114 | 112 | 125 | 144
Stepfun step-2-16k-exp-202412 | 122 | 111 | 121 | 119 | 116 | 84 | 119 | 112
01.AI yi-lightning | 123 | 108 | 119 | 121 | 123 | 122 | 132 | 137
Meta llama-4-maverick-17b-128e-instruct | 124 | 110 | 120 | 116 | 112 | 107 | 123 | 123
Qwen qwen3-30b-a3b | 125 | 101 | 116 | 107 | 94 | 138 | 127 | 114
Nvidia llama-3.3-nemotron-49b-super-v1 | 126 | - | 104 | 129 | - | 124 | 110 | 122
Tencent hunyuan-large-2025-02-10 | 127 | 113 | 131 | 127 | 136 | 116 | 120 | 81
gpt-4-turbo-2024-04-09 | 128 | 138 | 149 | 147 | 134 | 97 | 137 | 148
Meta llama-4-scout-17b-16e-instruct | 129 | 128 | 130 | 130 | 117 | 129 | 141 | 130
Anthropic claude-3-5-haiku-20241022 | 130 | 120 | 117 | 109 | 147 | 119 | 131 | 111
deepseek-v2.5-1210 | 131 | 130 | 132 | 113 | 137 | 105 | 121 | 120
Anthropic claude-3-opus-20240229 | 132 | 117 | 133 | 140 | 121 | 133 | 128 | 132
gemini-1.5-pro-001 | 133 | 116 | 135 | 146 | 135 | 98 | 133 | 102
gpt-4.1-nano-2025-04-14 | 134 | 118 | 129 | 115 | 155 | 108 | 140 | 131
AntGroup ring-flash-2.0 | 135 | 93 | 110 | 96 | 99 | 165 | 116 | 125
Stepfun step-1o-turbo-202506 | 136 | 135 | 127 | 137 | 126 | 131 | 129 | 104
Meta llama-3.3-70b-instruct | 137 | 137 | 142 | 149 | 133 | 136 | 149 | 153
gemma-3n-e4b-it | 138 | 150 | 150 | 168 | 164 | 121 | 154 | 149
gpt-oss-20b | 139 | 115 | 141 | 118 | 107 | 178 | 157 | 156
glm-4-plus | 140 | 131 | 146 | 143 | 144 | 130 | 139 | 138
Qwen qwen-max-0919 | 141 | 133 | 147 | 142 | 139 | 139 | 138 | 136
gpt-4o-mini-2024-07-18 | 142 | 148 | 155 | 145 | 154 | 127 | 147 | 139
Qwen qwen2.5-plus-1127 | 143 | 112 | 137 | 136 | 124 | 151 | 143 | 154
athene-v2-chat | 144 | 121 | 128 | 123 | 119 | 171 | 136 | 143
mistral-large-2407 | 145 | 132 | 145 | 141 | 142 | 135 | 142 | 158
olmo-3-32b-think | 146 | 147 | 124 | 120 | 108 | 163 | 135 | 140
gpt-4-1106-preview | 147 | 146 | 152 | 155 | 127 | 137 | 145 | 161
gpt-4-0125-preview | 148 | 152 | 157 | 160 | 130 | 145 | 152 | 155
mercury | 149 | - | 140 | 126 | - | 177 | 160 | 152
Tencent hunyuan-standard-2025-02-10 | 150 | 144 | 154 | 159 | 140 | 150 | 156 | 127
gemini-1.5-flash-002 | 151 | 151 | 161 | 166 | 141 | 120 | 151 | 142
grok-2-mini-2024-08-13 | 152 | 141 | 159 | 156 | 157 | 154 | 155 | 147
deepseek-v2.5 | 153 | 136 | 143 | 124 | 143 | 157 | 150 | 150
magistral-medium-2506 | 154 | 143 | 125 | 104 | 146 | 134 | 134 | 121
mistral-large-2411 | 155 | 158 | 151 | 148 | 148 | 149 | 146 | 157
athene-70b-0725 | 156 | 134 | 153 | 144 | 168 | 155 | 161 | 164
mistral-small-3.1-24b-instruct-2503 | 157 | 122 | 144 | 131 | 152 | 153 | 144 | 133
gemma-3-4b-it | 158 | 166 | 169 | 194 | 171 | 147 | 168 | 151
Qwen qwen2.5-72b-instruct | 159 | 139 | 148 | 139 | 132 | 168 | 148 | 146
Nvidia llama-3.1-nemotron-70b-instruct | 160 | 140 | 156 | 162 | 153 | 148 | 158 | 172
Tencent hunyuan-large-vision | 161 | 129 | 160 | 135 | 145 | 140 | 153 | 145
Meta llama-3.1-70b-instruct | 162 | 157 | 165 | 157 | 163 | 164 | 163 | 163
amazon-nova-pro-v1.0 | 163 | 156 | 163 | 152 | 162 | 182 | 162 | 160
jamba-1.5-large | 164 | 149 | 171 | 170 | 178 | 162 | 169 | 190
ibm-granite-h-small | 165 | 127 | 164 | 161 | 150 | 173 | 166 | 165
gemma-2-27b-it | 166 | 170 | 172 | 178 | 180 | 128 | 167 | 162
reka-core-20240904 | 167 | 145 | 176 | 167 | 176 | 156 | 173 | 175
Nvidia llama-3.1-nemotron-51b-instruct | 168 | 171 | 174 | 173 | 160 | 161 | 174 | 181
llama-3.1-tulu-3-70b | 169 | - | 180 | 174 | 166 | 172 | 165 | 179
gpt-4-0314 | 170 | 165 | 158 | 163 | 149 | 169 | 159 | 177
gemini-1.5-flash-001 | 171 | 161 | 166 | 172 | 170 | 160 | 171 | 159
Anthropic claude-3-sonnet-20240229 | 172 | 159 | 173 | 165 | 173 | 174 | 170 | 171
gemma-2-9b-it-simpo | 173 | 184 | 182 | 197 | 202 | 141 | 181 | 167
Nvidia nemotron-4-340b-instruct | 174 | 169 | 175 | 177 | 172 | 180 | 175 | 168
Cohere command-r-plus-08-2024 | 175 | 175 | 190 | 191 | 184 | 159 | 179 | 173
Meta llama-3-70b-instruct | 176 | 177 | 177 | 179 | 169 | 167 | 176 | 193
gpt-4-0613 | 177 | 174 | 167 | 171 | 158 | 158 | 164 | 176
mistral-small-24b-instruct-2501 | 178 | 167 | 170 | 169 | 167 | 184 | 178 | 174
glm-4-0520 | 179 | 172 | 179 | 175 | 175 | 176 | 177 | 184
reka-flash-20240904 | 180 | 155 | 186 | 186 | 181 | 175 | 183 | 187
Qwen qwen2.5-coder-32b-instruct | 181 | 153 | 162 | 151 | 159 | 194 | 172 | 169
Cohere c4ai-aya-expanse-32b | 182 | 163 | 184 | 185 | 182 | 183 | 182 | 166
gemma-2-9b-it | 183 | 183 | 192 | 201 | 193 | 166 | 187 | 186
deepseek-coder-v2 | 184 | 162 | 168 | 153 | 161 | 195 | 180 | 170
Cohere command-r-plus | 185 | 179 | 194 | 202 | 196 | 181 | 190 | 189
Qwen qwen2-72b-instruct | 186 | 168 | 181 | 183 | 156 | 186 | 189 | 191
Anthropic claude-3-haiku-20240307 | 187 | 173 | 187 | 181 | 183 | 189 | 186 | 185
amazon-nova-lite-v1.0 | 188 | 164 | 183 | 180 | 177 | 187 | 185 | 178
gemini-1.5-flash-8b-001 | 189 | 178 | 191 | 200 | 186 | 179 | 191 | 183
Azure phi-4 | 190 | 160 | 178 | 176 | 165 | 193 | 184 | 182
olmo-2-0325-32b-instruct | 191 | - | 188 | 190 | 185 | 185 | 194 | 196
Cohere command-r-08-2024 | 192 | 186 | 193 | 189 | 201 | 191 | 193 | 188
mistral-large-2402 | 193 | 188 | 189 | 184 | 179 | 190 | 192 | 197
amazon-nova-micro-v1.0 | 194 | 176 | 197 | 187 | 188 | 198 | 196 | 195
jamba-1.5-mini | 195 | 198 | 200 | 203 | 220 | 192 | 205 | 209
ministral-8b-2410 | 196 | 180 | 195 | 196 | 194 | 188 | 199 | 192
Qwen qwen1.5-110b-chat | 197 | 187 | 196 | 192 | 189 | 204 | 195 | 201
gemini-pro-dev-api | 198 | 209 | 206 | 220 | 219 | 196 | 204 | 205
Tencent hunyuan-standard-256k | 199 | - | 185 | 182 | 174 | 201 | 188 | 180
Qwen qwen1.5-72b-chat | 200 | 182 | 199 | 198 | 200 | 207 | 201 | 198
reka-flash-21b-20240226-online | 201 | 190 | 201 | 193 | 197 | 212 | 207 | 212
mixtral-8x22b-instruct-v0.1 | 202 | 195 | 198 | 195 | 187 | 206 | 198 | 211
Cohere command-r | 203 | 197 | 219 | 218 | 221 | 200 | 208 | 199
reka-flash-21b-20240226 | 204 | 196 | 203 | 204 | 206 | 214 | 214 | 210
gpt-3.5-turbo-0125 | 205 | 204 | 208 | 199 | 207 | 208 | 200 | 208
Cohere c4ai-aya-expanse-8b | 206 | 185 | 205 | 209 | 205 | 205 | 206 | 194
Meta llama-3-8b-instruct | 207 | 200 | 215 | 211 | 215 | 199 | 212 | 213
mistral-medium | 208 | 189 | 202 | 205 | 190 | 203 | 202 | 207
llama-3.1-tulu-3-8b | 209 | - | 212 | 210 | 198 | 202 | 203 | 202
gemini-pro | 210 | - | 207 | 215 | 210 | 211 | 197 | -
HuggingFace zephyr-orpo-141b-A35b-v0.1 | 211 | 212 | 218 | 217 | 208 | 217 | 213 | 223
01.AI yi-1.5-34b-chat | 212 | 193 | 211 | 216 | 195 | 220 | 215 | 214
Meta llama-3.1-8b-instruct | 213 | 203 | 213 | 208 | 216 | 213 | 210 | 203
granite-3.1-8b-instruct | 214 | 181 | 204 | 188 | 217 | 216 | 209 | 200
Qwen qwen1.5-32b-chat | 215 | 191 | 214 | 207 | 203 | 243 | 217 | 204
gpt-3.5-turbo-1106 | 216 | 199 | 209 | 206 | 204 | 224 | 211 | 221
gemma-2-2b-it | 217 | 213 | 230 | 237 | 226 | 210 | 223 | 220
Azure phi-3-medium-4k-instruct | 218 | 201 | 220 | 224 | 192 | 225 | 220 | 218
mixtral-8x7b-instruct-v0.1 | 219 | 205 | 221 | 222 | 218 | 219 | 219 | 224
dbrx-instruct-preview | 220 | 207 | 217 | 213 | 212 | 218 | 216 | 219
InternLM internlm2_5-20b-chat | 221 | 192 | 210 | 214 | 199 | 245 | 218 | 216
Qwen qwen1.5-14b-chat | 222 | 206 | 223 | 221 | 223 | 234 | 225 | 217
Azure wizardlm-70b | 223 | - | 237 | 240 | 228 | 197 | 226 | 225
granite-3.0-8b-instruct | 224 | 202 | 222 | 219 | 211 | 235 | 224 | 215
01.AI yi-34b-chat | 225 | 220 | 231 | 233 | 234 | 222 | 232 | 229
deepseek-llm-67b-chat | 226 | - | 235 | 228 | 231 | 244 | 227 | 226
OpenChat openchat-3.5-0106 | 227 | 218 | 229 | 225 | 229 | 228 | 229 | 230
OpenChat openchat-3.5 | 228 | 217 | 236 | 235 | 244 | 209 | 233 | 222
granite-3.1-2b-instruct | 229 | 194 | 216 | 212 | 209 | 226 | 221 | 206
Snowflake snowflake-arctic-instruct | 230 | 216 | 228 | 226 | 225 | 231 | 235 | 248
gemma-1.1-7b-it | 231 | 214 | 225 | 227 | 230 | 227 | 231 | 232
tulu-2-dpo-70b | 232 | - | 227 | 229 | 236 | 229 | 222 | 228
openhermes-2.5-mistral-7b | 233 | - | 232 | 242 | 235 | 221 | 234 | 242
vicuna-33b | 234 | 223 | 241 | 239 | 246 | 215 | 238 | 247
starling-lm-7b-beta | 235 | 211 | 224 | 223 | 227 | 249 | 236 | 231
Azure phi-3-small-8k-instruct | 236 | 215 | 226 | 234 | 214 | 240 | 230 | 234
Meta llama-2-70b-chat | 237 | 227 | 242 | 245 | 238 | 248 | 240 | 243
starling-lm-7b-alpha | 238 | 226 | 240 | 232 | 240 | 237 | 239 | 236
Meta llama-3.2-3b-instruct | 239 | 208 | 239 | 246 | 224 | 230 | 237 | 235
nous-hermes-2-mixtral-8x7b-dpo | 240 | - | 260 | 247 | 254 | 236 | 256 | 250
Qwen qwq-32b-preview | 241 | 219 | 238 | 248 | 191 | 250 | 228 | 227
granite-3.0-2b-instruct | 242 | 210 | 233 | 231 | 222 | 255 | 243 | 237
Nvidia llama2-70b-steerlm-chat | 243 | - | 252 | 262 | 247 | 247 | 245 | 264
solar-10.7b-instruct-v1.0 | 244 | - | 244 | 244 | 251 | 233 | 250 | -
dolphin-2.2.1-mistral-7b | 245 | - | 248 | - | 243 | 239 | 244 | -
mpt-30b-chat | 246 | - | 246 | 252 | 255 | 242 | 241 | -
mistral-7b-instruct-v0.2 | 247 | 225 | 245 | 243 | 239 | 251 | 249 | 246
Azure wizardlm-13b | 248 | - | 266 | 260 | 264 | 232 | 248 | 238
falcon-180b-chat | 249 | - | 261 | - | - | 223 | 242 | -
Qwen qwen1.5-7b-chat | 250 | 222 | 251 | 230 | 245 | 267 | 246 | 233
Azure phi-3-mini-4k-instruct-june-2024 | 251 | 221 | 234 | 236 | 213 | 256 | 247 | 254
Meta llama-2-13b-chat | 252 | 229 | 255 | 254 | 250 | 258 | 255 | 241
vicuna-13b | 253 | 231 | 259 | 253 | 260 | 246 | 252 | 239
Qwen qwen-14b-chat | 254 | - | 254 | 238 | 242 | 253 | 251 | 249
Meta codellama-34b-instruct | 255 | - | 257 | 256 | 252 | 265 | 258 | 257
palm-2 | 256 | - | 253 | 258 | 249 | 264 | 253 | 244
gemma-7b-it | 257 | 230 | 249 | 250 | 248 | 254 | 259 | 252
HuggingFace zephyr-7b-beta | 258 | 233 | 263 | 259 | 259 | 238 | 264 | 253
Azure phi-3-mini-128k-instruct | 259 | 236 | 258 | 257 | 237 | 263 | 261 | 262
Azure phi-3-mini-4k-instruct | 260 | 224 | 250 | 241 | 233 | 271 | 254 | 255
HuggingFace zephyr-7b-alpha | 261 | - | 262 | 251 | - | 252 | 260 | -
guanaco-33b | 262 | - | 270 | 268 | 262 | 241 | 272 | -
stripedhyena-nous-7b | 263 | - | 268 | 267 | 257 | 259 | 265 | 263
HuggingFace smollm2-1.7b-instruct | 264 | - | 247 | 255 | 232 | 268 | 257 | 245
Meta codellama-70b-instruct | 265 | - | 243 | - | - | - | 262 | -
vicuna-7b | 266 | - | 269 | 266 | 266 | 262 | 268 | 240
gemma-1.1-2b-it | 267 | 234 | 256 | 249 | 253 | 260 | 263 | 251
Meta llama-3.2-1b-instruct | 268 | 237 | 264 | 261 | 241 | 266 | 266 | 256
mistral-7b-instruct | 269 | 238 | 265 | 263 | 261 | 257 | 267 | 258
Meta llama-2-7b-chat | 270 | 228 | 271 | 270 | 256 | 270 | 269 | 261
gemma-2b-it | 271 | - | 267 | 264 | 263 | 269 | 271 | 259
Qwen qwen1.5-4b-chat | 272 | 235 | 272 | 265 | 258 | 274 | 270 | 260
olmo-7b-instruct | 273 | 232 | 273 | 269 | 265 | 278 | 275 | -
koala-13b | 274 | - | 276 | 273 | 272 | 275 | 276 | -
alpaca-13b | 275 | - | 281 | 279 | 269 | 261 | 277 | -
gpt4all-13b-snoozy | 276 | - | 274 | - | 268 | 272 | 273 | -
mpt-7b-chat | 277 | - | 277 | 272 | 270 | 273 | 278 | -
chatglm3-6b | 278 | - | 275 | 271 | 267 | 276 | 274 | 265
RWKV RWKV-4-Raven-14B | 279 | - | 280 | 274 | 271 | 280 | 282 | -
chatglm2-6b | 280 | - | 278 | 277 | 274 | 279 | 280 | -
oasst-pythia-12b | 281 | - | 279 | 275 | 275 | 277 | 279 | -
chatglm-6b | 282 | - | 282 | 276 | 273 | 283 | 281 | -
fastchat-t5-3b | 283 | - | 285 | 281 | 277 | 281 | 283 | -
dolly-v2-12b | 284 | - | 283 | 280 | 276 | 282 | 284 | -
Meta llama-13b | 285 | - | 286 | 282 | 278 | 284 | 285 | -
Stability stablelm-tuned-alpha-7b | 286 | - | 284 | 278 | 279 | 285 | 286 | -