For frontier AI news

Leaderboard Overview

See how leading models stack up across text, image, vision, and beyond. This page gives you a snapshot of each Arena, you can explore deeper insights in their dedicated tabs. Learn more about it here.

Arena Overview

Scroll to the right to see full stats of each model

First Place
Second Place
Third Place
gemini-3-pro
1
3
2
3
2
1
3
2
grok-4.1-thinking
2
9
4
6
8
10
13
15
gemini-3-flash
3
5
5
8
1
3
6
7
Anthropicclaude-opus-4-5-20251101-thinking-32k
4
2
1
1
3
4
1
1
grok-4.1
5
21
9
12
20
14
15
13
Anthropicclaude-opus-4-5-20251101
6
1
3
4
6
2
2
3
gemini-3-flash (thinking-minimal)
7
13
7
10
4
9
10
8
gpt-5.1-high
8
7
11
13
5
11
9
10
gemini-2.5-pro
9
14
13
27
10
5
11
11
Anthropicclaude-sonnet-4-5-20250929-thinking-32k
10
4
6
2
7
8
4
4
Anthropicclaude-opus-4-1-20250805-thinking-16k
11
8
8
5
12
7
5
5
Anthropicclaude-sonnet-4-5-20250929
12
6
10
7
26
6
7
6
Baiduernie-5.0-preview-1203
13
28
14
32
54
15
25
34
gpt-4.5-preview-2025-02-27
14
45
37
43
46
13
14
21
Anthropicclaude-opus-4-1-20250805
15
19
12
9
19
12
8
9
glm-4.7
16
35
15
14
11
19
16
14
chatgpt-4o-latest-20250326
17
50
19
35
62
17
19
29
gpt-5-high
18
20
23
23
16
46
34
50
gpt-5.1
19
15
18
18
45
20
17
19
gpt-5.2-high
20
11
17
17
13
43
20
18
Qwen Iconqwen3-max-preview
21
10
16
16
14
30
18
16
o3-2025-04-16
22
24
31
44
9
44
47
57
grok-4-1-fast-reasoning
23
27
32
51
52
26
49
48
MoonshotAIkimi-k2-thinking-turbo
24
18
21
15
15
35
22
27
Baiduernie-5.0-preview-1103
25
44
36
33
44
21
51
38
gpt-5.2
26
12
33
31
22
61
26
31
gpt-5-chat
27
23
25
40
42
41
30
30
Qwen Iconqwen3-max-2025-09-23
28
46
26
19
18
32
31
32
glm-4.6
29
25
30
39
27
27
27
33
deepseek-v3.2-exp
30
43
24
26
34
24
29
22
Anthropicclaude-opus-4-20250514-thinking-16k
31
26
20
11
31
16
12
12
Qwen Iconqwen3-235b-a22b-instruct-2507
32
22
22
25
24
45
24
24
deepseek-v3.2-exp-thinking
33
31
28
22
25
36
28
35
grok-4-fast-chat
34
49
49
48
29
52
48
43
deepseek-v3.2
35
34
27
29
21
33
21
23
deepseek-v3.2-thinking
36
41
35
34
33
25
32
28
deepseek-r1-0528
37
55
43
37
69
40
57
60
MoonshotAIkimi-k2-0905-preview
38
52
40
30
39
51
64
64
deepseek-v3.1
39
39
45
55
38
38
45
39
MoonshotAIkimi-k2-0711-preview
40
57
46
41
76
63
75
71
deepseek-v3.1-thinking
41
40
39
46
30
22
23
17
deepseek-v3.1-terminus
42
-
54
65
68
23
56
52
Qwen Iconqwen3-vl-235b-a22b-instruct
43
33
34
36
50
67
36
41
deepseek-v3.1-terminus-thinking
44
-
29
42
40
53
33
26
mistral-large-3
45
56
38
28
37
68
40
51
gpt-4.1-2025-04-14
46
65
50
50
90
28
52
42
Anthropicclaude-opus-4-20250514
47
42
41
38
63
18
37
20
mistral-medium-2508
48
58
47
52
58
48
50
53
grok-3-preview-02-24
49
63
53
60
88
29
42
37
grok-4-0709
50
38
58
68
17
34
54
49
glm-4.5
51
29
44
49
35
56
39
45
gemini-2.5-flash
52
48
64
83
49
31
43
46
gemini-2.5-flash-preview-09-2025
53
30
57
81
32
47
46
47
Anthropicclaude-haiku-4-5-20251001
54
32
42
20
72
58
38
36
grok-4-fast-reasoning
55
54
69
64
48
55
63
59
o1-2024-12-17
56
69
65
73
51
50
44
55
Qwen Iconqwen3-next-80b-a3b-instruct
57
66
55
57
28
107
67
67
longcat-flash-chat
58
51
51
21
23
93
58
75
Qwen Iconqwen3-235b-a22b-no-thinking
59
72
59
58
70
66
69
61
Anthropicclaude-sonnet-4-20250514-thinking-32k
60
37
48
24
57
37
35
25
Qwen Iconqwen3-235b-a22b-thinking-2507
61
17
60
61
59
64
60
65
Xiaomimimo-v2-flash (non-thinking)
62
47
52
62
74
57
53
54
deepseek-r1
63
71
62
59
47
60
55
66
amazon-nova-experimental-chat-12-10
64
-
56
67
55
80
62
69
Qwen Iconqwen3-vl-235b-a22b-thinking
65
36
61
53
43
84
68
63
deepseek-v3-0324
66
75
72
80
89
39
74
73
gpt-5-mini-high
67
62
73
76
41
96
73
88
Tencenthunyuan-vision-1.5-thinking
68
-
63
66
-
69
61
68
o4-mini-2025-04-16
69
59
74
75
36
86
80
95
mai-1-preview
70
64
75
77
67
74
76
70
Anthropicclaude-sonnet-4-20250514
71
70
66
56
79
49
59
44
o1-preview
72
85
82
86
80
70
72
84
Anthropicclaude-3-7-sonnet-20250219-thinking-32k
73
60
67
54
83
42
41
40
Qwen Iconqwen3-coder-480b-a35b-instruct
74
83
68
47
87
71
65
62
Tencenthunyuan-t1-20250711
75
68
78
98
61
54
71
76
Minimaxminimax-m2.1-preview
76
16
70
45
56
92
66
56
mistral-medium-2505
77
84
79
74
107
73
83
74
Qwen Iconqwen3-30b-a3b-instruct-2507
78
78
71
63
84
100
81
80
gpt-4.1-mini-2025-04-14
79
82
76
69
105
82
78
77
Tencenthunyuan-turbos-20250416
80
103
86
102
109
72
92
85
gemini-2.5-flash-lite-preview-09-2025-no-thinking
81
77
84
101
92
76
84
78
gemini-2.5-flash-lite-preview-06-17-thinking
82
91
91
115
93
62
77
81
Qwen Iconqwen3-235b-a22b
83
81
83
70
65
97
88
82
Qwen Iconqwen2.5-max
84
93
89
95
95
77
89
79
Anthropicclaude-3-5-sonnet-20241022
85
94
80
72
110
65
79
72
Anthropicclaude-3-7-sonnet-20250219
86
80
81
78
101
59
70
58
glm-4.5-air
87
76
85
79
73
98
85
83
Qwen Iconqwen3-next-80b-a3b-thinking
88
79
88
82
66
105
87
91
Minimaxminimax-m1
89
87
90
85
64
102
95
92
gemma-3-27b-it
90
112
107
140
118
79
99
96
o3-mini-high
91
74
77
71
53
110
82
90
grok-3-mini-high
92
67
93
108
75
95
86
86
amazon-nova-experimental-chat-11-10
93
53
87
84
82
134
90
98
gemini-2.0-flash-001
94
105
109
135
104
81
96
97
deepseek-v3
95
108
120
109
126
78
101
89
grok-3-mini-beta
96
86
99
113
91
87
91
93
mistral-small-2506
97
114
94
89
111
101
104
100
PrimeIntellectintellect-3
98
92
97
91
71
121
107
106
glm-4.5v
99
61
96
94
97
111
103
118
gpt-oss-120b
100
98
108
105
81
150
114
141
gemini-2.0-flash-lite-preview-02-05
101
116
121
158
116
83
112
113
Coherecommand-a-03-2025
102
109
103
107
128
90
102
94
gemini-1.5-pro-002
103
113
119
146
113
75
105
103
o3-mini
104
95
100
87
85
119
100
101
amazon-nova-experimental-chat-10-20
105
89
98
90
77
149
94
104
Tencenthunyuan-turbos-20250226
106
-
105
96
137
122
97
105
amazon-nova-experimental-chat-10-09
107
-
110
106
-
130
120
123
AntGroupling-flash-2.0
108
99
102
88
103
160
122
136
Minimaxminimax-m2
109
111
101
119
100
137
106
111
Stepfunstep-3
110
102
92
93
94
117
98
108
Nvidiallama-3.1-nemotron-ultra-253b-v1
111
-
95
103
86
91
93
116
gpt-4o-2024-05-13
112
131
129
129
134
88
118
135
Qwen Iconqwen3-32b
113
73
106
92
60
116
113
102
Qwen Iconqwen-plus-0125
114
88
118
118
117
109
109
99
glm-4-plus-0111
115
130
145
166
139
106
125
124
Anthropicclaude-3-5-sonnet-20240620
116
110
113
100
112
120
108
107
Nvidianvidia-llama-3.3-nemotron-super-49b-v1.5
117
90
111
99
78
114
119
119
gemma-3-12b-it
118
161
135
172
121
94
121
109
Tencenthunyuan-turbo-0110
119
-
122
124
160
125
130
117
gpt-5-nano-high
120
101
116
117
106
177
115
133
nova-2-lite
121
96
104
97
115
153
111
114
o1-mini
122
106
114
110
99
151
110
115
Metallama-3.1-405b-instruct-bf16
123
133
125
122
124
124
133
142
gpt-4o-2024-08-06
124
132
143
141
130
99
124
126
Qwen Iconqwq-32b
125
100
115
114
98
132
116
125
grok-2-2024-08-13
126
126
146
143
145
108
136
131
gemini-advanced-0514
127
150
148
162
136
85
129
147
Metallama-3.1-405b-instruct-fp8
128
122
130
132
122
118
132
151
Stepfunstep-2-16k-exp-202412
129
118
128
126
125
89
126
121
01.AIyi-lightning
130
115
126
128
131
129
139
144
Metallama-4-maverick-17b-128e-instruct
131
117
127
123
119
113
131
128
Qwen Iconqwen3-30b-a3b
132
107
123
112
102
146
134
122
Nvidiallama-3.3-nemotron-49b-super-v1
133
-
112
136
-
131
117
130
Tencenthunyuan-large-2025-02-10
134
119
137
133
144
123
127
87
gpt-4-turbo-2024-04-09
135
146
157
155
142
103
145
156
Anthropicclaude-3-5-haiku-20241022
136
127
124
116
155
126
137
120
deepseek-v2.5-1210
137
137
139
120
146
112
128
127
Metallama-4-scout-17b-16e-instruct
138
134
138
138
123
136
148
137
gemini-1.5-pro-001
139
123
142
154
143
104
140
110
Anthropicclaude-3-opus-20240229
140
124
140
148
129
140
138
140
gpt-4.1-nano-2025-04-14
141
125
134
121
163
115
146
138
Stepfunstep-1o-turbo-202506
142
143
132
145
133
139
135
112
AntGroupring-flash-2.0
143
97
117
104
108
172
123
132
Metallama-3.3-70b-instruct
144
144
151
157
141
143
157
160
glm-4-plus
145
138
154
151
152
138
147
145
gemma-3n-e4b-it
146
157
158
176
172
128
162
155
gpt-oss-20b
147
121
149
125
114
186
165
162
Qwen Iconqwen-max-0919
148
139
155
150
147
147
144
143
Nvidianvidia-nemotron-3-nano-30b-a3b-bf16
149
104
136
134
96
179
141
170
gpt-4o-mini-2024-07-18
150
155
163
153
162
133
155
146
Qwen Iconqwen2.5-plus-1127
151
120
144
144
132
158
152
161
mistral-large-2407
152
140
153
149
150
141
150
165
athene-v2-chat
153
129
133
130
127
178
143
149
gpt-4-0125-preview
154
159
165
168
138
152
161
163
gpt-4-1106-preview
155
154
160
163
135
144
153
168
mercury
156
-
147
127
-
185
168
159
Tencenthunyuan-standard-2025-02-10
157
152
162
167
148
157
164
134
gemini-1.5-flash-002
158
158
170
174
149
127
159
148
grok-2-mini-2024-08-13
159
149
167
164
164
161
163
154
deepseek-v2.5
160
145
150
131
151
164
158
157
athene-70b-0725
161
142
161
152
176
162
169
173
mistral-large-2411
162
165
159
156
156
156
154
164
olmo-3-32b-think
163
141
141
137
120
168
149
150
magistral-medium-2506
164
151
131
111
154
142
142
129
mistral-small-3.1-24b-instruct-2503
165
128
152
139
159
159
151
139
gemma-3-4b-it
166
174
178
201
178
155
176
158
Qwen Iconqwen2.5-72b-instruct
167
147
156
147
140
175
156
153
Nvidiallama-3.1-nemotron-70b-instruct
168
148
164
170
161
154
166
180
Tencenthunyuan-large-vision
169
136
168
142
153
145
160
152
Metallama-3.1-70b-instruct
170
164
173
165
171
171
171
171
jamba-1.5-large
171
156
179
179
186
170
177
197
amazon-nova-pro-v1.0
172
162
171
160
169
190
170
166
ibm-granite-h-small
173
135
172
169
158
181
174
172
gemma-2-27b-it
174
177
181
186
188
135
175
169
reka-core-20240904
175
153
184
175
184
163
181
183
gpt-4-0314
176
172
166
171
157
176
167
185
Nvidiallama-3.1-nemotron-51b-instruct
177
178
182
180
168
169
182
189
llama-3.1-tulu-3-70b
178
-
188
182
174
180
172
187
gemini-1.5-flash-001
179
168
175
181
179
167
179
167
Anthropicclaude-3-sonnet-20240229
180
166
180
173
181
182
178
179
gemma-2-9b-it-simpo
181
191
190
205
210
148
188
175
Nvidianemotron-4-340b-instruct
182
176
183
185
180
188
183
177
Coherecommand-r-plus-08-2024
183
182
198
199
191
166
187
182
Metallama-3-70b-instruct
184
184
185
187
177
174
184
201
gpt-4-0613
185
181
174
178
166
165
173
184
mistral-small-24b-instruct-2501
186
173
177
177
175
192
185
181
glm-4-0520
187
179
187
183
183
184
186
192
reka-flash-20240904
188
163
194
194
189
183
191
195
Qwen Iconqwen2.5-coder-32b-instruct
189
160
169
159
167
202
180
176
Coherec4ai-aya-expanse-32b
190
170
192
193
190
191
190
174
gemma-2-9b-it
191
189
200
209
201
173
196
193
deepseek-coder-v2
192
169
176
161
170
203
189
178
Coherecommand-r-plus
193
186
202
210
204
189
198
198
Qwen Iconqwen2-72b-instruct
194
175
189
191
165
194
197
199
Anthropicclaude-3-haiku-20240307
195
180
195
189
193
197
194
194
amazon-nova-lite-v1.0
196
171
191
188
185
195
193
186
gemini-1.5-flash-8b-001
197
185
199
208
194
187
199
191
Azurephi-4
198
167
186
184
173
201
192
190
olmo-2-0325-32b-instruct
199
-
196
198
192
193
202
204
Coherecommand-r-08-2024
200
193
201
197
209
200
201
196
mistral-large-2402
201
195
197
192
187
198
200
205
amazon-nova-micro-v1.0
202
183
205
195
196
206
203
203
jamba-1.5-mini
203
205
208
211
228
199
214
217
ministral-8b-2410
204
187
203
204
202
196
207
200
gemini-pro-dev-api
205
216
213
227
226
204
212
214
Qwen Iconqwen1.5-110b-chat
206
194
204
200
197
212
204
209
Qwen Iconqwen1.5-72b-chat
207
190
207
206
208
215
210
206
Tencenthunyuan-standard-256k
208
-
193
190
182
209
195
188
reka-flash-21b-20240226-online
209
197
209
202
205
220
215
220
mixtral-8x22b-instruct-v0.1
210
202
206
203
195
214
206
218
Coherecommand-r
211
204
227
226
229
208
216
207
reka-flash-21b-20240226
212
203
211
212
214
222
222
219
gpt-3.5-turbo-0125
213
211
216
207
215
216
208
216
Coherec4ai-aya-expanse-8b
214
192
214
217
213
213
213
202
mistral-medium
215
196
210
213
198
211
211
215
Metallama-3-8b-instruct
216
207
223
219
224
207
220
221
gemini-pro
217
-
215
223
218
219
205
-
llama-3.1-tulu-3-8b
218
-
220
218
206
210
209
210
HuggingFacezephyr-orpo-141b-A35b-v0.1
219
219
226
225
217
225
221
231
01.AIyi-1.5-34b-chat
220
200
219
224
203
227
223
222
Metallama-3.1-8b-instruct
221
210
221
216
225
221
218
211
granite-3.1-8b-instruct
222
188
212
196
223
224
217
208
Qwen Iconqwen1.5-32b-chat
223
198
222
214
211
251
225
213
gpt-3.5-turbo-1106
224
206
217
215
212
232
219
229
gemma-2-2b-it
225
220
238
245
234
218
230
228
Azurephi-3-medium-4k-instruct
226
208
228
232
200
233
228
226
mixtral-8x7b-instruct-v0.1
227
212
229
230
227
228
227
232
dbrx-instruct-preview
228
214
225
221
220
226
224
227
Qwen Iconqwen1.5-14b-chat
229
213
231
229
231
242
233
225
InternLMinternlm2_5-20b-chat
230
199
218
222
207
253
226
224
Azurewizardlm-70b
231
-
245
248
236
205
234
233
deepseek-llm-67b-chat
232
-
243
236
239
252
235
234
01.AIyi-34b-chat
233
227
239
241
242
230
240
237
granite-3.0-8b-instruct
234
209
230
228
219
243
232
223
OpenChatopenchat-3.5-0106
235
225
237
233
237
237
237
238
OpenChatopenchat-3.5
236
224
244
243
252
217
241
230
granite-3.1-2b-instruct
237
201
224
220
216
234
229
212
gemma-1.1-7b-it
238
221
233
235
238
235
239
240
Snowflakesnowflake-arctic-instruct
239
223
236
234
233
239
242
256
tulu-2-dpo-70b
240
-
235
237
244
236
231
236
openhermes-2.5-mistral-7b
241
-
240
250
243
229
243
250
vicuna-33b
242
230
249
247
254
223
246
255
starling-lm-7b-beta
243
218
232
231
235
257
244
239
Azurephi-3-small-8k-instruct
244
222
234
242
222
249
238
242
Metallama-2-70b-chat
245
234
250
253
246
256
248
251
starling-lm-7b-alpha
246
233
248
240
248
245
247
244
Metallama-3.2-3b-instruct
247
215
247
254
232
238
245
243
nous-hermes-2-mixtral-8x7b-dpo
248
-
268
256
262
244
264
259
Qwen Iconqwq-32b-preview
249
226
246
255
199
258
236
235
Nvidiallama2-70b-steerlm-chat
250
-
260
270
256
255
253
272
granite-3.0-2b-instruct
251
217
241
239
230
263
251
245
solar-10.7b-instruct-v1.0
252
-
251
252
259
241
258
-
dolphin-2.2.1-mistral-7b
253
-
255
-
250
247
252
-
mpt-30b-chat
254
-
254
260
263
250
249
-
mistral-7b-instruct-v0.2
255
232
253
251
247
259
256
254
Azurewizardlm-13b
256
-
274
268
272
240
257
246
falcon-180b-chat
257
-
269
-
-
231
250
-
Qwen Iconqwen1.5-7b-chat
258
229
259
238
253
275
255
241
Azurephi-3-mini-4k-instruct-june-2024
259
228
242
244
221
264
254
262
Metallama-2-13b-chat
260
236
263
262
258
266
263
249
vicuna-13b
261
238
266
261
268
254
260
248
Qwen Iconqwen-14b-chat
262
-
262
246
251
261
259
257
palm-2
263
-
261
266
257
272
261
252
Metacodellama-34b-instruct
264
-
265
264
260
273
266
265
gemma-7b-it
265
237
257
258
255
262
267
260
HuggingFacezephyr-7b-beta
266
240
271
267
267
246
272
261
Azurephi-3-mini-128k-instruct
267
243
267
265
245
271
269
270
Azurephi-3-mini-4k-instruct
268
231
258
249
241
279
262
263
HuggingFacezephyr-7b-alpha
269
-
270
259
-
260
268
-
guanaco-33b
270
-
278
276
270
248
280
-
stripedhyena-nous-7b
271
-
276
275
265
267
273
271
Metacodellama-70b-instruct
272
-
252
-
-
-
271
-
HuggingFacesmollm2-1.7b-instruct
273
-
256
263
240
276
265
253
vicuna-7b
274
-
277
273
274
270
276
247
gemma-1.1-2b-it
275
241
264
257
261
268
270
258
Metallama-3.2-1b-instruct
276
244
272
269
249
274
274
264
mistral-7b-instruct
277
245
273
271
269
265
275
266
Metallama-2-7b-chat
278
235
279
278
264
278
277
269
gemma-2b-it
279
-
275
272
271
277
279
267
Qwen Iconqwen1.5-4b-chat
280
242
280
274
266
282
278
268
olmo-7b-instruct
281
239
281
277
273
286
283
-
koala-13b
282
-
284
281
280
283
284
-
alpaca-13b
283
-
289
287
277
269
285
-
gpt4all-13b-snoozy
284
-
282
-
276
280
281
-
mpt-7b-chat
285
-
285
280
278
281
286
-
chatglm3-6b
286
-
283
279
275
284
282
273
RWKVRWKV-4-Raven-14B
287
-
288
282
279
288
290
-
chatglm2-6b
288
-
286
285
282
287
288
-
oasst-pythia-12b
289
-
287
283
283
285
287
-
chatglm-6b
290
-
290
284
281
291
289
-
fastchat-t5-3b
291
-
293
289
285
289
291
-
dolly-v2-12b
292
-
291
288
284
290
292
-
Metallama-13b
293
-
294
290
286
292
293
-
Stabilitystablelm-tuned-alpha-7b
294
-
292
286
287
293
294
-