For frontier AI news
Powered by Code Arena

WebDev Leaderboard

Compare the performance of AI models for web development tasks built in the Code Arena

Last Updated

Jan 12, 2026

Total Votes

99,992

Total Models

34

Rank Spread
1
1◄─►1
1510+10/-106,197
Anthropic
Proprietary
2
2◄─►4
1480+10/-105,853
Anthropic
Proprietary
3
2◄─►5
1476+16/-161,691
OpenAI
Proprietary
4
2◄─►5
1468+8/-812,331
Google
Proprietary
5
3◄─►6
1453+9/-95,810
Google
Proprietary
6
5◄─►6
1448+10/-104,253
Z.ai
MIT
7
7◄─►7
1427+9/-95,817
MiniMax
MIT
8
8◄─►14
1401+15/-151,627
OpenAI
Proprietary
9
8◄─►14
1397+12/-123,929
OpenAI
Proprietary
10
8◄─►14
1392+9/-96,588
OpenAI
Proprietary
11
8◄─►14
1392+8/-89,833
Anthropic
Proprietary
12
8◄─►14
1390+8/-89,117
Anthropic
Proprietary
13
8◄─►14
1386+8/-811,290
Anthropic
Proprietary
14
8◄─►16
1381+14/-141,892
Google
Proprietary
15
14◄─►18
1365+12/-122,691
DeepSeek
MIT
16
14◄─►18
1360+8/-88,882
Z.ai
MIT
17
15◄─►18
1356+8/-88,756
OpenAI
Proprietary
18
15◄─►20
1344+11/-112,790
Xiaomi
MIT
19
18◄─►20
1335+8/-88,478
Moonshot
Modified MIT
20
18◄─►21
1334+9/-96,658
OpenAI
Proprietary
21
20◄─►21
Minimax
1317+8/-88,991
MiniMax
Apache 2.0
22
22◄─►25
1294+8/-89,556
Anthropic
Proprietary
23
22◄─►25
1293+11/-113,475
DeepSeek
MIT
24
22◄─►25
1290+10/-105,128
DeepSeek
MIT
25
22◄─►25
1286+8/-89,381
Alibaba
Apache 2.0
26
26◄─►27
1263+15/-151,955
KwaiKAT
Proprietary
27
26◄─►29
1247+17/-171,538
OpenAI
Proprietary
28
27◄─►31
1225+12/-123,993
xAI
Proprietary
29
27◄─►31
1224+20/-201,037
Mistral
Apache 2.0
30
28◄─►31
1209+13/-133,453
Google
Proprietary
31
28◄─►31
1207+19/-191,265
xAI
Proprietary
32
32◄─►33
1156+22/-22970
xAI
Proprietary
33
32◄─►34
1143+21/-211,015
xAI
Proprietary
34
33◄─►34
1101+22/-221,020
Mistral
Proprietary

Remove Style Control Leaderboard Plots

Fraction of Model A Wins for All Non-tied A vs. B Battles

Battle Count for Each Combination of Models (without Ties)

Confidence Intervals on Model Strength (via Bootstrapping)

Average Win Rate Against All Other Models (Uniform Sampling and No Ties)