WebDev Leaderboard

Compare how AI models perform on web development tasks built in the Code Arena.
The legacy WebDev leaderboard is still available at web.lmarena.ai.

Last Updated: Dec 16, 2025
Total Votes: 66,698
Total Models: 31

Rank  Rank Spread (Upper-Lower)  Score  CI       Votes  Organization  License
1     1◄─►1                      1518   +13/-13  3,592  Anthropic     Proprietary
2     2◄─►5                      1485   +17/-17  1,646  OpenAI        Proprietary
3     2◄─►5                      1484   +12/-12  3,511  Anthropic     Proprietary
4     2◄─►5                      1481   +10/-10  8,535  Google        Proprietary
5     2◄─►5                      1465   +15/-15  1,725  Google        Proprietary
6     6◄─►12                     1399   +12/-12  3,948  OpenAI        Proprietary
7     6◄─►13                     1399   +15/-15  1,640  OpenAI        Proprietary
8     6◄─►13                     1393   +11/-11  4,645  OpenAI        Proprietary
9     6◄─►13                     1393   +9/-9    7,580  Anthropic     Proprietary
10    6◄─►13                     1392   +10/-10  7,296  Anthropic     Proprietary
11    6◄─►13                     1387   +9/-9    8,626  Anthropic     Proprietary
12    6◄─►15                     1376   +15/-15  1,690  Google        Proprietary
13    12◄─►15                    1368   +10/-10  6,981  Z.ai          MIT
14    7◄─►16                     1367   +20/-19  955    DeepSeek AI   MIT
15    12◄─►16                    1359   +10/-10  6,540  OpenAI        Proprietary
16    14◄─►17                    1345   +10/-10  6,359  Moonshot      Modified MIT
17    16◄─►18                    1335   +10/-10  4,793  OpenAI        Proprietary
18    17◄─►18                    1317   +10/-10  7,037  MiniMax       Apache 2.0
19    19◄─►22                    1294   +10/-10  5,156  DeepSeek AI   MIT
20    19◄─►22                    1291   +9/-9    7,246  Alibaba       Apache 2.0
21    19◄─►23                    1289   +10/-10  7,305  Anthropic     Proprietary
22    19◄─►23                    1286   +17/-17  1,230  DeepSeek AI   MIT
23    21◄─►24                    1264   +15/-15  1,945  KwaiKAT       Proprietary
24    23◄─►26                    1252   +17/-17  1,565  OpenAI        Proprietary
25    24◄─►28                    1227   +13/-13  3,715  xAI           Proprietary
26    24◄─►28                    1226   +20/-20  1,025  Mistral       Apache 2.0
27    25◄─►28                    1213   +13/-13  3,505  Google        Proprietary
28    25◄─►28                    1206   +19/-19  1,261  xAI           Proprietary
29    29◄─►30                    1153   +23/-23  945    xAI           Proprietary
30    29◄─►31                    1144   +21/-21  1,014  xAI           Proprietary
31    30◄─►31                    1103   +22/-22  1,033  Mistral       Proprietary

Leaderboard Plots (Remove Style Control)

Average Win Rate Against All Other Models (Uniform Sampling and No Ties)

Confidence Intervals on Model Strength (via Bootstrapping)

Battle Count for Each Combination of Models (without Ties)

Fraction of Model A Wins for All Non-tied A vs. B Battles
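
The two win-rate plots named above can be illustrated with a small sketch. This is not the leaderboard's own code: the battle records, the model names `m1` to `m3`, and the helper functions `win_fractions` and `average_win_rate` are all hypothetical. It assumes each battle is stored as (model A, model B, winner), drops ties, and treats "uniform sampling" as an unweighted mean over the opponents a model has actually faced.

```python
from collections import defaultdict

# Hypothetical battle records: (model_a, model_b, winner),
# where winner is "a", "b", or "tie". Not real leaderboard data.
battles = [
    ("m1", "m2", "a"),
    ("m1", "m2", "b"),
    ("m1", "m2", "a"),
    ("m1", "m3", "a"),
    ("m2", "m3", "tie"),
    ("m3", "m2", "b"),
    ("m3", "m1", "b"),
]

def win_fractions(battles):
    """Fraction of non-tied battles that x won against y, keyed by (x, y)."""
    wins = defaultdict(int)
    counts = defaultdict(int)
    for a, b, winner in battles:
        if winner == "tie":
            continue  # ties are excluded, as in the plot titles
        # Count the battle for both orderings so (x, y) and (y, x) both exist.
        counts[(a, b)] += 1
        counts[(b, a)] += 1
        if winner == "a":
            wins[(a, b)] += 1
        else:
            wins[(b, a)] += 1
    return {pair: wins[pair] / n for pair, n in counts.items()}

def average_win_rate(fractions):
    """Unweighted mean of a model's win fraction over each opponent it faced."""
    per_model = defaultdict(list)
    for (model, _opponent), frac in fractions.items():
        per_model[model].append(frac)
    return {m: sum(v) / len(v) for m, v in per_model.items()}

fractions = win_fractions(battles)
rates = average_win_rate(fractions)
for model in sorted(rates, key=rates.get, reverse=True):
    print(f"{model}: {rates[model]:.3f}")
```

Averaging per opponent rather than per battle keeps a frequently sampled pairing from dominating a model's average, which is one reading of "uniform sampling" in the plot title.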