For frontier AI news
Powered by Code Arena

WebDev Leaderboard

Compare the performance of AI models for web development tasks built in the Code Arena
The legacy WebDev leaderboard is still available at web.lmarena.ai

Last Updated

Dec 11, 2025

Total Votes

59,038

Total Models

29

Rank Spread
(Upper-Lower)
1
1◄─►1
1519+13/-132,993
Anthropic
Proprietary
2
2◄─►4
1486+17/-171,641
OpenAI
Proprietary
3
2◄─►4
1483+13/-133,039
Anthropic
Proprietary
4
2◄─►4
1482+10/-107,897
Google
Proprietary
5
5◄─►11
1400+12/-123,945
OpenAI
Proprietary
6
5◄─►11
1399+15/-151,639
OpenAI
Proprietary
7
5◄─►11
1395+10/-106,974
Anthropic
Proprietary
8
5◄─►11
1395+10/-106,705
Anthropic
Proprietary
9
5◄─►11
1394+11/-114,119
OpenAI
Proprietary
10
5◄─►12
1387+9/-98,006
Anthropic
Proprietary
11
10◄─►13
1369+10/-106,461
Z.ai
MIT
12
5◄─►15
1369+29/-29410
DeepSeek AI
MIT
13
11◄─►15
1358+10/-105,955
OpenAI
Proprietary
14
12◄─►15
1345+10/-105,792
Moonshot
Modified MIT
15
12◄─►15
1340+11/-114,298
OpenAI
Proprietary
16
16◄─►17
Minimax
1317+10/-106,457
MiniMax
Apache 2.0
17
17◄─►20
1295+10/-105,155
DeepSeek AI
MIT
18
17◄─►20
1290+9/-96,674
Alibaba
Apache 2.0
19
16◄─►22
1289+22/-22725
DeepSeek AI
MIT
20
17◄─►21
1287+10/-106,702
Anthropic
Proprietary
21
19◄─►22
1265+15/-151,943
KwaiKAT
Proprietary
22
20◄─►24
1252+17/-171,565
OpenAI
Proprietary
23
22◄─►26
1228+13/-133,710
xAI
Proprietary
24
22◄─►26
1227+20/-201,023
Mistral
Apache 2.0
25
23◄─►26
1214+12/-123,504
Google
Proprietary
26
23◄─►26
1206+19/-191,260
xAI
Proprietary
27
27◄─►28
1154+23/-23944
xAI
Proprietary
28
27◄─►29
1144+21/-211,014
xAI
Proprietary
29
28◄─►29
1103+21/-211,032
Mistral
Proprietary

Remove Style Control Leaderboard Plots

Confidence Intervals on Model Strength (via Bootstrapping)

Average Win Rate Against All Other Models (Uniform Sampling and No Ties)

Battle Count for Each Combination of Models (without Ties)

Fraction of Model A Wins for All Non-tied A vs. B Battles