Compare the performance of AI models on web development tasks built in the Code Arena.
The legacy WebDev leaderboard is still available at web.lmarena.ai.
Last Updated: Nov 26, 2025
Total Votes: 41,430
Total Models: 22
| Rank | Rank Spread (Upper-Lower) | Score | CI (+/-) | Votes | Organization | License | |
|---|---|---|---|---|---|---|---|
| 1 | 1◄─►3 | 1493 | +20/-20 | 1,109 | Anthropic | Proprietary | |
| 2 | 1◄─►3 | 1479 | +17/-17 | 1,421 | Anthropic | Proprietary | |
| 3 | 1◄─►3 | 1473 | +11/-11 | 6,037 | Google | Proprietary | |
| 4 | 4◄─►8 | 1399 | +12/-12 | 3,937 | OpenAI | Proprietary | |
| 5 | 4◄─►8 | 1397 | +10/-10 | 5,376 | Anthropic | Proprietary | |
| 6 | 4◄─►8 | 1395 | +13/-13 | 2,431 | OpenAI | Proprietary | |
| 7 | 4◄─►8 | 1393 | +10/-10 | 5,204 | Anthropic | Proprietary | |
| 8 | 4◄─►9 | 1387 | +10/-10 | 6,422 | Anthropic | Proprietary | |
| 9 | 8◄─►11 | 1370 | +11/-11 | 5,035 | Z.ai | MIT | |
| 10 | 9◄─►12 | 1358 | +11/-11 | 4,258 | Moonshot | Modified MIT | |
| 11 | 9◄─►12 | 1358 | +11/-11 | 4,484 | OpenAI | Proprietary | |
| 12 | 10◄─►13 | 1340 | +12/-12 | 2,793 | OpenAI | Proprietary | |
| 13 | 12◄─►13 | 1321 | +11/-11 | 4,956 | MiniMax | Apache 2.0 | |
| 14 | 14◄─►16 | 1294 | +11/-11 | 4,650 | DeepSeek AI | MIT | |
| 15 | 14◄─►16 | 1293 | +11/-11 | 5,159 | Alibaba | Apache 2.0 | |
| 16 | 14◄─►16 | 1289 | +10/-10 | 5,158 | Anthropic | Proprietary | |
| 17 | 17◄─►18 | 1253 | +16/-16 | 1,563 | OpenAI | Proprietary | |
| 18 | 17◄─►20 | 1228 | +17/-17 | 1,534 | xAI | Proprietary | |
| 19 | 18◄─►20 | 1211 | +12/-12 | 3,503 | Google | Proprietary | |
| 20 | 18◄─►20 | 1207 | +19/-19 | 1,253 | xAI | Proprietary | |
| 21 | 21◄─►22 | 1124 | +24/-24 | 727 | xAI | Proprietary | |
| 22 | 21◄─►22 | 1120 | +29/-29 | 530 | xAI | Proprietary | |
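
The Rank Spread column appears to follow from the score and its confidence interval. The sketch below is an inferred reading, not a documented method: it assumes the interval is score plus or minus the listed value, that a model's best possible rank is one more than the number of models whose lower bound lies above its upper bound, and that its worst possible rank is one more than the number of other models whose upper bound lies above its lower bound. The names `ROWS` and `rank_spreads` are hypothetical; the numbers are copied from the table above. Applied to the rounded values shown, this rule gives 1 to 3 for the first row and 4 to 9 for the eighth, matching the table.

```python
# Each entry: (score, CI half-width, published rank spread), copied from the
# table above in rank order. The interval rule itself is an assumption.
ROWS = [
    (1493, 20, (1, 3)),   (1479, 17, (1, 3)),   (1473, 11, (1, 3)),
    (1399, 12, (4, 8)),   (1397, 10, (4, 8)),   (1395, 13, (4, 8)),
    (1393, 10, (4, 8)),   (1387, 10, (4, 9)),   (1370, 11, (8, 11)),
    (1358, 11, (9, 12)),  (1358, 11, (9, 12)),  (1340, 12, (10, 13)),
    (1321, 11, (12, 13)), (1294, 11, (14, 16)), (1293, 11, (14, 16)),
    (1289, 10, (14, 16)), (1253, 16, (17, 18)), (1228, 17, (17, 20)),
    (1211, 12, (18, 20)), (1207, 19, (18, 20)), (1124, 24, (21, 22)),
    (1120, 29, (21, 22)),
]


def rank_spreads(rows):
    """Return (best_rank, worst_rank) per row from interval overlap."""
    # Assumed interval: [score - ci, score + ci].
    intervals = [(score - ci, score + ci) for score, ci, _ in rows]
    spreads = []
    for i, (low, high) in enumerate(intervals):
        # Models that are clearly better: their lower bound beats our upper bound.
        clearly_better = sum(
            1 for j, (other_low, _) in enumerate(intervals)
            if j != i and other_low > high
        )
        # Models that could be better: their upper bound beats our lower bound.
        possibly_better = sum(
            1 for j, (_, other_high) in enumerate(intervals)
            if j != i and other_high > low
        )
        spreads.append((1 + clearly_better, 1 + possibly_better))
    return spreads


computed = rank_spreads(ROWS)
published = [spread for _, _, spread in ROWS]
mismatches = [
    (rank, got, want)
    for rank, (got, want) in enumerate(zip(computed, published), start=1)
    if got != want
]
print("rows that differ from the published spread:", mismatches)
```

Because the table shows rounded scores and intervals, a rule like this could disagree with the published spread wherever two bounds fall within a point of each other, so any mismatch reported by the script would not by itself show that the rule is wrong.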