| Gemini 3.1 Pro Preview | Google | 57.2 | 110.4 | $4.50 |
| GPT-5.4 (xhigh) | OpenAI | 57.0 | 78.5 | $5.63 |
| GPT-5.3 Codex (xhigh) | OpenAI | 54.0 | 68.3 | $4.81 |
| Claude Opus 4.6 (Adaptive Reasoning, Max Effort) | Anthropic | 53.0 | 55.1 | $10.00 |
| Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) | Anthropic | 51.7 | 68.5 | $6.00 |
| GPT-5.2 (xhigh) | OpenAI | 51.3 | 68.5 | $4.81 |
| GLM-5 (Reasoning) | Z AI | 49.8 | 50.0 | $1.55 |
| Claude Opus 4.5 (Reasoning) | Anthropic | 49.7 | 60.5 | $10.00 |
| GPT-5.2 Codex (xhigh) | OpenAI | 49.0 | 64.7 | $4.81 |
| Gemini 3 Pro Preview (high) | Google | 48.4 | 114.0 | $4.50 |
| GPT-5.1 (high) | OpenAI | 47.7 | 95.5 | $3.44 |
| GPT-5.2 (medium) | OpenAI | 46.6 | 0.0 | $4.81 |
| Claude Opus 4.6 (Non-reasoning, High Effort) | Anthropic | 46.5 | 49.1 | $10.00 |
| Gemini 3 Flash Preview (Reasoning) | Google | 46.4 | 163.8 | $1.13 |
| Qwen3.5 397B A17B (Reasoning) | Alibaba | 45.0 | 56.0 | $1.35 |
| Qwen3.5 397B A17B (Reasoning) | Alibaba | 45.0 | 54.5 | $1.35 |
| GPT-5 Codex (high) | OpenAI | 44.6 | 182.4 | $3.44 |
| GPT-5 (high) | OpenAI | 44.6 | 66.7 | $3.44 |
| Claude Sonnet 4.6 (Non-reasoning, High Effort) | Anthropic | 44.4 | 53.1 | $6.00 |
| GPT-5.1 Codex (high) | OpenAI | 43.1 | 123.4 | $3.44 |
| Claude Opus 4.5 (Non-reasoning) | Anthropic | 43.1 | 53.8 | $10.00 |
| Claude 4.5 Sonnet (Reasoning) | Anthropic | 43.0 | 53.0 | $6.00 |
| Claude Sonnet 4.6 (Non-reasoning, Low Effort) | Anthropic | 42.6 | 52.5 | $6.00 |
| Qwen3.5 27B (Reasoning) | Alibaba | 42.1 | 89.9 | $0.82 |
| GLM-4.7 (Reasoning) | Z AI | 42.1 | 102.3 | $1.00 |
| GPT-5 (medium) | OpenAI | 42.0 | 57.1 | $3.44 |
| DeepSeek V3.2 (Reasoning) | DeepSeek | 41.7 | 28.2 | $0.32 |
| Qwen3.5 122B A10B (Reasoning) | Alibaba | 41.6 | 153.1 | $1.10 |
| Grok 4 | xAI | 41.5 | 39.5 | $6.00 |
| MiMo-V2-Flash (Feb 2026) | Xiaomi | 41.5 | 119.0 | $0.15 |
| Gemini 3 Pro Preview (low) | Google | 41.3 | 110.3 | $4.50 |
| GPT-5 mini (high) | OpenAI | 41.2 | 68.9 | $0.69 |
| Kimi K2 Thinking | Moonshot AI | 40.9 | 69.5 | $1.07 |
| o3-pro | OpenAI | 40.7 | 14.0 | $35.00 |
| GLM-5 (Non-reasoning) | Z AI | 40.6 | 43.9 | $1.55 |
| Qwen3.5 397B A17B (Non-reasoning) | Alibaba | 40.1 | 52.8 | $1.35 |
| Qwen3 Max Thinking | Alibaba | 39.9 | 33.4 | $2.40 |
| MiniMax-M2.1 | MiniMax | 39.4 | 40.5 | $0.53 |
| GPT-5 (low) | OpenAI | 39.2 | 55.2 | $3.44 |
| MiMo-V2-Flash (Reasoning) | Xiaomi | 39.2 | 121.0 | $0.15 |
| GPT-5 mini (medium) | OpenAI | 38.9 | 77.2 | $0.69 |
| Claude 4 Sonnet (Reasoning) | Anthropic | 38.7 | 54.3 | $6.00 |
| GPT-5.1 Codex mini (high) | OpenAI | 38.6 | 159.0 | $0.69 |
| Grok 4.1 Fast (Reasoning) | xAI | 38.6 | 118.9 | $0.28 |
| o3 | OpenAI | 38.3 | 278.4 | $3.50 |
| Step 3.5 Flash | StepFun | 37.8 | 149.4 | $0.15 |
| Qwen3.5 27B (Non-reasoning) | Alibaba | 37.2 | 89.2 | $0.82 |
| Qwen3.5 35B A3B (Reasoning) | Alibaba | 37.1 | 135.7 | $0.69 |
| Claude 4.5 Haiku (Reasoning) | Anthropic | 37.1 | 128.6 | $2.00 |
| Claude 4.5 Sonnet (Non-reasoning) | Anthropic | 37.1 | 49.4 | $6.00 |
| MiniMax-M2.5 | MiniMax | 36.1 | 41.6 | $0.53 |
| KAT-Coder-Pro V1 | KwaiKAT | 36.0 | 58.7 | $0.53 |
| Qwen3.5 122B A10B (Non-reasoning) | Alibaba | 35.9 | 142.0 | $1.10 |
| Nova 2.0 Pro Preview (medium) | Amazon | 35.7 | 162.4 | $3.44 |
| MiniMax-M2 | MiniMax | 35.6 | 107.9 | $0.53 |
| Grok 4 Fast (Reasoning) | xAI | 35.1 | 169.5 | $0.28 |
| Gemini 3 Flash Preview (Non-reasoning) | Google | 35.0 | 163.1 | $1.13 |
| Claude 3.7 Sonnet (Reasoning) | Anthropic | 34.7 | 0.0 | $6.00 |
| Gemini 2.5 Pro | Google | 34.6 | 123.4 | $3.44 |
| GLM-4.7 (Non-reasoning) | Z AI | 34.2 | 93.9 | $0.94 |
| DeepSeek V3.2 Speciale | DeepSeek | 34.1 | 0.0 | Free |
| DeepSeek V3.1 Terminus (Reasoning) | DeepSeek | 33.9 | 0.0 | $0.80 |
| GPT-5.2 (Non-reasoning) | OpenAI | 33.6 | 61.5 | $4.81 |
| Gemini 3.1 Flash-Lite Preview | Google | 33.5 | 287.0 | $0.56 |
| Doubao Seed Code | ByteDance Seed | 33.5 | 0.0 | Free |
| gpt-oss-120B (high) | OpenAI | 33.3 | 268.6 | $0.26 |
| o4-mini (high) | OpenAI | 33.1 | 133.4 | $1.93 |
| Claude 4 Sonnet (Non-reasoning) | Anthropic | 33.0 | 49.8 | $6.00 |
| DeepSeek V3.2 Exp (Reasoning) | DeepSeek | 32.9 | 29.4 | $0.32 |
| Mercury 2 | Inception | 32.8 | 842.2 | $0.38 |
| GLM-4.6 (Reasoning) | Z AI | 32.5 | 81.4 | $0.98 |
| Qwen3 Max Thinking (Preview) | Alibaba | 32.5 | 40.1 | $2.40 |
| Qwen3.5 9B (Reasoning) | Alibaba | 32.4 | 134.9 | $0.11 |
| DeepSeek V3.2 (Non-reasoning) | DeepSeek | 32.1 | 29.5 | $0.32 |
| K-EXAONE (Reasoning) | LG AI Research | 32.1 | 0.0 | Free |
| Claude 4.1 Opus (Reasoning) | Anthropic | 31.9 | 38.5 | $30.00 |
| Nova 2.0 Pro Preview (low) | Amazon | 31.9 | 173.6 | $3.44 |
| Claude 4.5 Haiku (Non-reasoning) | Anthropic | 31.1 | 105.5 | $2.00 |
| Gemini 2.5 Flash Preview (Sep '25) (Reasoning) | Google | 31.1 | 0.0 | $0.85 |
| Kimi K2 0905 | Moonshot AI | 30.9 | 79.4 | $1.20 |
| Claude 3.7 Sonnet (Non-reasoning) | Anthropic | 30.8 | 0.0 | $6.00 |
| Qwen3.5 35B A3B (Non-reasoning) | Alibaba | 30.7 | 132.9 | $0.69 |
| MiMo-V2-Flash (Non-reasoning) | Xiaomi | 30.4 | 127.5 | $0.15 |
| Gemini 2.5 Pro Preview (Mar '25) | Google | 30.3 | 0.0 | Free |
| GLM-4.6 (Non-reasoning) | Z AI | 30.2 | 78.0 | $1.00 |
| GLM-4.7-Flash (Reasoning) | Z AI | 30.1 | 117.7 | $0.15 |
| Nova 2.0 Lite (medium) | Amazon | 29.7 | 232.3 | $0.85 |
| Qwen3 235B A22B 2507 (Reasoning) | Alibaba | 29.5 | 44.5 | $2.63 |
| Gemini 2.5 Pro Preview (May '25) | Google | 29.5 | 0.0 | $3.44 |
| ERNIE 5.0 Thinking Preview | Baidu | 29.1 | 0.0 | Free |
| Grok Code Fast 1 | xAI | 28.7 | 163.7 | $0.53 |
| DeepSeek V3.1 Terminus (Non-reasoning) | DeepSeek | 28.5 | 0.0 | $0.63 |
| DeepSeek V3.2 Exp (Non-reasoning) | DeepSeek | 28.4 | 30.3 | $0.32 |
| Qwen3 Coder Next | Alibaba | 28.3 | 160.9 | $0.53 |
| Apriel-v1.5-15B-Thinker | ServiceNow | 28.3 | 147.7 | Free |
| DeepSeek V3.1 (Non-reasoning) | DeepSeek | 28.1 | 0.0 | $0.83 |
| Qwen3 Coder Next | Alibaba | 28.1 | 111.8 | $0.53 |
| Nova 2.0 Omni (medium) | Amazon | 28.0 | 0.0 | $0.85 |
| DeepSeek V3.1 (Reasoning) | DeepSeek | 27.7 | 0.0 | $0.86 |
| Apriel-v1.6-15B-Thinker | ServiceNow | 27.6 | 77.0 | Free |
| Qwen3 VL 235B A22B (Reasoning) | Alibaba | 27.6 | 44.3 | $2.63 |
| GPT-5.1 (Non-reasoning) | OpenAI | 27.4 | 92.5 | $3.44 |
| Claude 4 Opus (Reasoning) | Anthropic | 27.4 | 42.0 | $30.00 |
| Qwen3.5 9B (Non-reasoning) | Alibaba | 27.3 | 0.0 | Free |
| Qwen3.5 4B (Reasoning) | Alibaba | 27.1 | 0.0 | Free |
| DeepSeek R1 0528 (May '25) | DeepSeek | 27.1 | 0.0 | $2.36 |
| Gemini 2.5 Flash (Reasoning) | Google | 27.0 | 234.4 | $0.85 |
| GPT-5 nano (high) | OpenAI | 26.8 | 123.8 | $0.14 |
| Qwen3 Next 80B A3B (Reasoning) | Alibaba | 26.7 | 153.4 | $1.88 |
| GLM-4.5 (Reasoning) | Z AI | 26.4 | 36.5 | $0.84 |
| Kimi K2.5 (Reasoning) | Kimi | 26.3 | 40.4 | $1.07 |
| Kimi K2.5 (Non-reasoning) | Kimi | 26.3 | 40.4 | $1.07 |
| Kimi K2 | Moonshot AI | 26.2 | 43.4 | $1.07 |
| Qwen3 Max (Preview) | Alibaba | 26.1 | 50.3 | $2.40 |
| o3-mini | OpenAI | 25.9 | 163.6 | $1.93 |
| GPT-5 nano (medium) | OpenAI | 25.9 | 135.0 | $0.14 |
| o1-pro | OpenAI | 25.8 | 0.0 | $262.50 |
| Gemini 2.5 Flash Preview (Sep '25) (Non-reasoning) | Google | 25.7 | 0.0 | $0.85 |
| o3-mini (high) | OpenAI | 25.2 | 157.0 | $1.93 |
| Grok 3 mini Reasoning (high) | xAI | 25.2 | 77.0 | $6.00 |
| o1 | OpenAI | 25.2 | 207.5 | $26.25 |
| Seed-OSS-36B-Instruct | ByteDance Seed | 25.2 | 28.7 | $0.30 |
| Grok 3 | xAI | 25.0 | 67.9 | $6.00 |
| Qwen3 235B A22B 2507 Instruct | Alibaba | 25.0 | 66.0 | $1.23 |
| Qwen3 Coder 480B A35B Instruct | Alibaba | 24.8 | 64.7 | $3.00 |
| Qwen3 VL 32B (Reasoning) | Alibaba | 24.7 | 97.2 | $2.63 |
| Nova 2.0 Lite (low) | Amazon | 24.6 | 235.8 | $0.85 |
| gpt-oss-120B (low) | OpenAI | 24.5 | 282.6 | $0.26 |
| gpt-oss-20B (high) | OpenAI | 24.5 | 284.2 | $0.09 |
| MiniMax M1 80k | MiniMax | 24.4 | 0.0 | $0.96 |
| NVIDIA Nemotron 3 Nano 30B A3B (Reasoning) | NVIDIA | 24.3 | 196.9 | $0.10 |
| Gemini 2.5 Flash Preview (Reasoning) | Google | 24.3 | 0.0 | Free |
| K2 Think V2 | MBZUAI Institute of Foundation Models | 24.1 | 0.0 | Free |
| GPT-5 (minimal) | OpenAI | 23.9 | 47.2 | $3.44 |
| HyperCLOVA X SEED Think (32B) | Naver | 23.7 | 0.0 | Free |
| o1-preview | OpenAI | 23.7 | 0.0 | $28.88 |
| Claude 4.1 Opus (Non-reasoning) | Anthropic | 23.6 | 36.8 | $30.00 |
| Grok 4.1 Fast (Non-reasoning) | xAI | 23.6 | 109.2 | $0.28 |
| GLM-4.6V (Reasoning) | Z AI | 23.4 | 30.9 | $0.45 |
| K-EXAONE (Non-reasoning) | LG AI Research | 23.4 | 0.0 | Free |
| Nova 2.0 Omni (low) | Amazon | 23.2 | 0.0 | $0.85 |
| GLM-4.5-Air | Z AI | 23.2 | 95.1 | $0.42 |
| Grok 4 Fast (Non-reasoning) | xAI | 23.1 | 155.5 | $0.28 |
| Nova 2.0 Pro Preview (Non-reasoning) | Amazon | 23.1 | 160.8 | $3.44 |
| Mi:dm K 2.5 Pro Preview | Korea Telecom | 23.1 | 0.0 | Free |
| Ring-1T | InclusionAI | 22.8 | 0.0 | Free |
| Mistral Large 3 | Mistral | 22.8 | 57.7 | $0.75 |
| Qwen3.5 4B (Non-reasoning) | Alibaba | 22.6 | 0.0 | Free |
| Qwen3 30B A3B 2507 (Reasoning) | Alibaba | 22.4 | 138.6 | $0.75 |
| DeepSeek V3 0324 | DeepSeek | 22.3 | 0.0 | $1.25 |
| INTELLECT-3 | Prime Intellect | 22.2 | 0.0 | Free |
| Claude 4 Opus (Non-reasoning) | Anthropic | 22.2 | 37.6 | $30.00 |
| GLM-4.7-Flash (Non-reasoning) | Z AI | 22.1 | 187.6 | $0.15 |
| Devstral 2 | Mistral | 22.0 | 80.3 | Free |
| GPT-5 (ChatGPT) | OpenAI | 21.8 | 152.3 | $3.44 |
| Solar Open 100B (Reasoning) | Upstage | 21.7 | 0.0 | Free |
| Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning) | Google | 21.6 | 282.9 | $0.17 |
| Grok 3 Reasoning Beta | xAI | 21.6 | 0.0 | Free |
| MiniMax M1 40k | MiniMax | 20.9 | 0.0 | Free |
| gpt-oss-20B (low) | OpenAI | 20.8 | 285.6 | $0.09 |
| Qwen3 VL 235B A22B Instruct | Alibaba | 20.8 | 58.1 | $1.23 |
| GPT-5 mini (minimal) | OpenAI | 20.7 | 77.0 | $0.69 |
| Gemini 2.5 Flash (Non-reasoning) | Google | 20.6 | 207.0 | $0.85 |
| K2-V2 (high) | MBZUAI Institute of Foundation Models | 20.6 | 0.0 | Free |
| o1-mini | OpenAI | 20.4 | 0.0 | Free |
| Qwen3 Next 80B A3B Instruct | Alibaba | 20.1 | 149.8 | $0.88 |
| Tri-21B-think Preview | Trillion Labs | 20.0 | 0.0 | Free |
| GPT-4.5 (Preview) | OpenAI | 20.0 | 0.0 | Free |
| Qwen3 Coder 30B A3B Instruct | Alibaba | 20.0 | 25.9 | $0.90 |
| Qwen3 235B A22B (Reasoning) | Alibaba | 19.8 | 60.1 | $2.63 |
| QwQ 32B | Alibaba | 19.7 | 32.9 | $0.47 |
| Qwen3 VL 30B A3B (Reasoning) | Alibaba | 19.7 | 107.5 | $0.75 |
| Gemini 2.0 Flash Thinking Experimental (Jan '25) | Google | 19.6 | 0.0 | Free |
| Devstral Small 2 | Mistral | 19.5 | 201.3 | Free |
| Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning) | Google | 19.4 | 347.6 | $0.17 |
| Motif-2-12.7B-Reasoning | Motif Technologies | 19.1 | 0.0 | Free |
| Ling-1T | InclusionAI | 19.0 | 0.0 | Free |
| Nova Premier | Amazon | 19.0 | 70.3 | $5.00 |
| Magistral Medium 1.2 | Mistral | 18.8 | 0.0 | Free |
| Solar Pro 2 (Preview) (Reasoning) | Upstage | 18.8 | 0.0 | Free |
| Magistral Medium 1 | Mistral | 18.8 | 0.0 | Free |
| NVIDIA Nemotron Nano 9B V2 (Non-reasoning) | NVIDIA | 18.8 | 134.9 | $0.10 |
| DeepSeek R1 (Jan '25) | DeepSeek | 18.8 | 0.0 | $2.36 |
| Devstral Medium | Mistral | 18.7 | 134.7 | $0.80 |
| Llama Nemotron Super 49B v1.5 (Reasoning) | NVIDIA | 18.7 | 83.3 | $0.17 |
| Claude 3.5 Haiku | Anthropic | 18.7 | 0.0 | $1.60 |
| K2-V2 (medium) | MBZUAI Institute of Foundation Models | 18.7 | 0.0 | Free |
| GPT-4o (Aug '24) | OpenAI | 18.6 | 82.3 | $4.38 |
| GPT-4o (March 2025, chatgpt-4o-latest) | OpenAI | 18.6 | 0.0 | Free |
| Hermes 4 - Llama-3.1 405B (Reasoning) | Nous Research | 18.6 | 33.9 | $1.50 |
| Tri-21B-Think | Trillion Labs | 18.6 | 0.0 | Free |
| Gemini 2.0 Flash (Feb '25) | Google | 18.5 | 0.0 | $0.26 |
| Llama 3.3 Nemotron Super 49B v1 (Reasoning) | NVIDIA | 18.5 | 0.0 | Free |
| Llama 4 Maverick | Meta | 18.4 | 126.1 | $0.46 |
| Qwen3 4B 2507 (Reasoning) | Alibaba | 18.2 | 0.0 | Free |
| Gemini 2.0 Pro Experimental (Feb '25) | Google | 18.1 | 0.0 | Free |
| Nova 2.0 Lite (Non-reasoning) | Amazon | 18.0 | 208.5 | $0.85 |
| Devstral Small (May '25) | Mistral | 18.0 | 0.0 | $0.07 |
| Sonar Reasoning Pro | Perplexity | 17.9 | 0.0 | Free |
| Sonar Reasoning | Perplexity | 17.9 | 0.0 | Free |
| Gemini 2.5 Flash Preview (Non-reasoning) | Google | 17.8 | 0.0 | Free |
| Hermes 4 - Llama-3.1 405B (Non-reasoning) | Nous Research | 17.6 | 35.4 | $1.50 |
| Gemini 2.5 Flash-Lite (Reasoning) | Google | 17.6 | 230.0 | $0.17 |
| Llama 3.1 Instruct 405B | Meta | 17.4 | 30.4 | $4.38 |
| GPT-4o (Nov '24) | OpenAI | 17.3 | 116.5 | $4.38 |
| Qwen3 VL 32B Instruct | Alibaba | 17.2 | 80.1 | $1.23 |
| DeepSeek R1 Distill Qwen 32B | DeepSeek | 17.2 | 59.9 | $0.27 |
| GLM-4.6V (Non-reasoning) | Z AI | 17.1 | 21.6 | $0.45 |
| Qwen3 235B A22B (Non-reasoning) | Alibaba | 17.0 | 61.0 | $1.23 |
| Magistral Small 1.2 | Mistral | 16.8 | 0.0 | Free |
| Magistral Small 1 | Mistral | 16.8 | 0.0 | Free |
| Gemini 2.0 Flash (experimental) | Google | 16.8 | 0.0 | Free |
| EXAONE 4.0 32B (Reasoning) | LG AI Research | 16.7 | 0.0 | $0.70 |
| Qwen3 VL 8B (Reasoning) | Alibaba | 16.7 | 136.8 | $0.66 |
| Nova 2.0 Omni (Non-reasoning) | Amazon | 16.6 | 228.4 | $0.85 |
| DeepSeek V3 (Dec '24) | DeepSeek | 16.5 | 0.0 | $0.63 |
| Qwen3 32B (Reasoning) | Alibaba | 16.5 | 57.4 | $2.63 |
| DeepSeek R1 0528 Qwen3 8B | DeepSeek | 16.4 | 0.0 | Free |
| Qwen3.5 2B (Reasoning) | Alibaba | 16.3 | 0.0 | Free |
| Qwen2.5 Max | Alibaba | 16.3 | 49.8 | $2.80 |
| Qwen3 14B (Reasoning) | Alibaba | 16.2 | 62.7 | $1.31 |
| Qwen3 VL 30B A3B Instruct | Alibaba | 16.1 | 127.1 | $0.35 |
| DeepSeek R1 Distill Llama 70B | DeepSeek | 16.0 | 61.9 | $0.88 |
| Hermes 4 - Llama-3.1 70B (Reasoning) | Nous Research | 16.0 | 83.4 | $0.20 |
| Solar Pro 2 (Preview) (Non-reasoning) | Upstage | 16.0 | 0.0 | Free |
| Ministral 3 14B | Mistral | 16.0 | 138.7 | $0.20 |
| Gemini 1.5 Pro (Sep '24) | Google | 16.0 | 0.0 | Free |
| Claude 3.5 Sonnet (Oct '24) | Anthropic | 15.9 | 0.0 | $6.00 |
| DeepSeek R1 Distill Qwen 14B | DeepSeek | 15.8 | 0.0 | Free |
| Falcon-H1R-7B | TII UAE | 15.8 | 0.0 | Free |
| Ling-flash-2.0 | InclusionAI | 15.7 | 69.1 | $0.25 |
| Qwen3 Omni 30B A3B (Reasoning) | Alibaba | 15.6 | 107.7 | $0.43 |
| Qwen2.5 Instruct 72B | Alibaba | 15.6 | 55.4 | Free |
| Sonar | Perplexity | 15.5 | 89.1 | $1.00 |
| Step3 VL 10B | StepFun | 15.4 | 0.0 | Free |
| Qwen3 30B A3B (Reasoning) | Alibaba | 15.3 | 62.9 | $0.75 |
| Devstral Small (Jul '25) | Mistral | 15.2 | 279.8 | $0.15 |
| QwQ 32B-Preview | Alibaba | 15.2 | 62.2 | $0.14 |
| Sonar Pro | Perplexity | 15.2 | 94.9 | $6.00 |
| GLM-4.5V (Reasoning) | Z AI | 15.1 | 47.1 | $0.90 |
| Mistral Large 2 (Nov '24) | Mistral | 15.1 | 45.7 | $3.00 |
| Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) | NVIDIA | 15.0 | 42.4 | $0.90 |
| ERNIE 4.5 300B A47B | Baidu | 15.0 | 26.6 | $0.48 |
| Qwen3 30B A3B 2507 Instruct | Alibaba | 15.0 | 89.9 | $0.35 |
| NVIDIA Nemotron Nano 12B v2 VL (Reasoning) | NVIDIA | 14.9 | 133.0 | $0.30 |
| Solar Pro 2 (Reasoning) | Upstage | 14.9 | 0.0 | Free |
| NVIDIA Nemotron Nano 9B V2 (Reasoning) | NVIDIA | 14.8 | 115.1 | $0.07 |
| Ministral 3 8B | Mistral | 14.8 | 183.0 | $0.15 |
| Qwen3.5 2B (Non-reasoning) | Alibaba | 14.7 | 0.0 | Free |
| Gemini 2.0 Flash-Lite (Feb '25) | Google | 14.7 | 0.0 | Free |
| Llama Nemotron Super 49B v1.5 (Non-reasoning) | NVIDIA | 14.6 | 83.7 | $0.17 |
| Mistral Small 3.1 | Mistral | 14.5 | 186.5 | $0.15 |
| Gemini 2.0 Flash-Lite (Preview) | Google | 14.5 | 0.0 | Free |
| GPT-4o (May '24) | OpenAI | 14.5 | 75.5 | $7.50 |
| Qwen3 32B (Non-reasoning) | Alibaba | 14.5 | 67.2 | $1.23 |
| Llama 3.3 Instruct 70B | Meta | 14.5 | 102.1 | $0.68 |
| Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) | NVIDIA | 14.4 | 0.0 | Free |
| Kimi Linear 48B A3B Instruct | Moonshot AI | 14.4 | 0.0 | Free |
| K2-V2 (low) | MBZUAI Institute of Foundation Models | 14.4 | 0.0 | Free |
| Llama 3.3 Nemotron Super 49B v1 (Non-reasoning) | NVIDIA | 14.3 | 0.0 | Free |
| Qwen3 VL 8B Instruct | Alibaba | 14.3 | 137.7 | $0.31 |
| Claude 3.5 Sonnet (June '24) | Anthropic | 14.2 | 0.0 | $6.00 |
| Qwen3 4B (Reasoning) | Alibaba | 14.2 | 103.9 | $0.40 |
| Llama 3.1 Tulu3 405B | Allen Institute for AI | 14.1 | 0.0 | Free |
| GPT-4o (ChatGPT) | OpenAI | 14.1 | 0.0 | Free |
| Ring-flash-2.0 | InclusionAI | 14.0 | 83.0 | $0.25 |
| Pixtral Large | Mistral | 14.0 | 56.8 | $3.00 |
| Grok 2 (Dec '24) | xAI | 13.9 | 0.0 | Free |
| OLMo 3.1 32B Think | Allen Institute for AI | 13.9 | 93.4 | Free |
| GPT-5 nano (minimal) | OpenAI | 13.8 | 129.4 | $0.14 |
| Gemini 1.5 Flash (Sep '24) | Google | 13.8 | 0.0 | Free |
| GPT-4 Turbo | OpenAI | 13.7 | 32.1 | $15.00 |
| Qwen3 VL 4B (Reasoning) | Alibaba | 13.7 | 0.0 | Free |
| Solar Pro 2 (Non-reasoning) | Upstage | 13.6 | 0.0 | Free |
| Command A | Cohere | 13.5 | 52.5 | $4.38 |
| Llama 4 Scout | Meta | 13.5 | 128.2 | $0.29 |
| Nova Pro | Amazon | 13.5 | 0.0 | $1.40 |
| Llama 3.1 Nemotron Instruct 70B | NVIDIA | 13.4 | 36.5 | $1.20 |
| Grok Beta | xAI | 13.3 | 0.0 | Free |
| Qwen3 8B (Reasoning) | Alibaba | 13.2 | 80.5 | $0.66 |
| NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning) | NVIDIA | 13.2 | 202.7 | $0.09 |
| Qwen2.5 Instruct 32B | Alibaba | 13.2 | 0.0 | Free |
| GPT-4.1 nano | OpenAI | 13.0 | 139.7 | $0.17 |
| Mistral Large 2 (Jul '24) | Mistral | 13.0 | 0.0 | $3.00 |
| Qwen3 4B 2507 Instruct | Alibaba | 12.9 | 0.0 | Free |
| Qwen2.5 Coder Instruct 32B | Alibaba | 12.9 | 0.0 | $0.20 |
| GPT-4.1 mini | OpenAI | 12.8 | 41.8 | $37.50 |
| GPT-4.1 | OpenAI | 12.8 | 28.1 | $37.50 |
| Qwen3 14B (Non-reasoning) | Alibaba | 12.8 | 64.6 | $0.61 |
| GLM-4.5V (Non-reasoning) | Z AI | 12.7 | 47.4 | $0.90 |
| Gemini 2.5 Flash-Lite (Non-reasoning) | Google | 12.7 | 183.8 | $0.17 |
| Nova Lite | Amazon | 12.7 | 208.4 | $0.10 |
| Mistral Small 3 | Mistral | 12.7 | 234.1 | $0.15 |
| Mistral Small 3.2 | Mistral | 12.7 | 192.9 | $0.15 |
| GPT-4o mini | OpenAI | 12.6 | 54.9 | $0.26 |
| Hermes 4 - Llama-3.1 70B (Non-reasoning) | Nous Research | 12.6 | 84.7 | $0.20 |
| Claude 3 Opus | Anthropic | 12.5 | 0.0 | $30.00 |
| Llama 3.1 Instruct 70B | Meta | 12.5 | 31.8 | $0.56 |
| Qwen3 4B (Non-reasoning) | Alibaba | 12.5 | 105.0 | $0.19 |
| Qwen3 30B A3B (Non-reasoning) | Alibaba | 12.5 | 62.4 | $0.35 |
| DeepSeek-V2.5 | DeepSeek | 12.3 | 0.0 | Free |
| Claude 3 Haiku | Anthropic | 12.3 | 130.1 | $0.50 |
| Gemini 2.0 Flash Thinking Experimental (Dec '24) | Google | 12.3 | 0.0 | Free |
| DeepSeek-V2.5 (Dec '24) | DeepSeek | 12.3 | 0.0 | Free |
| Olmo 3.1 32B Instruct | Allen Institute for AI | 12.2 | 53.5 | $0.30 |
| DeepSeek R1 Distill Llama 8B | DeepSeek | 12.1 | 0.0 | Free |
| Mistral Saba | Mistral | 12.1 | 0.0 | Free |
| OLMo 3 32B Think | Allen Institute for AI | 12.1 | 0.0 | Free |
| Reka Flash (Sep '24) | Reka AI | 12.0 | 85.1 | $0.35 |
| Gemini 1.5 Pro (May '24) | Google | 12.0 | 0.0 | Free |
| R1 1776 | Perplexity | 12.0 | 0.0 | Free |
| Qwen2.5 Turbo | Alibaba | 12.0 | 67.4 | $0.09 |
| Solar Mini | Upstage | 11.9 | 93.5 | $0.15 |
| Llama 3.2 Instruct 90B (Vision) | Meta | 11.9 | 59.2 | $0.72 |
| Llama 3.1 Instruct 8B | Meta | 11.8 | 202.7 | $0.10 |
| EXAONE 4.0 32B (Non-reasoning) | LG AI Research | 11.7 | 0.0 | $0.70 |
| Grok-1 | xAI | 11.7 | 0.0 | Free |
| Qwen2 Instruct 72B | Alibaba | 11.7 | 0.0 | Free |
| Ministral 3 3B | Mistral | 11.2 | 247.8 | $0.10 |
| Gemini 1.5 Flash-8B | Google | 11.1 | 0.0 | Free |
| DeepHermes 3 - Mistral 24B Preview (Non-reasoning) | Nous Research | 10.9 | 0.0 | Free |
| Phi-4 Mini Instruct | Microsoft Azure | 10.9 | 44.1 | Free |
| Jamba 1.7 Large | AI21 Labs | 10.9 | 56.5 | $3.50 |
| Granite 4.0 H Small | IBM | 10.8 | 540.9 | $0.11 |
| Jamba 1.5 Large | AI21 Labs | 10.7 | 0.0 | $3.50 |
| Qwen3 Omni 30B A3B Instruct | Alibaba | 10.7 | 107.5 | $0.43 |
| OLMo 2 32B | Allen Institute for AI | 10.6 | 0.0 | Free |
| Hermes 3 - Llama-3.1 70B | Nous Research | 10.6 | 43.6 | $0.30 |
| Qwen3 8B (Non-reasoning) | Alibaba | 10.6 | 88.5 | $0.31 |
| Jamba 1.6 Large | AI21 Labs | 10.6 | 57.3 | $3.50 |
| DeepSeek-Coder-V2 | DeepSeek | 10.6 | 0.0 | Free |
| Gemini 1.5 Flash (May '24) | Google | 10.5 | 0.0 | Free |
| LFM2 24B A2B | Liquid AI | 10.5 | 251.2 | $0.05 |
| Phi-4 | Microsoft Azure | 10.4 | 7.5 | $0.22 |
| Nova Micro | Amazon | 10.3 | 330.0 | $0.06 |
| Gemma 3 27B Instruct | Google | 10.3 | 36.7 | Free |
| Claude 3 Sonnet | Anthropic | 10.3 | 0.0 | $6.00 |
| Mistral Small (Sep '24) | Mistral | 10.2 | 172.0 | $0.30 |
| Phi-3 Mini Instruct 3.8B | Microsoft Azure | 10.1 | 0.0 | $0.23 |
| Gemma 3n E4B Instruct Preview (May '25) | Google | 10.1 | 0.0 | Free |
| Gemini 1.0 Ultra | Google | 10.1 | 0.0 | Free |
| NVIDIA Nemotron Nano 12B v2 VL (Non-reasoning) | NVIDIA | 10.1 | 135.7 | $0.30 |
| Phi-4 Multimodal Instruct | Microsoft Azure | 10.0 | 16.8 | Free |
| Qwen2.5 Coder Instruct 7B | Alibaba | 10.0 | 0.0 | Free |
| Mistral Large (Feb '24) | Mistral | 9.9 | 0.0 | $6.00 |
| Mixtral 8x22B Instruct | Mistral | 9.8 | 0.0 | Free |
| Llama 3.2 Instruct 3B | Meta | 9.7 | 53.4 | $0.08 |
| Llama 2 Chat 7B | Meta | 9.7 | 123.9 | $0.10 |
| Jamba Reasoning 3B | AI21 Labs | 9.6 | 0.0 | Free |
| Qwen3 VL 4B Instruct | Alibaba | 9.6 | 0.0 | Free |
| Reka Flash 3 | Reka AI | 9.5 | 55.5 | $0.35 |
| Qwen1.5 Chat 110B | Alibaba | 9.5 | 0.0 | Free |
| OLMo 3 7B Think | Allen Institute for AI | 9.4 | 75.6 | $0.14 |
| OLMo 2 7B | Allen Institute for AI | 9.3 | 0.0 | Free |
| Claude 2.1 | Anthropic | 9.3 | 0.0 | Free |
| Molmo 7B-D | Allen Institute for AI | 9.2 | 0.0 | Free |
| Ling-mini-2.0 | InclusionAI | 9.2 | 177.8 | $0.12 |
| DeepSeek-V2-Chat | DeepSeek | 9.1 | 0.0 | Free |
| DeepSeek R1 Distill Qwen 1.5B | DeepSeek | 9.1 | 0.0 | Free |
| Claude 2.0 | Anthropic | 9.1 | 0.0 | Free |
| Mistral Medium 3 | Mistral | 9.0 | 99.3 | $4.09 |
| Qwen3.5 0.8B (Reasoning) | Alibaba | 9.0 | 0.0 | Free |
| GPT-3.5 Turbo | OpenAI | 9.0 | 107.7 | $0.75 |
| Mistral Medium 3.1 | Mistral | 9.0 | 108.5 | $4.09 |
| Mistral Small (Feb '24) | Mistral | 9.0 | 198.9 | $1.50 |
| Llama 3 Instruct 70B | Meta | 8.9 | 44.6 | $0.87 |
| Gemma 3 12B Instruct | Google | 8.8 | 36.1 | Free |
| Arctic Instruct | Snowflake | 8.8 | 0.0 | Free |
| LFM 40B | Liquid AI | 8.8 | 0.0 | Free |
| Qwen Chat 72B | Alibaba | 8.8 | 0.0 | Free |
| Llama 3.2 Instruct 11B (Vision) | Meta | 8.7 | 86.2 | $0.16 |
| PALM-2 | Google | 8.6 | 0.0 | Free |
| DeepSeek Coder V2 Lite Instruct | DeepSeek | 8.5 | 0.0 | Free |
| Gemini 1.0 Pro | Google | 8.5 | 0.0 | Free |
| DeepSeek LLM 67B Chat (V1) | DeepSeek | 8.4 | 0.0 | Free |
| Mistral Medium | Mistral | 8.4 | 87.2 | $4.09 |
| Llama 2 Chat 70B | Meta | 8.4 | 0.0 | Free |
| Llama 2 Chat 13B | Meta | 8.4 | 0.0 | Free |
| Exaone 4.0 1.2B (Reasoning) | LG AI Research | 8.3 | 0.0 | Free |
| OpenChat 3.5 (1210) | OpenChat | 8.3 | 0.0 | Free |
| DBRX Instruct | Databricks | 8.3 | 0.0 | Free |
| Command-R+ (Apr '24) | Cohere | 8.3 | 0.0 | $6.00 |
| OLMo 3 7B Instruct | Allen Institute for AI | 8.2 | 155.5 | $0.13 |
| Jamba 1.7 Mini | AI21 Labs | 8.1 | 0.0 | Free |
| Exaone 4.0 1.2B (Non-reasoning) | LG AI Research | 8.1 | 0.0 | Free |
| LFM2.5-1.2B-Thinking | Liquid AI | 8.1 | 0.0 | Free |
| LFM2 2.6B | Liquid AI | 8.0 | 0.0 | Free |
| LFM2.5-1.2B-Instruct | Liquid AI | 8.0 | 0.0 | Free |
| Jamba 1.5 Mini | AI21 Labs | 8.0 | 0.0 | $0.25 |
| Granite 4.0 H 1B | IBM | 8.0 | 0.0 | Free |
| Qwen3 1.7B (Reasoning) | Alibaba | 8.0 | 139.5 | $0.40 |
| Jamba 1.6 Mini | AI21 Labs | 7.9 | 183.9 | $0.25 |
| Gemma 3 270M | Google | 7.7 | 0.0 | Free |
| Granite 4.0 Micro | IBM | 7.7 | 0.0 | Free |
| Mixtral 8x7B Instruct | Mistral | 7.7 | 0.0 | $0.54 |
| DeepHermes 3 - Llama-3.1 8B Preview (Non-reasoning) | Nous Research | 7.6 | 0.0 | Free |
| Llama 65B | Meta | 7.4 | 0.0 | Free |
| Mistral 7B Instruct | Mistral | 7.4 | 169.0 | $0.25 |
| Command-R (Mar '24) | Cohere | 7.4 | 0.0 | $0.75 |
| Claude Instant | Anthropic | 7.4 | 0.0 | Free |
| Qwen Chat 14B | Alibaba | 7.4 | 0.0 | Free |
| Granite 4.0 1B | IBM | 7.3 | 0.0 | Free |
| Molmo2-8B | Allen Institute for AI | 7.3 | 137.1 | Free |
| LFM2 8B A1B | Liquid AI | 7.0 | 0.0 | Free |
| Granite 3.3 8B (Non-reasoning) | IBM | 7.0 | 25.3 | $0.09 |
| Qwen3 1.7B (Non-reasoning) | Alibaba | 6.8 | 139.3 | $0.19 |
| Qwen3 0.6B (Reasoning) | Alibaba | 6.5 | 225.9 | $0.40 |
| Llama 3 Instruct 8B | Meta | 6.4 | 85.1 | $0.07 |
| Llama 3.2 Instruct 1B | Meta | 6.3 | 164.8 | $0.10 |
| LFM2 1.2B | Liquid AI | 6.3 | 0.0 | Free |
| Gemma 3 4B Instruct | Google | 6.3 | 38.3 | Free |
| Gemma 3n E4B Instruct | Google | 6.3 | 61.8 | $0.03 |
| LFM2.5-VL-1.6B | Liquid AI | 6.2 | 0.0 | Free |
| Granite 4.0 350M | IBM | 6.1 | 0.0 | Free |
| Qwen3 0.6B (Non-reasoning) | Alibaba | 5.7 | 225.4 | $0.19 |
| Gemma 3 1B Instruct | Google | 5.5 | 55.6 | Free |
| Granite 4.0 H 350M | IBM | 5.4 | 0.0 | Free |
| Gemma 3n E2B Instruct | Google | 4.8 | 52.4 | Free |
| Tiny Aya Global | Cohere | 4.7 | 0.0 | Free |
| GPT-4o mini Realtime (Dec '24) | OpenAI | 0.0 | 0.0 | Free |
| Grok Voice Agent | xAI | 0.0 | 0.0 | Free |
| GPT-4o Realtime (Dec '24) | OpenAI | 0.0 | 0.0 | Free |
| Cogito v2.1 (Reasoning) | Deep Cogito | 0.0 | 99.2 | $1.25 |
| GPT-5.4 Pro (xhigh) | OpenAI | 0.0 | 0.0 | $67.50 |
| GPT-3.5 Turbo (0613) | OpenAI | 0.0 | 0.0 | Free |