| Gemma 3n E4B Instruct | Google | $0.03 | 6.3 | 252.0 |
| LFM2 24B A2B | Liquid AI | $0.05 | 10.5 | 201.9 |
| Nova Micro | Amazon | $0.06 | 10.3 | 168.9 |
| NVIDIA Nemotron Nano 9B V2 (Reasoning) | NVIDIA | $0.07 | 14.8 | 211.4 |
| Llama 3 Instruct 8B | Meta | $0.07 | 6.4 | 91.4 |
| Devstral Small (May '25) | Mistral | $0.07 | 18.0 | 240.0 |
| Llama 3.2 Instruct 3B | Meta | $0.08 | 9.7 | 121.2 |
| Granite 3.3 8B (Non-reasoning) | IBM | $0.09 | 7.0 | 82.4 |
| NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning) | NVIDIA | $0.09 | 13.2 | 151.7 |
| Qwen2.5 Turbo | Alibaba | $0.09 | 12.0 | 137.9 |
| gpt-oss-20B (high) | OpenAI | $0.09 | 24.5 | 260.6 |
| gpt-oss-20B (low) | OpenAI | $0.09 | 20.8 | 221.3 |
| Llama 3.1 Instruct 8B | Meta | $0.10 | 11.8 | 118.0 |
| Ministral 3 3B | Mistral | $0.10 | 11.2 | 112.0 |
| Llama 2 Chat 7B | Meta | $0.10 | 9.7 | 97.0 |
| Llama 3.2 Instruct 1B | Meta | $0.10 | 6.3 | 63.0 |
| NVIDIA Nemotron Nano 9B V2 (Non-reasoning) | NVIDIA | $0.10 | 18.8 | 184.3 |
| NVIDIA Nemotron 3 Nano 30B A3B (Reasoning) | NVIDIA | $0.10 | 24.3 | 231.4 |
| Nova Lite | Amazon | $0.10 | 12.7 | 121.0 |
| Granite 4.0 H Small | IBM | $0.11 | 10.8 | 100.9 |
| Qwen3.5 9B (Reasoning) | Alibaba | $0.11 | 32.4 | 286.7 |
| Ling-mini-2.0 | InclusionAI | $0.12 | 9.2 | 75.4 |
| OLMo 3 7B Instruct | Allen Institute for AI | $0.13 | 8.2 | 65.6 |
| QwQ 32B-Preview | Alibaba | $0.14 | 15.2 | 112.6 |
| GPT-5 nano (high) | OpenAI | $0.14 | 26.8 | 194.2 |
| GPT-5 nano (medium) | OpenAI | $0.14 | 25.9 | 187.7 |
| GPT-5 nano (minimal) | OpenAI | $0.14 | 13.8 | 100.0 |
| OLMo 3 7B Think | Allen Institute for AI | $0.14 | 9.4 | 67.1 |
| MiMo-V2-Flash (Feb 2026) | Xiaomi | $0.15 | 41.5 | 276.7 |
| MiMo-V2-Flash (Reasoning) | Xiaomi | $0.15 | 39.2 | 261.3 |
| Step 3.5 Flash | StepFun | $0.15 | 37.8 | 252.0 |
| MiMo-V2-Flash (Non-reasoning) | Xiaomi | $0.15 | 30.4 | 202.7 |
| Devstral Small (Jul '25) | Mistral | $0.15 | 15.2 | 101.3 |
| Ministral 3 8B | Mistral | $0.15 | 14.8 | 98.7 |
| Mistral Small 3.1 | Mistral | $0.15 | 14.5 | 96.7 |
| Mistral Small 3 | Mistral | $0.15 | 12.7 | 84.7 |
| Mistral Small 3.2 | Mistral | $0.15 | 12.7 | 84.7 |
| Solar Mini | Upstage | $0.15 | 11.9 | 79.3 |
| GLM-4.7-Flash (Reasoning) | Z AI | $0.15 | 30.1 | 198.0 |
| GLM-4.7-Flash (Non-reasoning) | Z AI | $0.15 | 22.1 | 145.4 |
| Llama 3.2 Instruct 11B (Vision) | Meta | $0.16 | 8.7 | 54.4 |
| Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning) | Google | $0.17 | 21.6 | 123.4 |
| Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning) | Google | $0.17 | 19.4 | 110.9 |
| Llama Nemotron Super 49B v1.5 (Reasoning) | NVIDIA | $0.17 | 18.7 | 106.9 |
| Gemini 2.5 Flash-Lite (Reasoning) | Google | $0.17 | 17.6 | 100.6 |
| Llama Nemotron Super 49B v1.5 (Non-reasoning) | NVIDIA | $0.17 | 14.6 | 83.4 |
| GPT-4.1 nano | OpenAI | $0.17 | 13.0 | 74.3 |
| Gemini 2.5 Flash-Lite (Non-reasoning) | Google | $0.17 | 12.7 | 72.6 |
| Qwen3 4B (Non-reasoning) | Alibaba | $0.19 | 12.5 | 66.5 |
| Qwen3 1.7B (Non-reasoning) | Alibaba | $0.19 | 6.8 | 36.2 |