AI Pricing Analysis — Compare API Costs Across Providers

Comprehensive pricing comparison for 283 AI models. Compare input/output costs and value metrics to find the best price-to-performance ratio.
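
The page does not define how the Value Score is computed. As a rough sketch, several listed rows are consistent with dividing the Intelligence score by the price per million tokens; rows that deviate slightly may reflect prices rounded to two decimals for display. The formula and the function name `value_score` below are assumptions for illustration, not a documented method.

```python
def value_score(intelligence: float, price_per_m_tokens: float) -> float:
    """Assumed formula: intelligence points per dollar per million tokens."""
    if price_per_m_tokens <= 0:
        raise ValueError("price must be positive")
    return round(intelligence / price_per_m_tokens, 1)


# (model, intelligence, displayed price per million tokens in USD),
# copied from rows of the pricing table.
rows = [
    ("Llama 3.1 Instruct 8B", 11.8, 0.10),
    ("MiMo-V2-Flash (Feb 2026)", 41.5, 0.15),
    ("Step 3.5 Flash", 37.8, 0.15),
]

# Rank by the assumed price-to-performance ratio, best first.
ranked = sorted(rows, key=lambda r: value_score(r[1], r[2]), reverse=True)
for name, intelligence, price in ranked:
    print(f"{name}: {value_score(intelligence, price)}")
```

Because the displayed prices are rounded, recomputing the score this way will not match every listed Value Score exactly.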

AI Model Pricing (sorted by cost)

| Model | Provider | Price/M Tokens | Intelligence | Value Score |
|---|---|---|---|---|
| Gemma 3n E4B Instruct | Google | $0.03 | 6.3 | 252.0 |
| LFM2 24B A2B | Liquid AI | $0.05 | 10.5 | 201.9 |
| Nova Micro | Amazon | $0.06 | 10.3 | 168.9 |
| NVIDIA Nemotron Nano 9B V2 (Reasoning) | NVIDIA | $0.07 | 14.8 | 211.4 |
| Llama 3 Instruct 8B | Meta | $0.07 | 6.4 | 91.4 |
| Devstral Small (May '25) | Mistral | $0.07 | 18.0 | 240.0 |
| Llama 3.2 Instruct 3B | Meta | $0.08 | 9.7 | 121.2 |
| Granite 3.3 8B (Non-reasoning) | IBM | $0.09 | 7.0 | 82.4 |
| NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning) | NVIDIA | $0.09 | 13.2 | 151.7 |
| Qwen2.5 Turbo | Alibaba | $0.09 | 12.0 | 137.9 |
| gpt-oss-20B (high) | OpenAI | $0.09 | 24.5 | 260.6 |
| gpt-oss-20B (low) | OpenAI | $0.09 | 20.8 | 221.3 |
| Llama 3.1 Instruct 8B | Meta | $0.10 | 11.8 | 118.0 |
| Ministral 3 3B | Mistral | $0.10 | 11.2 | 112.0 |
| Llama 2 Chat 7B | Meta | $0.10 | 9.7 | 97.0 |
| Llama 3.2 Instruct 1B | Meta | $0.10 | 6.3 | 63.0 |
| NVIDIA Nemotron Nano 9B V2 (Non-reasoning) | NVIDIA | $0.10 | 18.8 | 184.3 |
| NVIDIA Nemotron 3 Nano 30B A3B (Reasoning) | NVIDIA | $0.10 | 24.3 | 231.4 |
| Nova Lite | Amazon | $0.10 | 12.7 | 121.0 |
| Granite 4.0 H Small | IBM | $0.11 | 10.8 | 100.9 |
| Qwen3.5 9B (Reasoning) | Alibaba | $0.11 | 32.4 | 286.7 |
| Ling-mini-2.0 | InclusionAI | $0.12 | 9.2 | 75.4 |
| OLMo 3 7B Instruct | Allen Institute for AI | $0.13 | 8.2 | 65.6 |
| QwQ 32B-Preview | Alibaba | $0.14 | 15.2 | 112.6 |
| GPT-5 nano (high) | OpenAI | $0.14 | 26.8 | 194.2 |
| GPT-5 nano (medium) | OpenAI | $0.14 | 25.9 | 187.7 |
| GPT-5 nano (minimal) | OpenAI | $0.14 | 13.8 | 100.0 |
| OLMo 3 7B Think | Allen Institute for AI | $0.14 | 9.4 | 67.1 |
| MiMo-V2-Flash (Feb 2026) | Xiaomi | $0.15 | 41.5 | 276.7 |
| MiMo-V2-Flash (Reasoning) | Xiaomi | $0.15 | 39.2 | 261.3 |
| Step 3.5 Flash | StepFun | $0.15 | 37.8 | 252.0 |
| MiMo-V2-Flash (Non-reasoning) | Xiaomi | $0.15 | 30.4 | 202.7 |
| Devstral Small (Jul '25) | Mistral | $0.15 | 15.2 | 101.3 |
| Ministral 3 8B | Mistral | $0.15 | 14.8 | 98.7 |
| Mistral Small 3.1 | Mistral | $0.15 | 14.5 | 96.7 |
| Mistral Small 3 | Mistral | $0.15 | 12.7 | 84.7 |
| Mistral Small 3.2 | Mistral | $0.15 | 12.7 | 84.7 |
| Solar Mini | Upstage | $0.15 | 11.9 | 79.3 |
| GLM-4.7-Flash (Reasoning) | Z AI | $0.15 | 30.1 | 198.0 |
| GLM-4.7-Flash (Non-reasoning) | Z AI | $0.15 | 22.1 | 145.4 |
| Llama 3.2 Instruct 11B (Vision) | Meta | $0.16 | 8.7 | 54.4 |
| Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning) | Google | $0.17 | 21.6 | 123.4 |
| Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning) | Google | $0.17 | 19.4 | 110.9 |
| Llama Nemotron Super 49B v1.5 (Reasoning) | NVIDIA | $0.17 | 18.7 | 106.9 |
| Gemini 2.5 Flash-Lite (Reasoning) | Google | $0.17 | 17.6 | 100.6 |
| Llama Nemotron Super 49B v1.5 (Non-reasoning) | NVIDIA | $0.17 | 14.6 | 83.4 |
| GPT-4.1 nano | OpenAI | $0.17 | 13.0 | 74.3 |
| Gemini 2.5 Flash-Lite (Non-reasoning) | Google | $0.17 | 12.7 | 72.6 |
| Qwen3 4B (Non-reasoning) | Alibaba | $0.19 | 12.5 | 66.5 |
| Qwen3 1.7B (Non-reasoning) | Alibaba | $0.19 | 6.8 | 36.2 |