What is the Price-Performance score?

It's a custom metric we calculate: quality score divided by the total cost per 1M tokens (input + output). Higher is better. It measures how much quality you get per dollar spent. Free models get the maximum score.

Why don't you include every model?

We curate models that are actually useful for AI agent workloads. We exclude deprecated models, models with extremely limited availability, and models that don't offer meaningful advantages over alternatives in their price range.

How often is this data updated?

We aim for monthly updates or sooner when major models are released. AI model pricing and performance change frequently. Check the 'Last updated' date at the top of the page.

Explore Tools

AI Tools

AI Model Benchmarks

Compare AI models by quality, cost, speed, and context window. Pick the right model for your workload in minutes.

20 models trackedUpdated February 2026

Quick Picks

Best for Coding

GPT-5.5

OpenAI · Score 60 · $35.00/1M tokens

Best value:

DeepSeek R1 (Free)

Best for Writing

GPT-5.5

OpenAI · Score 60 · $35.00/1M tokens

Best value:

Llama 3.3 70B (Free)

Best for Analysis

GPT-5.5

OpenAI · Score 60 · $35.00/1M tokens

Best value:

DeepSeek R1 (Free)

Cheapest for Email

Llama 3.3 70B (Free)

Meta · Score 14 · Free/1M tokens

Filter:

#	Model	Tier	Quality	Price (In/Out)	Speed	Context	Value Score
1	GPT-5.5 OpenAI	Frontier	60	$5.00 / $30.00	81 tok/s	1.1M	2
2	Claude Opus 4.7 Anthropic	Frontier	57	$5.00 / $25.00	27 tok/s	1.0M	2
3	Gemini 3.1 Pro Google	Frontier	57	$2.50 / $15.00	113 tok/s	1.0M	3
4	GPT-5.4 OpenAI	Premium	57	$2.50 / $15.00	61 tok/s	1.1M	3
5	GPT-5.3 OpenAI	Premium	54	$2.00 / $10.00	80 tok/s	400K	5
6	Grok 4 xAI	Premium	53	$3.00 / $15.00	75 tok/s	2.0M	3
7	Claude Sonnet 4.6 Anthropic	Premium	52	$3.00 / $15.00	50 tok/s	1.0M	3
8	Claude Sonnet 4.5 Anthropic	Premium	48	$3.00 / $15.00	70 tok/s	1.0M	3
9	MiniMax M2.1 MiniMax	Mid-Range	48	$0.28 / $1.20	110 tok/s	1.0M	32
10	Gemini 3 Pro Google	Premium	46	$2.00 / $12.00	90 tok/s	1.0M	3
11	Gemini 3 Flash Google	Budget	46	$0.07 / $0.30	160 tok/s	1.0M	123
12	Claude Sonnet 4.1 Anthropic	Mid-Range	44	$3.00 / $15.00	80 tok/s	200K	2
13	KAT-Coder-Pro V2 KwaiPilot	Mid-Range	44	$0.30 / $1.20	100 tok/s	256K	29
14	GPT-5 OpenAI	Mid-Range	42	$1.25 / $10.00	100 tok/s	400K	4
15	DeepSeek V3.2 DeepSeek	Mid-Range	40	$0.25 / $0.38	120 tok/s	128K	63
16	GPT-4o Mini OpenAI	Budget	38	$0.15 / $0.60	180 tok/s	128K	51
17	Claude Haiku 4.5 Anthropic	Budget	37	$1.00 / $5.00	95 tok/s	200K	6
18	DeepSeek R1 (Free) DeepSeek	Free	27	Free / Free	40 tok/s	64K	270
19	Qwen 2.5 VL 72B (Free) Alibaba	Free	15	Free / Free	50 tok/s	128K	150
20	Llama 3.3 70B (Free) Meta	Free	14	Free / Free	80 tok/s	128K	140

Explore by Task

Best for Coding

Code generation, debugging, refactoring, code review

Best for Writing

Blog posts, marketing copy, content creation

Best for Analysis

Data analysis, competitive research, report generation

Best for Research

Web research, fact-checking, information synthesis

Best for Email

Email drafting, inbox triage, reply generation

Best for Summarization

Document summaries, meeting notes, content condensing

Best for Image Analysis

Image description, OCR, visual understanding

Best for Math & Reasoning

Complex calculations, logic puzzles, mathematical proofs

Best for Creative

Brainstorming, storytelling, creative writing

Best for General Chat

Q&A, casual conversation, simple tasks

FAQ

Frequently Asked Questions

Quality scores come from Artificial Analysis Intelligence Index, an independent benchmark. Pricing is from OpenRouter and provider APIs. Speed data is from Artificial Analysis performance benchmarks. Data is refreshed on each site build.

Related Tools

Explore more tools

Free AI optimization and data conversion tools.

All Tools

AI Model Benchmarks

Quick Picks

Explore by Task

Frequently Asked Questions

AI Model Selector

Cost Calculator

Config Generator

Explore more tools

AI Model Benchmarks

Quick Picks

Explore by Task

Frequently Asked Questions

AI Model Selector

Cost Calculator

Config Generator

Explore more tools

AI Model Benchmarks

Quick Picks

Explore by Task

Frequently Asked Questions

Where does the benchmark data come from?

What is the Price-Performance score?

Why don't you include every model?

How often is this data updated?

AI Model Selector

Cost Calculator

Config Generator

Explore more tools

AI Model Benchmarks

Quick Picks

Explore by Task

Frequently Asked Questions

Where does the benchmark data come from?

What is the Price-Performance score?

Why don't you include every model?

How often is this data updated?

AI Model Selector

Cost Calculator

Config Generator

Explore more tools