Pricing WatchFebruary 3, 20254 min read

Every AI pricing change in January 2025, tracked

Seven providers changed prices in January alone. Anthropic dropped Claude 3.5 Haiku's price. Google cut Gemini Flash. I updated the master table. The cheapest frontier-class model is now $0.10 per million input tokens.

January 2025 was a busy month for AI pricing. Seven providers made changes. I updated my master comparison table.

Quick summary: everything got cheaper. Again.

January 2025 price changes

| Provider | Model | Direction | Old price (input/M) | New price (input/M) | Change | |----------|-------|-----------|---------------------|---------------------|--------| | Anthropic | Claude 3.5 Haiku | Down | $0.80 | $0.80 | Held (output cut to $4) | | Google | Gemini 2.0 Flash | New | N/A | $0.10 | New model | | OpenAI | GPT-4o mini | Held | $0.15 | $0.15 | No change | | DeepSeek | DeepSeek V3 | New | N/A | $0.27 | New model | | DeepSeek | DeepSeek R1 | New | N/A | $0.55 | New model | | Mistral AI | Mistral Small | Down | $0.20 | $0.10 | -50% | | Groq | Various | Down | Varies | Varies | ~20-30% cuts |

Sources: Provider pricing pages and changelogs, January 2025.

The big news: Gemini 2.0 Flash at $0.10 per million input tokens. That's a frontier-capable model for essentially nothing. DeepSeek V3 at $0.27/M is almost as cheap and performs at a higher tier.

Mistral quietly cut Mistral Small by 50%. Groq reduced inference pricing across most of their hosted models by 20-30%.

The master pricing table (February 2025)

| Model | Provider | Input/M | Output/M | Quality tier | |-------|----------|---------|----------|-------------| | Gemini 2.0 Flash | Google | $0.10 | $0.40 | Mid-high | | Mistral Small | Mistral AI | $0.10 | $0.30 | Mid | | GPT-4o mini | OpenAI | $0.15 | $0.60 | Mid | | DeepSeek V3 | DeepSeek | $0.27 | $1.10 | High | | DeepSeek R1 | DeepSeek | $0.55 | $2.19 | High (reasoning) | | Claude 3.5 Haiku | Anthropic | $0.80 | $4.00 | Mid-high | | GPT-4o | OpenAI | $2.50 | $10.00 | High | | Claude 3.5 Sonnet | Anthropic | $3.00 | $15.00 | Highest | | Grok 3 | xAI | $3.00 | $15.00 | High | | Gemini 2.0 Pro | Google | $3.50 | $10.50 | High | | Claude 3 Opus | Anthropic | $15.00 | $75.00 | Legacy flagship | | GPT-4 Turbo | OpenAI | $10.00 | $30.00 | Legacy |

Sources: Anthropic, OpenAI, Google, DeepSeek, Mistral AI, Groq, xAI.

The spread between cheapest and most expensive: 150x on input tokens ($0.10 vs $15.00). A year ago, the spread was about 80x.

What $0.10 per million tokens actually means

Let me put Gemini 2.0 Flash's pricing in context:

| Use case | Tokens | Cost at $0.10/M | |----------|--------|-----------------| | Summarize a 10-page document | ~5,000 | $0.0005 | | Classify 1,000 emails | ~200,000 | $0.02 | | Generate 100 product descriptions | ~50,000 | $0.005 | | Process a full book | ~200,000 | $0.02 |

Two cents to process an entire book. Five hundredths of a cent to summarize a document.

At these prices, the cost of the API call is essentially a rounding error. Infrastructure, networking, and application logic cost more than the AI inference itself.

The deflation trend

| Date | Cheapest frontier-capable model | Input price/M | |------|-------------------------------|---------------| | Mar 2023 | GPT-3.5 Turbo | $2.00 | | Nov 2023 | GPT-3.5 Turbo (updated) | $1.00 | | Jul 2024 | GPT-4o mini | $0.15 | | Dec 2024 | Gemini 2.0 Flash | $0.10 | | Jan 2025 | Gemini 2.0 Flash / Mistral Small | $0.10 |

Sources: Historical pricing pages, provider announcements.

In 22 months, the cheapest frontier-capable model went from $2.00/M to $0.10/M. A 20x reduction.

And "frontier-capable" isn't what it used to be. Gemini 2.0 Flash at $0.10/M is comparable in quality to what GPT-4 was at $30/M two years ago.

The price floor keeps dropping. I keep updating the spreadsheet. My "cheapest useful model" cell has been overwritten 11 times in the last 18 months.


If you found this interesting, you might also like:

-- dataku

More from dataku