LLM API Cost Calculator — Compare GPT, Claude, Gemini
Quick answer: Enter input/output tokens per call and daily call volume. Pick which models to compare. See per-call, daily, monthly, and annual cost for each.
Compare LLM API costs across all major models — GPT-4o, GPT-5, Claude Opus 4, Sonnet 4.5, Haiku 4.5, Gemini 2 Pro, and Flash. Enter your tokens per call and call volume to see the daily, monthly, and annual cost on each model side-by-side. Pricing is current as of 2025.
Last reviewed: April 2026
Models to compare
Cheapest model: gemini-2-flash
At your usage, gemini-2-flash costs $7.98/mo. The most expensive is claude-opus-4 at $1,824.00/mo.
| Model | Per Call (USD) | Daily (USD) | Monthly (USD) | Annual (USD) |
|---|---|---|---|---|
| gpt-5 | $0.0069 | $6.88 | $209.00 | $2,509.38 |
| gpt-4o | $0.0088 | $8.75 | $266.00 | $3,193.75 |
| gpt-4o-mini | $0.0005 | $0.53 | $15.96 | $191.63 |
| claude-opus-4 | $0.06 | $60.00 | $1,824.00 | $21,900.00 |
| claude-sonnet-4-5 | $0.01 | $12.00 | $364.80 | $4,380.00 |
| claude-haiku-4-5 | $0.0032 | $3.20 | $97.28 | $1,168.00 |
| gemini-2-pro | $0.0044 | $4.38 | $133.00 | $1,596.88 |
| gemini-2-flash | $0.0003 | $0.26 | $7.98 | $95.81 |
Monthly Cost by Model
[Bar chart: monthly cost per model, sorted cheapest to most expensive. Cheaper models can be 10-100x less than top-tier ones.]
How to Use This LLM API Cost Calculator
1. Enter average input tokens per call (typical: 500-3000).
2. Enter average output tokens per call (typical: 200-1500).
3. Enter calls per day (your expected production volume).
4. Select which models to compare.
5. Read per-call, daily, monthly, and annual cost for each model.
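The arithmetic behind the steps above is straightforward: cost per call is input and output tokens multiplied by each provider's per-million-token price, and daily volume scales that up. A minimal sketch in Python — the per-million-token prices in `PRICES` are illustrative placeholders, not current list prices, so check each provider's pricing page before relying on the numbers:

```python
# Sketch of the calculator's arithmetic. Prices are $ per 1M tokens and are
# ASSUMED values for illustration -- verify against provider pricing pages.
PRICES = {  # model: (input $/1M tokens, output $/1M tokens)
    "claude-opus-4": (15.00, 75.00),
    "gemini-2-flash": (0.10, 0.40),
}

def cost_per_call(model, input_tokens, output_tokens):
    """Cost in USD for a single API call."""
    in_price, out_price = PRICES[model]
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000

def projections(model, input_tokens, output_tokens, calls_per_day):
    """Per-call, daily, monthly (~30.4 days), and annual cost in USD."""
    per_call = cost_per_call(model, input_tokens, output_tokens)
    daily = per_call * calls_per_day
    return {
        "per_call": per_call,
        "daily": daily,
        "monthly": daily * 30.4,
        "annual": daily * 365,
    }

# Example: 2,000 input / 400 output tokens at 1,000 calls per day.
print(projections("claude-opus-4", 2000, 400, 1000))
```

With these assumed prices, 2,000 input and 400 output tokens at 1,000 calls/day yields $0.06 per call and $60/day, consistent with the claude-opus-4 row in the table above.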
Frequently Asked Questions
- **What is a token?** A token is roughly 3-4 characters of text, or about 0.75 of a word in English. "Hello world" is ~2 tokens; the exact split depends on the model's tokenizer.
- **Why does output cost more than input?** Generating output requires the model to run a full forward pass for each token it produces, while input tokens are processed in a single pass. Typical ratio: output costs 2-5x more per token than input.
- **Which model is cheapest for routine tasks?** For summarization, classification, and simple chat, Haiku 4.5 or Gemini Flash are 10-20x cheaper than top-tier models with surprisingly competitive quality. For complex reasoning, Opus or GPT-5 still pull ahead.
- **Does prompt caching change these numbers?** Anthropic, OpenAI, and Google all offer prompt caching that reduces input cost by 50-90% for repeated prompt prefixes. If you're sending the same system prompt thousands of times, this can cut bills by 80%+. The calculator does NOT factor in caching; assume roughly 50% input-cost savings if you use it.
- **Is self-hosting cheaper than the API?** At low volumes, APIs are almost always cheaper than self-hosting. Break-even for self-hosting (e.g., Llama 3.1 70B on H100 GPUs) is roughly 100M+ tokens per day; below that, APIs win on both cost and operational simplicity.
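Since the calculator does not factor in prompt caching, the discount can be folded into the same per-call arithmetic by hand. A minimal sketch — the $15/$75 prices, the 1,500-token cached prefix, and the conservative 50% discount are all illustrative assumptions, not provider-specific figures:

```python
def cost_with_caching(input_tokens, output_tokens, in_price, out_price,
                      cached_prefix_tokens, cache_discount=0.5):
    """Per-call cost in USD with a discount applied to a cached prefix.

    Prices are $ per 1M tokens. cache_discount=0.5 models the conservative
    50% savings mentioned above; real discounts vary by provider (50-90%).
    """
    uncached = input_tokens - cached_prefix_tokens
    input_cost = (uncached * in_price
                  + cached_prefix_tokens * in_price * (1 - cache_discount))
    return (input_cost + output_tokens * out_price) / 1_000_000

# 2,000 input / 400 output tokens, with a 1,500-token system prompt
# served from cache (illustrative prices, not current list prices):
full = cost_with_caching(2000, 400, 15.0, 75.0, cached_prefix_tokens=0)
cached = cost_with_caching(2000, 400, 15.0, 75.0, cached_prefix_tokens=1500)
print(full, cached)
```

Note that because output tokens dominate the bill at these assumed prices, halving the cost of a 1,500-token cached prefix trims the per-call total by under 20% here; caching pays off most when long, repeated prompts dwarf the output.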
Embed this calculator on your website:

```html
<iframe src="https://calqpro.com/calculators/llm-api-cost-calculator" width="100%" height="600" frameborder="0" title="CalQpro Calculator" loading="lazy"></iframe>
<p>Powered by <a href="https://calqpro.com">CalQpro</a></p>
```