Question 1

How accurate is the token count?

Accepted Answer

The counter uses a BPE-style heuristic that blends character count and word count. It lands within roughly 10% of the real tokenizer for natural English text, code and mixed content. For absolute precision, use the vendor's own tokenizer (tiktoken for OpenAI, Anthropic SDK, etc.).

Question 2

Why do output tokens dominate the bill?

Accepted Answer

Output tokens are typically priced 3 to 5 times higher than input tokens. Even a small output ratio matters — set the slider to match your real traffic pattern (most agents are 80/20 input-heavy, chat is closer to 50/50).

Question 3

Does this tool send my text anywhere?

Accepted Answer

No. Everything runs in your browser. Open DevTools → Network and paste anything — no requests fire.

Question 4

Why does the cost vary so much between vendors?

Accepted Answer

Frontier models can be 100x more expensive than cheap-bulk models for the same prompt. Most production loads do not need flagship reasoning for every request — pair a workhorse (Sonnet, GPT-5, Gemini Pro) with a cheap fast model (Haiku, GPT-5 mini, Gemini Flash) and route per task.

Model	$ Input /1M	$ Output /1M	Input cost	Output cost	Per request	× 1 requests
Claude Opus 4.7 Anthropic	$15	$75	$0	$0	$0	$0
Claude Sonnet 4.6 Anthropic	$3	$15	$0	$0	$0	$0
Claude Haiku 4.5 Anthropic	$1	$5	$0	$0	$0	$0
GPT-5 OpenAI	$10	$30	$0	$0	$0	$0
GPT-5 mini OpenAI	$0.5	$2	$0	$0	$0	$0
o3 OpenAI	$20	$80	$0	$0	$0	$0
Gemini 2.5 Pro Google	$2.5	$15	$0	$0	$0	$0
Gemini 2.5 Flash Google	$0.3	$1.2	$0	$0	$0	$0
Llama 4 Maverick Meta	$0.5	$1.5	$0	$0	$0	$0
Llama 4 Scout Meta	$0.2	$0.6	$0	$0	$0	$0
DeepSeek R1 DeepSeek	$0.55	$2.19	$0	$0	$0	$0
DeepSeek V3 DeepSeek	$0.27	$1.1	$0	$0	$0	$0
Grok 4 xAI	$5	$15	$0	$0	$0	$0
Mistral Large 2 Mistral	$2	$6	$0	$0	$0	$0

LLM Token Counter & Cost Estimator

Cost across 14 models

How this counter works

What to watch in your bill

Frequently Asked Questions