
What Are Tokens?

Tokens are the units that measure AI model usage. Think of them as the “currency” that AI models consume when they process your requests.

Simple analogy: If AI models were cars, tokens would be gallons of gas. Different cars (models) use different amounts of gas (tokens) to go the same distance.

How to Think About Tokens

  • 1 token ≈ 3/4 of a word in English
  • “Hello, how are you?” = approximately 6 tokens
  • A typical email (200 words) = approximately 265 tokens
  • A full page of text (500 words) = approximately 665 tokens
Both your question AND the AI’s answer consume tokens. You pay for both.
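
The rule of thumb above can be turned into a quick estimator. This is only a rough sketch: it assumes English text and the 3/4-word ratio, `estimate_tokens` is a hypothetical helper name, and real tokenizers differ from model to model.

```python
def estimate_tokens(word_count: int, words_per_token: float = 0.75) -> int:
    """Rough token estimate from an English word count (heuristic only)."""
    if word_count < 0:
        raise ValueError("word_count must be non-negative")
    return round(word_count / words_per_token)


email_tokens = estimate_tokens(200)  # a typical 200-word email
page_tokens = estimate_tokens(500)   # a full 500-word page
print(email_tokens, page_tokens)
```

The results land within a couple of tokens of the rounded figures in the list above. Remember to estimate both the question and the expected answer, since both are billed.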

Why Different Models Cost Different Amounts

Just like cars, AI models come in different sizes with different capabilities:

Economy Models

0.04 to 0.15 USD per 1M tokens
Like a compact car - efficient for everyday tasks.
Best for:
  • Simple questions
  • Categorization
  • Basic summaries
  • High-volume tasks

Standard Models

0.60 to 3.00 USD per 1M tokens
Like a sedan - reliable for most needs.
Best for:
  • Business emails
  • Reports
  • Research
  • General conversations

Premium Models

10.00 to 75.00 USD per 1M tokens
Like a luxury car - maximum capability.
Best for:
  • Critical decisions
  • Complex analysis
  • Strategic planning
  • Maximum accuracy
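
To see what these ranges mean in practice, the sketch below converts a token count into a low and high cost for each tier. It assumes a single flat rate per 1M tokens covering both questions and answers, as the ranges above are stated; the names `TIER_PRICE_RANGES` and `cost_range_usd` are illustrative, not part of WonkaChat.

```python
# Price ranges in USD per 1M tokens, copied from the three tiers above.
TIER_PRICE_RANGES = {
    "economy": (0.04, 0.15),
    "standard": (0.60, 3.00),
    "premium": (10.00, 75.00),
}


def cost_range_usd(total_tokens: int, tier: str) -> tuple[float, float]:
    """Return the (low, high) cost in USD for a token count in a given tier."""
    low_rate, high_rate = TIER_PRICE_RANGES[tier]
    millions = total_tokens / 1_000_000
    return (millions * low_rate, millions * high_rate)


# Example: 2 million tokens (questions plus answers) in each tier.
for tier in TIER_PRICE_RANGES:
    low, high = cost_range_usd(2_000_000, tier)
    print(f"{tier}: {low:.2f} to {high:.2f} USD")
```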

Available Models in WonkaChat

OpenAI Models

Model | Best For
GPT-5.2 | Most advanced general tasks
GPT-5 | High-quality responses

GPT-5 series offers cutting-edge performance for complex tasks.

Model | Best For
GPT-4o | High-quality responses, complex reasoning
GPT-4o Mini | General tasks, best value

GPT-4o Mini is recommended for most business tasks - excellent balance of quality and cost.

Anthropic Claude Models

Model | Best For
Claude Sonnet 4.6 | Latest version, enhanced performance
Claude Sonnet 4.5 | Balanced performance
Claude Sonnet 4 | Complex reasoning and instructions

Claude Sonnet models excel at following complex instructions and nuanced tasks.

Model | Best For
Claude Haiku 4.5 | Fast, efficient tasks
Claude Opus 4.5 | Maximum capability and accuracy

Google Gemini Models

Model | Best For
Gemini 3 Pro Preview | Next-gen complex reasoning
Gemini 3 Flash Preview | Next-gen fast tasks

Gemini 3 models are preview versions with cutting-edge capabilities.

Model | Best For
Gemini 2.5 Pro | Complex reasoning tasks
Gemini 2.5 Flash | Fast, cost-effective tasks

Gemini 2.5 Flash offers excellent performance at very competitive pricing.

Mistral AI Models

Model | Best For
Mistral Large Latest | Advanced tasks requiring high capability
Mistral Medium Latest | Balanced performance for general use

Mistral “Latest” models automatically update to the newest versions, ensuring you always have access to improvements.

Which Model Should You Use?

1. Start with the question: How important is this task?

Low stakes, high volume:
  • Customer FAQs
  • Simple categorization
  • Basic summaries
  • Internal notes
→ Use Economy Models: GPT-4o Mini, Gemini 2.5 Flash, Claude Haiku 4.5

Medium importance:
  • Customer-facing emails
  • Internal reports
  • Research tasks
  • General business tasks
→ Use Standard Models: GPT-4o, Claude Sonnet 4.6, Gemini 2.5 Pro

High stakes, critical:
  • Client proposals
  • Strategic decisions
  • Complex analysis
  • Sensitive communications
→ Use Premium Models: GPT-5.2, Claude Opus 4.5, Gemini 3 Pro Preview
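
The mapping above is essentially a lookup table. As a minimal sketch, assuming hypothetical stakes labels `low`, `medium`, and `high` (the function and dictionary names are illustrative only), it could be encoded like this:

```python
# Tier suggestions copied from the guidance above; the keys are hypothetical labels.
MODELS_BY_STAKES = {
    "low": ("Economy", ["GPT-4o Mini", "Gemini 2.5 Flash", "Claude Haiku 4.5"]),
    "medium": ("Standard", ["GPT-4o", "Claude Sonnet 4.6", "Gemini 2.5 Pro"]),
    "high": ("Premium", ["GPT-5.2", "Claude Opus 4.5", "Gemini 3 Pro Preview"]),
}


def suggest_models(stakes: str) -> tuple[str, list[str]]:
    """Look up the suggested tier and example models for a stakes level."""
    try:
        return MODELS_BY_STAKES[stakes.lower()]
    except KeyError:
        raise ValueError(f"unknown stakes level: {stakes!r}") from None


tier, models = suggest_models("medium")
print(f"{tier} models: {', '.join(models)}")
```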
2. Consider your volume

How many requests per day?
  • High volume (1,000+ per day): Every cent matters → Use economy models
  • Medium volume (100-1,000 per day): Balance quality and cost → Use standard models
  • Low volume (less than 100 per day): Quality over cost → Use premium models where needed
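
The volume thresholds can be expressed as a small function. This sketch assumes that exactly 1,000 requests per day counts as high volume, since the ranges above overlap at that value; `tier_for_volume` is a hypothetical name.

```python
def tier_for_volume(requests_per_day: int) -> str:
    """Map daily request volume to the tier suggested above."""
    if requests_per_day < 0:
        raise ValueError("requests_per_day must be non-negative")
    if requests_per_day >= 1000:  # assumption: the boundary counts as high volume
        return "economy"
    if requests_per_day >= 100:
        return "standard"
    return "premium"  # low volume: quality over cost, where needed


for volume in (50, 500, 5000):
    print(volume, "->", tier_for_volume(volume))
```

Treat the result as a starting point; a low-volume but low-stakes task may still be a better fit for a cheaper model.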
3. Test and adjust

Pro tip: Start with a mid-tier model (like GPT-4o Mini), evaluate the quality, then adjust:
  • Quality not good enough? → Try a better model
  • Quality is great? → Test a cheaper model to save money

Common Questions

To estimate your costs:
  1. Track your actual usage for a few days
  2. Count tokens per typical request
  3. Calculate based on your chosen model’s rate
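
The three steps above can be sketched as a simple projection. The sample token counts, the 500 requests per day, and the 0.15 USD rate below are made-up illustration values, and the function assumes one flat rate per 1M tokens for both input and output.

```python
from statistics import mean

# Hypothetical sample: (input_tokens, output_tokens) logged for a few requests.
sampled_requests = [(120, 340), (95, 410), (150, 290)]


def estimate_monthly_cost_usd(
    samples: list[tuple[int, int]],
    requests_per_day: int,
    rate_per_million_usd: float,
    days: int = 30,
) -> float:
    """Project monthly cost from sampled requests and a flat per-1M-token rate."""
    # Both the question and the answer consume tokens, so add them together.
    avg_tokens = mean(inp + out for inp, out in samples)
    total_tokens = avg_tokens * requests_per_day * days
    return total_tokens / 1_000_000 * rate_per_million_usd


cost = estimate_monthly_cost_usd(
    sampled_requests, requests_per_day=500, rate_per_million_usd=0.15
)
print(f"Estimated monthly cost: {cost:.2f} USD")
```

Swapping in your own logged samples and your chosen model's actual rate gives a first-pass figure to check against real usage.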
Your Sales Support team can help you estimate costs based on your specific use case and expected volume.

Yes, you can use different models for different tasks. This is actually a smart strategy:
  • Customer-facing tasks: Use better models (they represent your brand)
  • Internal tasks: Use economy models (quality matters less)
  • Testing/development: Use the cheapest models
You can configure different models for different AI agents.

Picked the wrong model? No problem! You can switch models anytime:
  1. Test with a mid-tier model
  2. Evaluate quality for your use case
  3. Adjust up or down based on results
There’s no penalty for switching, and you only pay for what you use.

When creating or editing an AI agent in WonkaChat, you select the model in the agent settings. Each agent can use a different model.

Our recommendation: Start with GPT-4o Mini for most business tasks, then adjust based on your quality needs.

Not sure where to start? Here are our top recommendations:

Best Overall Value

GPT-4o Mini
Excellent quality for most business tasks. This is what we recommend to 80% of our customers.
Use for: Emails, reports, general questions, customer support

Best for Speed

Gemini 2.5 Flash
Very fast and cost-effective. Great for high-volume, time-sensitive tasks.
Use for: Quick responses, FAQs, simple classifications

Best for Complex Tasks

Claude Sonnet 4.6
Exceptional at following complex instructions and nuanced reasoning.
Use for: Analysis, strategy, sensitive communications

Need More Details?