What Are Tokens?
Tokens are the units that measure AI model usage. Think of them as the “currency” that AI models consume when they process your requests.Simple analogy: If AI models were cars, tokens would be gallons of gas. Different cars (models) use different amounts of gas (tokens) to go the same distance.
How to Think About Tokens
- 1 token ≈ 3/4 of a word in English
- “Hello, how are you?” = approximately 6 tokens
- A typical email (200 words) = approximately 265 tokens
- A full page of text (500 words) = approximately 665 tokens
Both your question AND the AI’s answer consume tokens. You pay for both.
Why Different Models Cost Different Amounts
Just like cars, AI models come in different sizes with different capabilities:Economy Models
0.04 to 0.15 USD per 1M tokensLike a compact car - efficient for everyday tasks.Best for:
- Simple questions
- Categorization
- Basic summaries
- High-volume tasks
Standard Models
0.60 to 3.00 USD per 1M tokensLike a sedan - reliable for most needs.Best for:
- Business emails
- Reports
- Research
- General conversations
Premium Models
10.00 to 75.00 USD per 1M tokensLike a luxury car - maximum capability.Best for:
- Critical decisions
- Complex analysis
- Strategic planning
- Maximum accuracy
Available Models in WonkaChat
- By Provider
- By Use Case
OpenAI Models
GPT-5 Series (Latest)
GPT-5 Series (Latest)
| Model | Best For |
|---|---|
| GPT-5.2 | Most advanced general tasks |
| GPT-5 | High-quality responses |
GPT-4o Series
GPT-4o Series
| Model | Best For |
|---|---|
| GPT-4o | High-quality responses, complex reasoning |
| GPT-4o Mini | General tasks, best value |
GPT-4o Mini is recommended for most business tasks - excellent balance of quality and cost.
Anthropic Claude Models
Claude Sonnet Series
Claude Sonnet Series
| Model | Best For |
|---|---|
| Claude Sonnet 4.6 | Latest version, enhanced performance |
| Claude Sonnet 4.5 | Balanced performance |
| Claude Sonnet 4 | Complex reasoning and instructions |
Claude Haiku & Opus
Claude Haiku & Opus
| Model | Best For |
|---|---|
| Claude Haiku 4.5 | Fast, efficient tasks |
| Claude Opus 4.5 | Maximum capability and accuracy |
Google Gemini Models
Gemini 3 Series (Preview)
Gemini 3 Series (Preview)
| Model | Best For |
|---|---|
| Gemini 3 Pro Preview | Next-gen complex reasoning |
| Gemini 3 Flash Preview | Next-gen fast tasks |
Gemini 3 models are preview versions with cutting-edge capabilities.
Gemini 2.5 Series
Gemini 2.5 Series
| Model | Best For |
|---|---|
| Gemini 2.5 Pro | Complex reasoning tasks |
| Gemini 2.5 Flash | Fast, cost-effective tasks |
Gemini 2.5 Flash offers excellent performance at very competitive pricing.
Mistral AI Models
Mistral Latest Series
Mistral Latest Series
| Model | Best For |
|---|---|
| Mistral Large Latest | Advanced tasks requiring high capability |
| Mistral Medium Latest | Balanced performance for general use |
Mistral “Latest” models automatically update to the newest versions, ensuring you always have access to improvements.
Which Model Should You Use?
Start with the question: How important is this task?
Low stakes, high volume:
- Customer FAQs
- Simple categorization
- Basic summaries
- Internal notes
- Customer-facing emails
- Internal reports
- Research tasks
- General business tasks
- Client proposals
- Strategic decisions
- Complex analysis
- Sensitive communications
Consider your volume
How many requests per day?
- High volume (1,000+ per day): Every cent matters → Use economy models
- Medium volume (100-1,000 per day): Balance quality and cost → Use standard models
- Low volume (less than 100 per day): Quality over cost → Use premium models where needed
Common Questions
How much will this actually cost me?
How much will this actually cost me?
To estimate your costs:
- Track your actual usage for a few days
- Count tokens per typical request
- Calculate based on your chosen model’s rate
Can I mix different models?
Can I mix different models?
Yes! This is actually a smart strategy:
- Customer-facing tasks: Use better models (they represent your brand)
- Internal tasks: Use economy models (quality matters less)
- Testing/development: Use the cheapest models
What if I pick the wrong model?
What if I pick the wrong model?
No problem! You can switch models anytime:
- Test with a mid-tier model
- Evaluate quality for your use case
- Adjust up or down based on results
How do I know which model I'm using?
How do I know which model I'm using?
When creating or editing an AI agent in WonkaChat, you select the model in the agent settings. Each agent can use a different model.Our recommendation: Start with GPT-4o Mini for most business tasks, then adjust based on your quality needs.
Recommended Starting Models
Not sure where to start? Here are our top recommendations:Best Overall Value
GPT-4o MiniExcellent quality for most business tasks. This is what we recommend to 80% of our customers.Use for: Emails, reports, general questions, customer support
Best for Speed
Gemini 2.5 FlashVery fast and cost-effective. Great for high-volume, time-sensitive tasks.Use for: Quick responses, FAQs, simple classifications
Best for Complex Tasks
Claude Sonnet 4.6Exceptional at following complex instructions and nuanced reasoning.Use for: Analysis, strategy, sensitive communications
