Skip to main content

What Are Tokens?

Tokens are the units that measure AI model usage. Both your input (prompt) and the AI’s response (completion) consume tokens. Think of tokens as roughly 3/4 of a word in English.Example: “Hello, how are you today?” ≈ 6 tokens
All token prices are listed in USD per 1 million tokens.

Available Models in WonkaChat

The following models are currently available in WonkaChat. For specific pricing information, please contact our Sales Support team.

OpenAI Models

ModelBest For
GPT-5.2Most advanced general-purpose AI, cutting-edge performance
GPT-5High-quality responses, complex reasoning
GPT-5 series represents the latest in AI capabilities with enhanced reasoning and generation quality.
ModelBest For
GPT-4o MiniGeneral tasks, best value for most use cases
GPT-4oHigh-quality responses, complex reasoning
GPT-4o Mini is our most popular choice - excellent balance of quality and cost-effectiveness.

Anthropic Claude Models

ModelBest For
Claude Sonnet 4.6Latest version, enhanced performance and instruction following
Claude Sonnet 4.5Balanced performance for complex tasks
Claude Sonnet 4Complex reasoning, detailed analysis
Claude Sonnet models are renowned for following complex, nuanced instructions accurately.
ModelBest For
Claude Haiku 4.5Fast, efficient tasks with quick turnaround
Claude Opus 4.5Maximum capability, most complex and critical tasks
Claude Haiku is optimized for speed, while Opus provides the highest quality for critical work.

Google Gemini Models

ModelBest For
Gemini 3 Pro PreviewNext-generation complex reasoning (preview)
Gemini 3 Flash PreviewNext-generation fast processing (preview)
Gemini 3 models are currently in preview. Features and availability may change.
ModelBest For
Gemini 2.5 ProComplex reasoning, analytical tasks
Gemini 2.5 FlashFast, cost-effective general tasks
Gemini 2.5 Flash offers excellent performance for high-volume operations at competitive pricing.

Mistral AI Models

ModelBest For
Mistral Large LatestAdvanced tasks requiring high capability
Mistral Medium LatestBalanced performance for general business use
Mistral models are automatically updated to the latest versions, ensuring you always have access to improvements.

Frequently Asked Questions

Tokens are counted for both input (your prompt + agent instructions) and output (the AI’s response).Rough estimate: 1 token ≈ 0.75 English wordsExample conversation:
  • Your question: “Summarize this document” (3 tokens)
  • Document content: 2,000 words (≈2,666 tokens)
  • AI summary: 200 words (≈267 tokens)
  • Total: ~2,936 tokens consumed
Start here:
  • Most teams: GPT-4o Mini - best value
  • Speed-focused: Gemini 2.5 Flash or Claude Haiku 4.5
  • Complex tasks: Claude Sonnet 4.6 or GPT-4o
  • Maximum capability: GPT-5.2 or Claude Opus 4.5
Test with your actual use case to find the sweet spot.
Yes! You can configure different models for different agents. Use expensive models only where quality matters most.Strategy:
  • Customer-facing: Premium or standard models
  • Internal tools: Economy models
  • Testing: Economy models
Token pricing is based on:
  1. Input tokens: Your prompt + system instructions + context
  2. Output tokens: The AI’s generated response
Different models have different pricing for input vs output. For specific pricing details, please contact our Sales Support team.
Newer versions (like Claude Sonnet 4.6 vs 4.5, or GPT-5.2 vs GPT-5) generally offer:
  • Improved reasoning capabilities
  • Better instruction following
  • Enhanced accuracy
  • Sometimes better pricing
The “latest” variants (like Mistral Large Latest) automatically update to the newest version.
Preview models (like Gemini 3 series) are:
  • Cutting-edge but may have changes
  • Best for testing new capabilities
  • Not recommended for critical production workloads
For production use, stick with stable releases like GPT-4o, Claude Sonnet 4.6, or Gemini 2.5 series.

Need Help Choosing?

Contact Our Team

Not sure which models fit your use case? Our Sales Support team can:
  • Analyze your requirements
  • Recommend the optimal model mix
  • Provide detailed pricing for your expected volume
  • Help you test different options
Get Model Recommendations