Token Pricing - WonkaChat Wiki

What Are Tokens?

Tokens are the units that measure AI model usage. Both your input (prompt) and the AI’s response (completion) consume tokens. Think of tokens as roughly 3/4 of a word in English.Example: “Hello, how are you today?” ≈ 6 tokens

All token prices are listed in USD per 1 million tokens.

Available Models in WonkaChat

The following models are currently available in WonkaChat. For specific pricing information, please contact our Sales Support team.

By Provider
By Use Case
Model Comparison

OpenAI Models

GPT-5 Series (Latest)

Model	Best For
GPT-5.2	Most advanced general-purpose AI, cutting-edge performance
GPT-5	High-quality responses, complex reasoning

GPT-5 series represents the latest in AI capabilities with enhanced reasoning and generation quality.

GPT-4o Series

Model	Best For
GPT-4o Mini	General tasks, best value for most use cases
GPT-4o	High-quality responses, complex reasoning

GPT-4o Mini is our most popular choice - excellent balance of quality and cost-effectiveness.

Anthropic Claude Models

Claude Sonnet Series

Model	Best For
Claude Sonnet 4.6	Latest version, enhanced performance and instruction following
Claude Sonnet 4.5	Balanced performance for complex tasks
Claude Sonnet 4	Complex reasoning, detailed analysis

Claude Sonnet models are renowned for following complex, nuanced instructions accurately.

Claude Specialized Models

Model	Best For
Claude Haiku 4.5	Fast, efficient tasks with quick turnaround
Claude Opus 4.5	Maximum capability, most complex and critical tasks

Claude Haiku is optimized for speed, while Opus provides the highest quality for critical work.

Google Gemini Models

Gemini 3 Series (Preview)

Model	Best For
Gemini 3 Pro Preview	Next-generation complex reasoning (preview)
Gemini 3 Flash Preview	Next-generation fast processing (preview)

Gemini 3 models are currently in preview. Features and availability may change.

Gemini 2.5 Series

Model	Best For
Gemini 2.5 Pro	Complex reasoning, analytical tasks
Gemini 2.5 Flash	Fast, cost-effective general tasks

Gemini 2.5 Flash offers excellent performance for high-volume operations at competitive pricing.

Mistral AI Models

Model	Best For
Mistral Large Latest	Advanced tasks requiring high capability
Mistral Medium Latest	Balanced performance for general business use

Mistral models are automatically updated to the latest versions, ensuring you always have access to improvements.

Quick Comparison Guide

Entry Level (Economy)

When to use: High volume, simple tasks, internal tools

Model	Strengths
GPT-4o Mini	Best all-around value, reliable quality
Gemini 2.5 Flash	Very cost-effective, fast
Claude Haiku 4.5	Quick turnaround, efficient

Mid-Tier (Standard)

When to use: General business tasks, customer-facing content

Model	Strengths
GPT-4o	High quality, complex reasoning
Claude Sonnet 4.6	Excellent instruction following
Gemini 2.5 Pro	Strong analytical capabilities
Mistral Large Latest	Advanced general performance

Premium (Advanced)

When to use: Critical decisions, complex analysis, maximum quality

Model	Strengths
GPT-5.2	Cutting-edge AI capabilities
Claude Opus 4.5	Maximum accuracy and quality
Gemini 3 Pro Preview	Next-generation reasoning

By Workload Type

High-Volume Operations: → GPT-4o Mini, Gemini 2.5 FlashCustomer-Facing: → GPT-4o, Claude Sonnet 4.6, Mistral Large LatestInternal Analysis: → Gemini 2.5 Pro, Claude Sonnet 4.5, GPT-4oCritical Tasks: → GPT-5.2, Claude Opus 4.5, Claude Sonnet 4.6

Frequently Asked Questions

How are tokens counted?

Tokens are counted for both input (your prompt + agent instructions) and output (the AI’s response).Rough estimate: 1 token ≈ 0.75 English wordsExample conversation:

Your question: “Summarize this document” (3 tokens)
Document content: 2,000 words (≈2,666 tokens)
AI summary: 200 words (≈267 tokens)
Total: ~2,936 tokens consumed

Which model should I use?

Start here:

Most teams: GPT-4o Mini - best value
Speed-focused: Gemini 2.5 Flash or Claude Haiku 4.5
Complex tasks: Claude Sonnet 4.6 or GPT-4o
Maximum capability: GPT-5.2 or Claude Opus 4.5

Test with your actual use case to find the sweet spot.

Can I switch models?

Yes! You can configure different models for different agents. Use expensive models only where quality matters most.Strategy:

Customer-facing: Premium or standard models
Internal tools: Economy models
Testing: Economy models

How is pricing calculated?

Token pricing is based on:

Input tokens: Your prompt + system instructions + context
Output tokens: The AI’s generated response

Different models have different pricing for input vs output. For specific pricing details, please contact our Sales Support team.

What's the difference between model versions?

Newer versions (like Claude Sonnet 4.6 vs 4.5, or GPT-5.2 vs GPT-5) generally offer:

Improved reasoning capabilities
Better instruction following
Enhanced accuracy
Sometimes better pricing

The “latest” variants (like Mistral Large Latest) automatically update to the newest version.

Are preview models stable for production?

Preview models (like Gemini 3 series) are:

Cutting-edge but may have changes
Best for testing new capabilities
Not recommended for critical production workloads

For production use, stick with stable releases like GPT-4o, Claude Sonnet 4.6, or Gemini 2.5 series.

Need Help Choosing?

Contact Our Team

Not sure which models fit your use case? Our Sales Support team can:

Analyze your requirements
Recommend the optimal model mix
Provide detailed pricing for your expected volume
Help you test different options

Get Model Recommendations

​What Are Tokens?

​Available Models in WonkaChat

​OpenAI Models

​Anthropic Claude Models

​Google Gemini Models

​Mistral AI Models

​Choose by Task Type

​Quick Comparison Guide

​Entry Level (Economy)

​Mid-Tier (Standard)

​Premium (Advanced)

​By Workload Type

​Frequently Asked Questions

​Need Help Choosing?

Contact Our Team

What Are Tokens?

Available Models in WonkaChat

OpenAI Models

Anthropic Claude Models

Google Gemini Models

Mistral AI Models

Choose by Task Type

Quick Comparison Guide

Entry Level (Economy)

Mid-Tier (Standard)

Premium (Advanced)

By Workload Type

Frequently Asked Questions

Need Help Choosing?