Cheap250K contextMistral AI

Mistral: Codestral 2508

Mistral: Codestral 2508 is built for code-heavy work where speed and cost matter more than flashy general-chat behavior. At $0.30 input and $0.90 output per 1M tokens, it sits firmly in the cheap tier: 1,000 coding completions cost about $0.42, and even a 50-step agent workflow is only around $0.04. The non-obvious win is that its low price makes repetitive developer automation easier to justify than many broader models.

Best for

•High-volume coding completions where you care about cost per run.
•Code correction and test generation inside automated dev workflows.
•Large repository or long document coding tasks that benefit from 250K context.

Not ideal for

•Users looking for an all-purpose chat model with broad non-coding strengths.
•Workflows that need capabilities beyond tools, structured output, and long context.

What it costs in real life

Computed from OpenRouter API pricing ($0.30 input / $0.90 output per 1M tokens)

100 short chats(50K in / 30K out)

$0.04Cheap

1 long PDF + questions(80K in / 5K out)

$0.03Cheap

1,000 coding completions(200K in / 400K out)

$0.42Cheap

Agent workflow (50 steps)(50K in / 25K out)

$0.04Cheap

Frequently Asked Questions

Is Mistral: Codestral 2508 worth it for coding?

Yes, if your main job is code completion, code correction, or test generation at scale. The pricing is unusually low for repeated dev tasks, so it makes more sense for automation-heavy teams than for people who just want a general chatbot.

How much does Mistral: Codestral 2508 cost to use?

API pricing is $0.30 per 1M input tokens and $0.90 per 1M output tokens. In practice, the precomputed examples are very cheap: roughly $0.42 for 1,000 coding completions and about $0.04 for a 50-step agent workflow.

Can I use Mistral: Codestral 2508 for big codebases or long files?

Yes. It supports 250K context, which is useful when you need to pass large code sections, long technical documents, or bigger multi-file prompts into one workflow.

Capabilities

Vision

Tool calling

Structured output

Reasoning

Open weights

Long context

Cheapest access path

The cheapest known way to use it is direct API usage, since no subscriptions in our catalog currently include this model. That makes costs easy to predict: about $0.04 for 100 short chats, $0.03 for one long PDF plus questions, and $0.42 for 1,000 coding completions according to StackTrim AI.

Alternatives

gemma-3-27b-itCheaper gemma-4-26b-a4b-itCheaper gemma-4-31b-itCheaper llama-4-scoutCheaper claude-opus-4-6Longer context

codingcheaplong-contexttoolsstructured-outputlow-latencyagent-friendly