Mistral Small

Mistral Small for low-latency high-concurrency and cost-sensitive workloads

Published

Mistral Small is a Mistral model for low-latency high-concurrency and cost-sensitive workloads, with options across enterprise, coding, vision and open-weight workflows.

starsCapabilities

visibilityVision understandingcodeFunction callingstreamStreaming output

paymentsContext and pricing

Context limit256,000

Max output256,000

Knowledge cutoff2025-06

Input price$0.15/ 1M tokens

Output price$0.6/ 1M tokens

descriptionOverview

Overview

Mistral Small is listed in Mistral's official model catalog, with model ID mistral-small-latest. Mistral's lineup spans advanced general models, lightweight models, coding models, vision models and open-weight MoE models.

Best for

Use Mistral Small when your product needs multilingual capability, European vendor options, coding assistance, vision understanding or controlled deployment. Test multilingual quality, reasoning, deployment method, cost and compliance requirements before production.

lightbulbUse cases

Multilingual assistants and enterprise Q&A
Code completion and developer help
Multimodal vision understanding
Research and self-hosting

thumb_upStrengths

Broad Mistral model family
Useful for multilingual and enterprise scenarios
Includes coding, vision and open-weight branches
Can be tiered by deployment and cost

infoLimitations

Capabilities vary widely by model
Self-hosting requires engineering effort
Chinese quality needs separate evaluation
Limits depend on Mistral documentation

linkReferences

open_in_newhttps://docs.mistral.ai/getting-started/models/