GPT-5 Mini

GPT-5 Mini

Cheaper GPT-5 for structured tasks and reasoning at scale.

Community Sentiment

Mixed
Based on Reddit reviews

Community Verdict

Best for Budget Tasks
Based on Reddit reviews

Input Modalities

Text, Images

Output Modalities

Text

Price / 1M tokens

$0.25$2
InputOutput

Best For

Cost-effective for straightforward tasks
Well-defined work with clear prompts
Fast responses with low overhead
Overflow when larger models hit rate limits
Simple coding and routine ops
High-volume, low-complexity workloads

Avoid For

Complex tasks needing deep reasoning or context
Cache-heavy apps (reported inconsistencies)
Nuanced understanding and high-level comprehension
Heavy, resource-intensive workflows
When GPT-5 / GPT-5.1 quality is required
128,000 context window
65,536 max output tokens
Jan 1, 2024 knowledge cutoff
Reasoning supported

Parameters

While OpenAI documents a unified parameter set for the Chat Completions and Responses APIs, each model supports only a limited subset of model-specific parameters and values. This table lists the supported model-specific GPT-5 Mini API parameters and allowed values.
ParameterDescriptionPathSupported Values
ModelSelects the model version for the requestmodel
gpt-5-mini
Message rolesDefines the role of the message in the inputmessages[].role
developersystemuserassistant
Reasoning effortControls the depth of internal reasoning used by the modelreasoning_effort
minimallowmediumhigh
Reasoning summaryControls whether the model produces a concise or detailed reasoning summaryreasoning_summary
detailedconciseautonull
Max output tokensMax output tokens the model may generatemax_completion_tokens16 .. 65,536
VerbosityControls how brief or detailed the generated response isverbosity
lowmediumhigh
Output formatSpecifies the output format, including structured JSON schemasresponse_format
textjson_objectjson_schema
TemperatureControls how random or deterministic the output istemperatureNot supported
Top PControls how diverse the output tokens aretop_pNot supported
Presence penaltyEncourages the model to introduce new topicspresence_penaltyNot supported
Frequency penaltyReduces repetition of the same words or phrasesfrequency_penaltyNot supported

Pricing

GPT-5 Mini API pricing is based on token usage for input and output. Prices are listed per 1M tokens, with lower rates for cached input. Tool-specific features may add per-call fees.
Text tokens
Per 1M tokens
Input$0.25
Cached input$0.03
Output$2.00
Example costs (GPT-5 Mini)
TASK
APPROX COST
Simple chat (1k tokens)
~$0.0003
Document summarization
~$0.005–$0.02
High-volume batch processing
~$0.01–$0.05 per request

Modalities

What the model can accept and produce
TextInput and output
ImagesInput only
AudioNot supported
VideoNot supported

Features

Platform-level capabilities
StreamingSupported
Function callingSupported
Structured outputsSupported
Fine-tuningNot supported
DistillationNot supported
Predicted outputsNot supported

Tools

Tools supported by this model when using the Responses API.
Web searchSupported
File searchSupported
Image generationSupported
Code interpreterSupported
MCPSupported
Computer useNot supported

Snapshots

GPT-5 Mini model snapshots ensure stable behavior by locking a specific version. See all available snapshots and aliases below.
GPT-5 Mini
gpt-5-mini
↪ gpt-5-mini-2025-08-07
gpt-5-mini-2025-08-07

FAQs

Ship prompts that pass the tests
Don't wait until they break in production
© 2026 LangFast. All rights reserved. Privacy Policy. Terms of Service.