
GPT Models

5 min read · Tags: openai, gpt, models, reference, vendor-comparison
An in-depth comparison of GPT-5.5, GPT-5.4, GPT-5.4 mini, and GPT-5.4 nano: capabilities, pricing, context windows, reasoning levels, specialized models (GPT Image 2, Realtime, Sora, Whisper, TTS), and a model selection guide.
Key Takeaways
  • Four current GPT tiers (input/output price per 1M tokens, context window): GPT-5.5 ($5/$30, 1M), GPT-5.4 ($2.50/$15, 1M), GPT-5.4 mini ($0.75/$4.50, 400K), GPT-5.4 nano ($0.20/$1.25, 400K)
  • All GPT models support text + image input, tool use, streaming, structured outputs, and prompt caching
  • GPT-5.5, GPT-5.4, and GPT-5.4 mini have configurable reasoning levels (none/low/medium/high/xhigh) for balancing speed vs depth; GPT-5.4 nano supports none/low/medium
  • Specialized models: GPT Image 2 (image), GPT Realtime 2 (voice), Sora (video), Whisper/TTS (speech)

Current GPT Models — May 2026

| Feature | GPT-5.5 | GPT-5.4 | GPT-5.4 mini | GPT-5.4 nano |
|---|---|---|---|---|
| Description | Flagship: new class of intelligence | Affordable professional tier | Strong mini for coding & agents | Fastest, cheapest |
| Model ID | gpt-5.5 | gpt-5.4 | gpt-5.4-mini | gpt-5.4-nano |
| Input Pricing | $5 / 1M tokens | $2.50 / 1M tokens | $0.75 / 1M tokens | $0.20 / 1M tokens |
| Cached Input | $0.50 / 1M tokens | $0.25 / 1M tokens | $0.075 / 1M tokens | $0.02 / 1M tokens |
| Output Pricing | $30 / 1M tokens | $15 / 1M tokens | $4.50 / 1M tokens | $1.25 / 1M tokens |
| Context Window | 1M tokens | 1M tokens | 400K tokens | 400K tokens |
| Max Output | 128K tokens | 128K tokens | 128K tokens | 128K tokens |
| Reasoning Levels | none/low/medium/high/xhigh | none/low/medium/high/xhigh | none/low/medium/high/xhigh | none/low/medium |
| Vision (Image Input) | Yes | Yes | Yes | Yes |
| Tool Use | Functions, Web, File search, Computer use | Functions, Web, File search, Computer use | Functions, Web, File search, Computer use | Functions, Web |
| Streaming | Yes | Yes | Yes | Yes |
| Prompt Caching | Yes (10% of input) | Yes (10% of input) | Yes (10% of input) | Yes |
| Batch API (50% off) | Yes | Yes | Yes | Yes |
| Flex Processing | Yes | Yes | Yes | Yes |
| Knowledge Cutoff | Dec 1, 2025 | Aug 31, 2025 | Aug 31, 2025 | Aug 31, 2025 |

Pricing History: GPT-5.5 dropped from $15/$75 (GPT-4 tier) to $5/$30. GPT-5.4 at $2.50/$15 offers near-flagship capability at a fraction of the price. All models have prompt caching at 10% of base input cost.
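To make the per-token prices concrete, here is a minimal sketch of a per-request cost estimate. The prices are copied from the comparison table above; the function name `request_cost` and the idea of passing the cached portion of the prompt as `cached_tokens` are assumptions made for illustration, not part of any official SDK.

```python
# Prices in USD per 1M tokens, copied from the comparison table above.
PRICES = {
    "gpt-5.5": {"input": 5.00, "cached_input": 0.50, "output": 30.00},
    "gpt-5.4": {"input": 2.50, "cached_input": 0.25, "output": 15.00},
    "gpt-5.4-mini": {"input": 0.75, "cached_input": 0.075, "output": 4.50},
    "gpt-5.4-nano": {"input": 0.20, "cached_input": 0.02, "output": 1.25},
}


def request_cost(model, input_tokens, output_tokens, cached_tokens=0):
    """Estimate the USD cost of one request (hypothetical helper).

    cached_tokens is the part of input_tokens served from the prompt cache.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    price = PRICES[model]
    uncached_tokens = input_tokens - cached_tokens
    total = (
        uncached_tokens * price["input"]
        + cached_tokens * price["cached_input"]
        + output_tokens * price["output"]
    )
    return total / 1_000_000


if __name__ == "__main__":
    # Example: 20K-token prompt with 15K tokens cached, 2K-token answer.
    for model in PRICES:
        cost = request_cost(model, 20_000, 2_000, cached_tokens=15_000)
        print(f"{model}: ${cost:.4f}")
```

Output tokens dominate the bill at these ratios, so caching helps most for workloads with long, repeated prompts and short answers.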

Reasoning Levels

GPT-5.5 and the GPT-5.4 models have configurable reasoning depth (GPT-5.4 nano supports only none, low, and medium):

| Level | Behavior | Cost | Latency | Best For |
|---|---|---|---|---|
| none | Standard response, no explicit reasoning | Lowest | Fastest | Simple Q&A, classification, routing |
| low | Light reasoning for moderate problems | Low | Fast | Code completion, summarization |
| medium | Balanced depth; good default | Medium | Medium | Analysis, code review, research |
| high | Deep reasoning for complex tasks | High | Slower | Architecture design, debugging |
| xhigh | Maximum reasoning; spends significant tokens “thinking” | Highest | Slowest | Hard math, complex multi-step problems |

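from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment
response = client.responses.create(
    model="gpt-5.5",
    input="Design a distributed rate limiter...",
    reasoning={"effort": "high"},  # controls thinking depth
)
print(response.output_text)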
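Because GPT-5.4 nano stops at medium, a caller that switches models at runtime may want to clamp the requested effort first. The following is a rough sketch under that assumption: `clamp_effort` and `build_request` are hypothetical helpers, and the per-model ceilings are taken from the comparison table rather than from an API query.

```python
# Reasoning levels from the table, ordered from shallowest to deepest.
EFFORT_ORDER = ("none", "low", "medium", "high", "xhigh")

# Highest level each model accepts, per the comparison table.
MAX_EFFORT = {
    "gpt-5.5": "xhigh",
    "gpt-5.4": "xhigh",
    "gpt-5.4-mini": "xhigh",
    "gpt-5.4-nano": "medium",
}


def clamp_effort(model, effort):
    """Return effort, lowered to the model's ceiling if it exceeds it."""
    if effort not in EFFORT_ORDER:
        raise ValueError(f"unknown reasoning effort: {effort!r}")
    ceiling = MAX_EFFORT[model]
    index = min(EFFORT_ORDER.index(effort), EFFORT_ORDER.index(ceiling))
    return EFFORT_ORDER[index]


def build_request(model, prompt, effort="medium"):
    """Build keyword arguments for client.responses.create()."""
    return {
        "model": model,
        "input": prompt,
        "reasoning": {"effort": clamp_effort(model, effort)},
    }


if __name__ == "__main__":
    # nano cannot go above medium, so xhigh is lowered.
    print(build_request("gpt-5.4-nano", "Classify this ticket.", effort="xhigh"))
```

The resulting dictionary can be passed as `client.responses.create(**build_request(...))`. Clamping silently is one design choice; raising an error instead may be preferable when an unexpected downgrade should be visible.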

Specialized Models

GPT Image 2 — Image Generation

| Feature | Detail |
|---|---|
| Model ID | gpt-image-2 |
| Input (image) | $8 / 1M tokens ($2 cached) |
| Output (image) | $30 / 1M tokens |
| Input (text) | $5 / 1M tokens ($1.25 cached) |
| Use Cases | Product images, illustrations, design mockups, photo editing |

Realtime API — Voice & Audio

| Model | Use Case | Pricing |
|---|---|---|
| GPT Realtime 2 | Voice agents, interactive audio | Audio: $32 in / $64 out per 1M tokens. Text: $4 in / $24 out per 1M tokens |
| GPT Realtime Translate | Live speech-to-speech translation | $0.034/min |
| GPT Realtime Whisper | Streaming speech-to-text | $0.017/min |
| GPT-4o Transcribe | High-quality speech-to-text | Pay-per-use |
| GPT-4o mini TTS | Text-to-speech generation | Pay-per-use |
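For the per-minute models, a back-of-the-envelope monthly estimate can be sketched as below. The rates come from the table above; the dictionary keys are illustrative labels rather than official model IDs, and the sketch assumes billing scales linearly with minutes, which may not match actual billing granularity.

```python
# USD per minute, from the Realtime API table. Keys are illustrative
# labels chosen for this sketch, not official model IDs.
PER_MINUTE_USD = {
    "realtime_translate": 0.034,
    "realtime_whisper": 0.017,
}


def monthly_audio_cost(service, calls_per_month, avg_minutes_per_call):
    """Estimate monthly spend, assuming linear per-minute billing."""
    if calls_per_month < 0 or avg_minutes_per_call < 0:
        raise ValueError("call count and duration must be non-negative")
    total_minutes = calls_per_month * avg_minutes_per_call
    return PER_MINUTE_USD[service] * total_minutes


if __name__ == "__main__":
    # Example: 10,000 calls per month averaging 3 minutes each.
    for service in PER_MINUTE_USD:
        print(f"{service}: ${monthly_audio_cost(service, 10_000, 3):,.2f}")
```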

Sora — Video Generation

Sora provides cinematic video generation, available through ChatGPT Pro and the API. Pricing varies by resolution and duration.

Whisper / TTS

Traditional speech-to-text (Whisper) and text-to-speech (TTS) models are available at lower cost than the Realtime API variants.

Model Selection Guide

What matters most?
├─ Maximum quality, complex reasoning → GPT-5.5
│ Use when: R&D, architecture work, deep analysis
│ Cost: $5/$30 per 1M. Batch: $2.50/$15
├─ Best value for production → GPT-5.4
│ Use when: most APIs, coding, content, analysis
│ Cost: $2.50/$15 per 1M. Batch: $1.25/$7.50
├─ Cost-efficient at scale → GPT-5.4 mini
│ Use when: high-volume, computer use, subagents
│ Cost: $0.75/$4.50 per 1M. Batch: $0.375/$2.25
├─ Fastest, cheapest → GPT-5.4 nano
│ Use when: classification, routing, simple automation
│ Cost: $0.20/$1.25 per 1M. Batch: $0.10/$0.625
├─ Generate images → GPT Image 2
├─ Real-time voice/audio → Realtime API
├─ Speech-to-text → Realtime Whisper or GPT-4o Transcribe
└─ Text-to-speech → GPT-4o mini TTS
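The text-model branches of the tree above can be sketched as a small routing function. This is only an illustration: the task labels (`"deep_reasoning"`, `"high_volume"`, and so on) and the function `pick_model` are hypothetical names, and the computer-use fallback reflects the table entry that GPT-5.4 nano lists only Functions and Web under Tool Use.

```python
# Hypothetical task labels mapped to the model IDs from the comparison table.
ROUTES = {
    "deep_reasoning": "gpt-5.5",       # R&D, architecture work, deep analysis
    "production": "gpt-5.4",           # most APIs, coding, content, analysis
    "high_volume": "gpt-5.4-mini",     # high-volume work, subagents
    "classification": "gpt-5.4-nano",  # classification, simple automation
    "routing": "gpt-5.4-nano",
}

DEFAULT_MODEL = "gpt-5.4"  # "best value for production" in the guide


def pick_model(task_kind, needs_computer_use=False):
    """Choose a model ID for a task label, falling back to the default tier."""
    model = ROUTES.get(task_kind, DEFAULT_MODEL)
    # The table lists no computer-use tool for nano, so step up to mini.
    if needs_computer_use and model == "gpt-5.4-nano":
        model = "gpt-5.4-mini"
    return model


if __name__ == "__main__":
    print(pick_model("classification"))
    print(pick_model("classification", needs_computer_use=True))
    print(pick_model("something_else"))
```

In practice the task label would come from an upstream classifier or from request metadata; that part is outside the scope of this sketch.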

Cost Optimization

| Strategy | Savings | When to Apply |
|---|---|---|
| Prompt Caching | 90% on input | Repeated system prompts, same-context queries |
| Batch API | 50% all token costs | Async, non-urgent workloads |
| Flex Processing | Lower cost | Non-production, lower-priority tasks |
| Model Routing | 30-70% | Route to nano/mini for simple tasks, 5.5 for complex |
| Data Residency | +10% surcharge | Opt-in for regional processing compliance |
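How much model routing saves depends heavily on the traffic mix. The sketch below estimates the saving for a given mix against sending everything to one baseline model. The prices are the list prices from the comparison table; the function `routing_savings`, the example mix, and the per-request token counts are assumptions chosen for illustration.

```python
# (input, output) list prices in USD per 1M tokens, from the comparison table.
PRICES = {
    "gpt-5.5": (5.00, 30.00),
    "gpt-5.4": (2.50, 15.00),
    "gpt-5.4-mini": (0.75, 4.50),
    "gpt-5.4-nano": (0.20, 1.25),
}


def cost_per_request(model, input_tokens, output_tokens):
    """USD cost of one uncached, non-batch request."""
    input_price, output_price = PRICES[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def routing_savings(mix, baseline, input_tokens, output_tokens):
    """Fraction saved by routing traffic per mix instead of using baseline.

    mix maps model ID to its share of requests; shares must sum to 1.
    """
    if abs(sum(mix.values()) - 1.0) > 1e-9:
        raise ValueError("traffic shares must sum to 1")
    blended = sum(
        share * cost_per_request(model, input_tokens, output_tokens)
        for model, share in mix.items()
    )
    return 1 - blended / cost_per_request(baseline, input_tokens, output_tokens)


if __name__ == "__main__":
    # Half of the traffic stays on GPT-5.5, half moves to GPT-5.4 mini.
    mix = {"gpt-5.5": 0.5, "gpt-5.4-mini": 0.5}
    saved = routing_savings(mix, "gpt-5.5", input_tokens=2_000, output_tokens=500)
    print(f"Estimated saving: {saved:.1%}")
```

This treats every request as the same size and ignores caching and batch discounts, so it is a rough planning figure rather than a billing forecast.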

Comparing Across Models

For a broader comparison across GPT, Claude, Gemini, and DeepSeek, see the Models Decision Guide.