GPT-5.4 mini

OpenAI - gpt-5.4-mini

8.6/10

Provisional · 0 practitioner reviews · Source verified

Data source: This model page uses public provider documentation, pricing/context data, benchmark signals where available, and manual curation. Practitioner reviews are listed separately below and do not change the score until approved.

Context window

400,000 tokens

Input price

$0.75/1M

Output price

$4.50/1M

Checked

May 16, 2026

Pricing caveat: The listed prices are standard API pricing for requests below 270K tokens of context length.

Decision Notes

Best for

  • Coding assistants and subagents that need OpenAI tool support at lower cost
  • Interactive product workflows where latency matters
  • General agent tasks below a 400K context budget

Avoid when

  • Tasks that truly require the best frontier reasoning quality
  • Workloads needing more than 400K tokens of context

Cost example

1M input tokens ($0.75) plus 200K output tokens ($0.90) cost about $1.65 before caching.
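
The arithmetic behind this figure can be reproduced with a minimal sketch. It assumes the standard per-million-token prices listed on this page and ignores caching and any pricing that applies beyond the standard rates. The function and constant names are hypothetical and are not part of any OpenAI SDK.

```python
from decimal import Decimal

# Prices taken from this page (standard API pricing, USD per 1M tokens).
INPUT_PRICE_PER_M = Decimal("0.75")
OUTPUT_PRICE_PER_M = Decimal("4.50")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the uncached cost in USD for a given token volume.

    Hypothetical helper: assumes flat per-token pricing with no caching
    discount and no separate long-context rate.
    """
    input_cost = INPUT_PRICE_PER_M * input_tokens / TOKENS_PER_M
    output_cost = OUTPUT_PRICE_PER_M * output_tokens / TOKENS_PER_M
    # Round to whole cents for display.
    return (input_cost + output_cost).quantize(Decimal("0.01"))


# The page's example: 1M input tokens plus 200K output tokens.
cost = estimate_cost(1_000_000, 200_000)
print(f"${cost}")  # prints "$1.65"
```

Output tokens cost six times as much as input tokens at these rates, so the 200K output tokens account for more of the total than the 1M input tokens.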

Dimension Breakdown

Tool Calling 9/10

Reliability of function/tool calling — correct schema adherence and parameter extraction

Cost Efficiency 8/10

Price per token relative to output quality for agent tasks

Latency 9/10

Response time — time to first token and total generation time under load

API Reliability 9/10

Uptime, rate limit headroom, and error rates in production

Context Quality 8/10

Long-context coherence and instruction following across multiple turns
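
The 8.6/10 headline score matches the unweighted mean of the five dimension scores above. The page does not state how the headline is computed, so the sketch below shows only that the numbers agree under that assumption. The dictionary and variable names are hypothetical.

```python
from statistics import mean

# Dimension scores as listed on this page (each out of 10).
dimension_scores = {
    "Tool Calling": 9,
    "Cost Efficiency": 8,
    "Latency": 9,
    "API Reliability": 9,
    "Context Quality": 8,
}

# Assumption: the headline is a plain unweighted average, rounded to one decimal.
overall = round(mean(list(dimension_scores.values())), 1)
print(f"{overall}/10")  # prints "8.6/10"
```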

Top Use Cases

Coding, Customer Service, General

Summary

A strong default OpenAI choice for cost-aware coding agents and subagents. It trades some frontier depth for much better unit economics than GPT-5.5.

Practitioner Reviews

No reviews yet

Last updated: May 16, 2026