News

OpenAI launches GPT-5: simulated reasoning for all, fewer hallucinations and new APIs

Article Highlights:
  • Simulated reasoning available to free users
  • Unified architecture with real-time routing
  • OpenAI’s strongest coding model per internal benchmarks
  • Fewer hallucinations vs. GPT‑4o and o3 ChatGPT
  • UI updates: colors, personalities, Advanced Voice
  • Pro integrations with Gmail, Calendar and Contacts
  • Three API SKUs: gpt‑5, mini and nano
  • Context window up to 256k tokens (1M with GPT-4.1)
  • Competitive pricing with cache discounts
  • Safe completions for helpful answers within safety bounds
OpenAI launches GPT-5: simulated reasoning for all, fewer hallucinations and new APIs

OpenAI introduced the GPT‑5 family with Pro, mini, and nano variants. The release emphasizes simulated reasoning available to free users, fewer hallucinations, multimodal interactions, and expanded tools for developers.

What’s new in GPT‑5

  • Simulated reasoning for the free tier, breaking problems into multiple steps.
  • Unified architecture: an efficient model for most tasks, a deeper "GPT‑5 Thinking" mode for hard problems, and a real‑time router to pick the best path.
  • Multimodal input/output across text, voice, and images.
  • "Safe completions": helpful responses within safety limits instead of blanket refusals.

Performance and benchmarks

Coding

OpenAI claims GPT‑5 is its strongest coding model: 74.9% on SWE‑bench Verified and 88% on Aider Polyglot. It can finish end‑to‑end development tasks with minimal prompting and generate UI designs for non‑coders.

Reasoning, accuracy, and multimodal

  • Math (no tools): 94.6% on AIME 2025.
  • Multimodal understanding: 84.2% on MMMU.
  • GPQA (no tools) with GPT‑5 Pro: 88.4%.
  • With web search, answers show ~45% fewer factual errors vs. GPT‑4o; in "thinking" mode, ~80% fewer vs. o3.

Health use cases

On HealthBench Hard, GPT‑5 scores 46.2% per OpenAI, with the standard caveat: ChatGPT is not a medical professional. It can help interpret results and draft questions, but verification is essential.

ChatGPT experience updates

  • Custom chat colors and preset "personalities" (Cynic, Robot, Listener, Nerd) that adjust the system prompt.
  • Pro‑tier integrations with Gmail, Google Calendar, and Google Contacts.
  • Unified "Advanced Voice" for better instruction following and adaptive speaking style.
  • Reduced sycophancy: targeted tests show a drop from 14.5% to under 6%.

"Tackling complex problems by splitting them into sub‑problems, checking one’s own answers, and self‑correcting in real time—capabilities that bring it notably closer to human‑like reasoning."

Sam Altman, CEO OpenAI

Availability and rollout

The rollout begins today for ChatGPT’s 700M weekly active users, with tier‑based limits. Pro subscribers get unlimited GPT‑5 and GPT‑5 Pro; Plus users receive higher caps than free. Where available, GPT‑5 Pro replaces o3‑pro. Simulated reasoning is automatically applied when beneficial; paid users can select "GPT‑5 Thinking" or prompt with "think hard about this". Standard Voice Mode will sunset within 30 days in favor of Advanced Voice. After free‑tier limits, usage falls back to GPT‑5 mini.

APIs, pricing, and developer features

  • API SKUs: gpt‑5, gpt‑5‑mini, gpt‑5‑nano, trading off latency and cost.
  • Context window: 256,000 tokens (up to 1M tokens with GPT‑4.1).
  • Indicative pricing: $1.25/M input (‑90% with cache) and $10/M output for GPT‑5; $0.25/M input and $2/M output for Mini; $0.05/M input and $0.40/M output for Nano. GPT‑5 Pro API pricing is TBD.
  • New tools: free‑form function calling (send raw strings like SQL to tools), verbosity controls, and reasoning effort controls for speed vs. depth.

Competitive context and default model

The launch lands amid pressure from Gemini, Claude, and Llama. GPT‑5 replaces GPT‑4o, OpenAI o3, o4‑mini, GPT‑4.1, and GPT‑4.5 as the default model for signed‑in users, aiming to unify capabilities and cut errors with a dynamic routing layer.

Why it matters

The leap from GPT‑4 to GPT‑5 is less dramatic than the GPT‑3 to GPT‑4 jump, partly due to intermediate releases. Still, there are clear gains in coding, accuracy, and safer handling of sensitive queries. Critical reading and fact‑checking remain best practice.

OpenAI introduced the GPT‑5 family with Pro, mini, and nano variants. The release emphasizes simulated reasoning available to free users, fewer hallucinations [...] Evol Magazine