The reasoning model family (o1 → o3 → GPT-5.4 Thinking) represents a fundamental shift. Instead of generating answers token-by-token immediately, they use reinforcement learning to generate a hidden Chain of Thought (CoT) before producing the final output.
| Model | Released | Key Advance |
|---|---|---|
| o1 | Sep 2024 | First reasoning model. No system prompts, no tools. |
| o3-mini | Jan 2025 | Cheaper reasoning with effort levels (low/medium/high). |
| GPT-5.4 Thinking | 2026 | Unified reasoning + full API features (tools, system prompts, structured outputs). |
developer instead of system.// Reasoning models use the "developer" role:
const response = await openai.responses.create({
model: "gpt-5.4-thinking",
reasoning: { effort: "high" }, // low | medium | high
input: [
{ role: "developer", content: "You are a math olympiad judge. Be rigorous." },
{ role: "user", content: "Prove that sqrt(2) is irrational." }
]
});