Classification by processing depth and capabilities
Updated: November 2025
(<2 seconds)
Query → Direct Response
Instant prediction without visible reasoning steps.
$0.6-3 / 1M tokens
(2-10 seconds)
Query → Analysis (3-8 steps) → Response
Visible problem decomposition into multiple reasoning steps.
$3-15 / 1M tokens
(10-120 seconds)
Query → Exploration (10-100+ steps) → Response
Multi-branch exploration with internal validation and backtracking.
$15-60 / 1M tokens
| Criterion | Immediate | Short | Extended |
|---|---|---|---|
| Latency | <2s | 2-10s | 10-120s |
| Reasoning steps | 0 | 3-8 | 10-100+ |
| Tool usage | |||
| Complex tasks | |||
| GPQA Diamond (accuracy) | 45-60% | 60-75% | 75-85% |
Modality legend:
Text Vision Audio Multimedia generation
Immediate Response
Fast and direct
Short Reasoning
Structured analysis
Extended Reasoning
Deep exploration