GPT-4.1, launched by OpenAI on April 14, 2025, has a 1 million token context window and supports outputs of up to 32,768 tokens per request. It scores 54.6% on the SWE-bench Verified coding benchmark and improves on GPT-4o by 10.5 percentage points on MultiChallenge, an instruction-following benchmark. The model's knowledge cutoff is June 2024. Pricing is $2.00 per million input tokens and $8.00 per million output tokens, with a 75% discount on cached input tokens, which lowers the cost of requests that reuse previously sent input.
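The pricing arithmetic above can be made concrete with a short calculation. The following is a minimal sketch, not an official calculator: the function name `estimate_cost` and its parameters are hypothetical, and it assumes the 75% discount applies only to the cached portion of the input while output tokens are always billed at the full rate.

```python
# Prices quoted above for GPT-4.1, in USD per million tokens.
INPUT_PRICE = 2.00
OUTPUT_PRICE = 8.00
CACHED_DISCOUNT = 0.75  # assumed to apply to cached input tokens only


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    cached_tokens is the part of input_tokens that is served from cache.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")

    fresh_tokens = input_tokens - cached_tokens
    cached_price = INPUT_PRICE * (1 - CACHED_DISCOUNT)  # $0.50 per million

    total = (
        fresh_tokens * INPUT_PRICE
        + cached_tokens * cached_price
        + output_tokens * OUTPUT_PRICE
    )
    return total / 1_000_000


if __name__ == "__main__":
    # 100,000 input tokens (80,000 of them cached) and 2,000 output tokens.
    print(f"with cache:    ${estimate_cost(100_000, 2_000, cached_tokens=80_000):.3f}")
    print(f"without cache: ${estimate_cost(100_000, 2_000):.3f}")
```

Under these assumptions, a request with 100,000 input tokens and 2,000 output tokens costs about $0.216 with no caching and about $0.096 when 80,000 of the input tokens are cached.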
Attribute | Claude Opus 4 | GPT-4.1 |
---|---|---|
Provider | Anthropic | OpenAI |
Release Date | May 22, 2025 | Apr 14, 2025 |
Modalities | text, images | text, images |
API Providers | Anthropic API, Amazon Bedrock, Google Cloud's Vertex AI | OpenAI API |
Knowledge Cut-off Date | Unknown | June 2024 |
Open Source | No | No |
Pricing Input | $15 per million tokens | $2.00 per million tokens |
Pricing Output | $75 per million tokens | $8.00 per million tokens |
MMLU | 88.8% | 90.2% (pass@1) |
MMLU Pro | - | - |
MMMU | 76.5% | 74.8% |
HellaSwag | - | - |
HumanEval | - | - |
MATH | - | - |
GPQA Diamond | 79.6% | 66.3% |
IFEval | - | - |
AIME 2024 | - | 48.1% |
AIME 2025 | 75.5% | - |
Unlabeled benchmark | - | 87.3% (pass@1) |
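To put the two pricing rows side by side, the listed rates can be applied to the same workload. This is a rough sketch only: `PRICES`, `request_cost`, and `compare` are hypothetical names, the Claude Opus 4 input price is assumed to be per million tokens like the other three figures, and caching or batch discounts are ignored.

```python
# Listed prices from the table above, in USD per million tokens.
# Assumption: the Claude Opus 4 input price uses the same unit as the rest.
PRICES = {
    "Claude Opus 4": {"input": 15.00, "output": 75.00},
    "GPT-4.1": {"input": 2.00, "output": 8.00},
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """USD cost of one request at list price (no caching or batch discounts)."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


def compare(input_tokens: int, output_tokens: int) -> dict[str, float]:
    """Cost of the same workload on every model in PRICES."""
    return {
        model: request_cost(model, input_tokens, output_tokens)
        for model in PRICES
    }


if __name__ == "__main__":
    # Example workload: 10,000 input tokens and 1,000 output tokens.
    for model, cost in compare(10_000, 1_000).items():
        print(f"{model}: ${cost:.3f}")
```

For this example workload the list-price cost is about $0.225 on Claude Opus 4 and about $0.028 on GPT-4.1, roughly an eightfold difference. Price is only one axis of the comparison; the benchmark rows above show each model leading on different tasks.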