Claude Opus 4.1

Comments: 0
Claude Opus 4.1 #0
Claude Opus 4.1 #1
Claude Opus 4.1 #2
3651
704

Position in the overall ranking as of
June 2026
20
User rating
https://compare-ai.foundtt.com
4.1

Model Overview

Web Site
AI Model Web Page
Provider
The entity that provides this model.
Chat
Input a message to start chatting
Release Date
When the model was first released.
10 months ago
Aug 05, 2025
Modalities
Types of data this model can process
text ?
images ?
API Providers
The providers that offer this model. (This is not an exhaustive list.)
Anthropic API, Claude Code, Amazon Bedrock, Vertex AI, GitHub Copilot
Knowledge Cut-off Date
When the model's knowledge was last updated.
-
Open Source
Whether the model's code is available for public use.
No
Pricing Input
Cost for processing tokens in your prompts
$15 per million tokens
Pricing Output
Cost for tokens generated by the model
$75 per million tokens
MMLU
Massive Multitask Language Understanding - Tests knowledge across 57 subjects including mathematics, history, law, and more
89.5%
Source
MMLU-Pro
A more robust MMLU benchmark with harder, reasoning-focused questions, a larger choice set, and reduced prompt sensitivity
-
MMMU
Massive Multitask Multimodal Understanding - Tests understanding across text, images, audio, and video
77.1%
Source
HellaSwag
A challenging sentence completion benchmark
-
HumanEval
Evaluates code generation and problem-solving capabilities
-
MATH
Tests mathematical problem-solving abilities across various difficulty levels
-
GPQA
Tests PhD-level knowledge in chemistry, biology, and physics through multiple choice questions that require deep domain expertise
80.9%
Diamond
Source
IFEval
Tests model's ability to accurately follow explicit formatting instructions, generate appropriate outputs, and maintain consistent instruction adherence across different tasks
-
SimpleQA
Assessing the accuracy of simple questions
-
AIME 2024
-
AIME 2025
78.0%
Source
Aider Polyglot
Multilingual programming benchmark.
-
LiveCodeBench v5
Benchmark for real-time programming
-
Global MMLU (Lite)
A simplified version of the benchmark for assessing the universality of models at the global level.
-
MathVista
Evaluates the mathematical reasoning abilities of AI models within visual contexts
-
Mobile Application

Add a Comment

Compare LLMs


10%
Our site uses cookies.

Privacy and Cookie Policy: This site uses cookies. By continuing to use the site, you agree to their use.