Grok 4

DeepSeek-R1

DeepSeek-R1 is a 671B parameter Mixture-of-Experts (MoE) model with 37B activated parameters per token, trained via large-scale reinforcement learning with a focus on reasoning capabilities. It incorporates two RL stages for discovering improved reasoning patterns and aligning with human preferences, along with two SFT stages for seeding reasoning and non-reasoning capabilities. The model achieves performance comparable to OpenAI-o1 across math, code, and reasoning tasks.

Grok 4DeepSeek-R1
Web Site ?
Provider ?
Chat ?
Release Date ?
Modalities ?
text ?
images ?
voice ?
video ?
text ?
API Providers ?
xAI
DeepSeek, HuggingFace
Knowledge Cut-off Date ?
-
Unknown
Open Source ?
No
Yes
Pricing Input ?
$3.00 per million tokens
$0.55 per million tokens
Pricing Output ?
$15.00 per million tokens
$2.19 per million tokens
MMLU ?
-
90.8%
Pass@1
Source
MMLU-Pro ?
-
84%
EM
Source
MMMU ?
-
-
HellaSwag ?
-
-
HumanEval ?
-
-
MATH ?
-
-
GPQA ?
87.5%
Science
Source
71.5%
Pass@1
Source
IFEval ?
-
83.3%
Prompt Strict
Source
SimpleQA ?
-
-
AIME 2024
-
-
AIME 2025
91.7%
Competition Math
Source
-
Aider Polyglot ?
-
-
LiveCodeBench v5 ?
79%
Competitive Coding
Source
-
Global MMLU (Lite) ?
-
-
MathVista ?
-
-
Mobile Application

MathArena ?

Avg. Score
89%
82%
AIME 2025
A test based on problems from the American Invitational Mathematics Examination, designed to assess the mathematical skills of models.
91%
89%
HMMT February 2025
A test based on problems from the Harvard-MIT Mathematics Tournament, February 2025, designed to assess the mathematical skills of models.
92%
77%
BRUMO 2025
95%
92%
SMT 2025
A test based on problems from the Stanford Math Tournament, 2025, designed to assess the mathematical skills of models.
86%
83%
CMIMC 2025
A test based on problems from the Canadian Mathematical Olympiad, 2025, designed to assess the mathematical skills of models.
83%
69%

Compare LLMs

Add a Comment


10%
Our site uses cookies.

Privacy and Cookie Policy: This site uses cookies. By continuing to use the site, you agree to their use.