DeepSeek-R1

DeepSeek-R1 is a 671B-parameter Mixture-of-Experts (MoE) model with 37B parameters activated per token, trained via large-scale reinforcement learning with a focus on reasoning. Its training pipeline combines two RL stages, which discover improved reasoning patterns and align the model with human preferences, with two SFT stages that seed its reasoning and non-reasoning capabilities. The model achieves performance comparable to OpenAI-o1 across math, code, and reasoning tasks.
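DeepSeek-R1 is served through an OpenAI-compatible chat completions endpoint. The sketch below is a minimal usage example, assuming the `deepseek-reasoner` model identifier and an API key stored in the `DEEPSEEK_API_KEY` environment variable; check DeepSeek's documentation for the current model names.

```python
# Minimal sketch: querying DeepSeek-R1 via DeepSeek's OpenAI-compatible API.
# Assumes the `openai` Python package and a DEEPSEEK_API_KEY environment variable.
import os
from openai import OpenAI

client = OpenAI(
    api_key=os.environ["DEEPSEEK_API_KEY"],
    base_url="https://api.deepseek.com",  # DeepSeek's OpenAI-compatible endpoint
)

response = client.chat.completions.create(
    model="deepseek-reasoner",  # assumed identifier for DeepSeek-R1
    messages=[
        {"role": "user", "content": "Prove that the square root of 2 is irrational."}
    ],
)

# Per DeepSeek's docs, the chain of thought is returned in a separate
# `reasoning_content` field; the final answer is in `content`.
print(response.choices[0].message.content)
```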

GPT-4.5

OpenAI's GPT-4.5 model, released on February 27, 2025, is reported to have 12.8 trillion parameters (a figure OpenAI has not officially confirmed) and a 128K-token context window. It features broad general knowledge, improved emotional intelligence, multimodal input processing (text and images), sophisticated function calling, and real-time streaming responses. Initially rolled out to ChatGPT Pro users and later made available to Plus and Team subscribers, it generates responses at approximately 37 tokens per second, making it a strong choice for tasks requiring high-level reasoning and emotional depth.
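As an illustration of the streaming interface mentioned above, here is a minimal sketch using OpenAI's Python SDK. The model name `gpt-4.5-preview` is an assumption and may differ depending on account and release stage.

```python
# Minimal streaming sketch with the OpenAI Python SDK.
# The model name "gpt-4.5-preview" is an assumption; check your account's model list.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

stream = client.chat.completions.create(
    model="gpt-4.5-preview",
    messages=[
        {"role": "user", "content": "Summarize the theory of relativity in two sentences."}
    ],
    stream=True,  # tokens arrive incrementally instead of in one final block
)

for chunk in stream:
    delta = chunk.choices[0].delta.content
    if delta:
        print(delta, end="", flush=True)
print()
```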

|                        | DeepSeek-R1                  | GPT-4.5                       |
|------------------------|------------------------------|-------------------------------|
| Provider               | DeepSeek                     | OpenAI                        |
| Release Date           | Jan 21, 2025                 | Feb 27, 2025                  |
| Modalities             | text                         | text, images                  |
| API Providers          | DeepSeek, HuggingFace        | OpenAI, Azure OpenAI Service  |
| Knowledge Cut-off Date | Unknown                      | 2023-10                       |
| Open Source            | Yes                          | No                            |
| Pricing (Input)        | $0.55 per million tokens     | $75.00 per million tokens     |
| Pricing (Output)       | $2.19 per million tokens     | $150.00 per million tokens    |
| MMLU                   | 90.8% (Pass@1)               | Not available                 |
| MMLU-Pro               | 84% (EM)                     | Not available                 |
| MMMU                   | -                            | 74.4%                         |
| HellaSwag              | -                            | Not available                 |
| HumanEval              | -                            | Not available                 |
| MATH                   | -                            | Not available                 |
| GPQA                   | 71.5% (Pass@1)               | 71.4% (science)               |
| IFEval                 | 83.3% (Prompt Strict)        | Not available                 |
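To make the pricing gap above concrete, the following sketch computes the cost of a single request from the listed per-million-token rates. The 2,000 input / 500 output token counts are hypothetical.

```python
# Worked example: request cost from the per-million-token prices listed above.
# Prices are in USD per 1M tokens; the token counts below are hypothetical.
PRICES = {
    "DeepSeek-R1": {"input": 0.55, "output": 2.19},
    "GPT-4.5": {"input": 75.00, "output": 150.00},
}

def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Cost in USD for one request at the listed rates."""
    p = PRICES[model]
    return (input_tokens * p["input"] + output_tokens * p["output"]) / 1_000_000

for model in PRICES:
    cost = request_cost(model, input_tokens=2_000, output_tokens=500)
    print(f"{model}: ${cost:.4f}")

# Output:
#   DeepSeek-R1: $0.0022
#   GPT-4.5: $0.2250
# At these rates the same request costs roughly 100x more on GPT-4.5.
```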