Gemini 2.5 Pro

Gemini 2.5 Pro is Google's most advanced AI model, engineered for deep reasoning and thoughtful response generation. It outperforms on key benchmarks, demonstrating exceptional logic and coding proficiency. Optimized for building dynamic web applications, autonomous code systems, and code adaptation, it delivers high-level performance. With built-in multimodal capabilities and an extended context window, the model efficiently processes large datasets and integrates diverse information sources to tackle complex challenges.

Llama 3.3 70B Instruct

Llama 3.3 70B Instruct, created by Meta, is a multilingual large language model specifically fine-tuned for instruction-based tasks and optimized for conversational applications. It is capable of processing and generating text in multiple languages, with a context window supporting up to 128,000 tokens. Launched on December 6, 2024, the model surpasses numerous open-source and proprietary chat models in various industry benchmarks. It utilizes Grouped-Query Attention (GQA) to improve scalability and has been trained on a diverse dataset comprising over 15 trillion tokens from publicly available sources. The model's knowledge is current up to December 2023.

Gemini 2.5 ProLlama 3.3 70B Instruct
Web Site ?
Provider ?
Chat ?
Release Date ?
Modalities ?
text ?
images ?
voice ?
video ?
text ?
API Providers ?
Google AI Studio, Vertex AI, Gemini app
Fireworks, Together, DeepInfra, Hyperbolic
Knowledge Cut-off Date ?
-
12.2024
Open Source ?
No
Yes
Pricing Input ?
Not available
$0.23 per million tokens
Pricing Output ?
Not available
$0.40 per million tokens
MMLU ?
Not available
86%
0-shot, CoT
Source
MMLU-Pro ?
Not available
68.9%
5-shot, CoT
Source
MMMU ?
81.7%
Source
Not available
HellaSwag ?
Not available
Not available
HumanEval ?
Not available
88.4%
pass@1
Source
MATH ?
Not available
77%
0-shot, CoT
Source
GPQA ?
84.0%
Diamond Science
Source
50.5%
0-shot, CoT
Source
IFEval ?
Not available
92.1%
Source
SimpleQA ?
52.9%
-
AIME 2024
92.0%
-
AIME 2025
86.7%
-
Aider Polyglot ?
74.0% / 68.6%
-
LiveCodeBench v5 ?
70.4%
-
Global MMLU (Lite) ?
89.8%
-
MathVista ?
-
-
Mobile Application
-

VideoGameBench ?

Total score
0.48%
-
Doom II
0%
-
Dream DX
4.8%
-
Awakening DX
0%
-
Civilization I
0%
-
Pokemon Crystal
0%
-
The Need for Speed
0%
-
The Incredible Machine
0%
-
Secret Game 1
0%
-
Secret Game 2
0%
-
Secret Game 3
0%
-

Compare LLMs

Add a Comment


10%
Our site uses cookies.

Privacy and Cookie Policy: This site uses cookies. By continuing to use the site, you agree to their use.