Grok 3 Beta

Grok 3 is xAI's most advanced model, trained on the Colossus supercluster with 10 times the computational power of previous state-of-the-art models. It boasts a 1M-token context window and advanced reasoning capabilities, enhanced through large-scale reinforcement learning, enabling deep thought processes ranging from seconds to minutes for solving complex problems. The model achieves top-tier performance across academic benchmarks and real-world user evaluations, earning an Elo score of 1402 in the Chatbot Arena. It was released alongside Grok 3 Mini, a cost-efficient variant optimized for streamlined reasoning.

Mistral Large 2

Mistral Large 2, developed by Mistral, offers a 128K-token context window and is priced at $3.00 per million input tokens and $9.00 per million output tokens. Released on July 24, 2024, the model scored 84.0 on the MMLU benchmark in a 5-shot evaluation, demonstrating strong performance in diverse tasks.

Grok 3 BetaMistral Large 2
Web Site ?
Provider ?
Chat ?
Release Date ?
Modalities ?
text ?
images ?
video ?
text ?
API Providers ?
xAI
Azure AI, AWS Bedrock, Google AI Studio, Vertex AI, Snowflake Cortex
Knowledge Cut-off Date ?
2025-01
Unknown
Open Source ?
No
Yes
Pricing Input ?
Not available
$3.00 per million tokens
Pricing Output ?
Not available
$9.00 per million tokens
MMLU ?
Not available
84%
5-shot
Source
MMLU-Pro ?
79.9%
Base model
Source
50.69%
Source
MMMU ?
78%
With Think mode
Source
Not available
HellaSwag ?
Not available
Not available
HumanEval ?
Not available
Not available
MATH ?
Not available
1.13%
Source
GPQA ?
84.6%
With Think mode, Diamond
Source
24.94%
IFEval ?
Not available
84.01%
SimpleQA ?
-
-
AIME 2024
-
-
AIME 2025
-
-
Aider Polyglot ?
-
-
LiveCodeBench v5 ?
-
-
Global MMLU (Lite) ?
-
-
MathVista ?
-
-
Mobile Application
-

Compare LLMs

Add a Comment


10%
Our site uses cookies.

Privacy and Cookie Policy: This site uses cookies. By continuing to use the site, you agree to their use.