Grok 3 Beta

Grok 3 is xAI's most advanced model, trained on the Colossus supercluster with 10 times the computational power of previous state-of-the-art models. It boasts a 1M-token context window and advanced reasoning capabilities, enhanced through large-scale reinforcement learning, enabling deep thought processes ranging from seconds to minutes for solving complex problems. The model achieves top-tier performance across academic benchmarks and real-world user evaluations, earning an Elo score of 1402 in the Chatbot Arena. It was released alongside Grok 3 Mini, a cost-efficient variant optimized for streamlined reasoning.

Mistral Large 2

Mistral Large 2, developed by Mistral, offers a 128K-token context window and is priced at $3.00 per million input tokens and $9.00 per million output tokens. Released on July 24, 2024, the model scored 84.0 on the MMLU benchmark in a 5-shot evaluation, demonstrating strong performance in diverse tasks.

Grok 3 BetaMistral Large 2
Provider
Web Site
Release Date
Jan 19, 2025
3 months ago
Jun 24, 2024
9 months ago
Modalities
text ?
images ?
video ?
text ?
API Providers
xAI
Azure AI, AWS Bedrock, Google AI Studio, Vertex AI, Snowflake Cortex
Knowledge Cut-off Date
2025-01
Unknown
Open Source
No
Yes
Pricing Input
Not available
$3.00 per million tokens
Pricing Output
Not available
$9.00 per million tokens
MMLU
Not available
84%
5-shot
Source
MMLU Pro
79.9%
Base model
Source
50.69%
Source
MMMU
78%
With Think mode
Source
Not available
HellaSwag
Not available
Not available
HumanEval
Not available
Not available
MATH
Not available
1.13%
Source
GPQA
84.6%
With Think mode, Diamond
Source
24.94%
IFEval
Not available
84.01%
Mobile Application
-

Compare LLMs

Add a Comment


10%
Our site uses cookies.

Privacy and Cookie Policy: This site uses cookies. By continuing to use the site, you agree to their use.