Llama 4 Maverick

Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, built on a mixture-of-experts (MoE) architecture with 128 experts and 17 billion active parameters per forward pass (400B total). It supports multilingual text and image input, and produces multilingual text and code output across 12 supported languages. Optimized for vision-language tasks, Maverick is instruction-tuned for assistant-like behavior, image reasoning, and general-purpose multimodal interaction.
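
A practical consequence of the MoE design is that only a small slice of the 400B total parameters is exercised for any given token. The sketch below illustrates generic top-k expert routing; the hidden size, number of active experts per token, and routing details are illustrative assumptions, not Maverick's published configuration.

```python
# Minimal mixture-of-experts routing sketch (illustrative only).
# Hidden size and top-k are assumptions for demonstration; they are
# NOT Maverick's published configuration.
import numpy as np

HIDDEN = 64        # toy hidden dimension
NUM_EXPERTS = 128  # matches the "128E" in the model name
TOP_K = 1          # route each token to a single expert (assumption)

rng = np.random.default_rng(0)
router_w = rng.standard_normal((HIDDEN, NUM_EXPERTS)) * 0.02
expert_w = rng.standard_normal((NUM_EXPERTS, HIDDEN, HIDDEN)) * 0.02

def moe_layer(tokens: np.ndarray) -> np.ndarray:
    """Route each token to its top-k experts and combine their outputs."""
    logits = tokens @ router_w                        # (n_tokens, NUM_EXPERTS)
    probs = np.exp(logits - logits.max(axis=-1, keepdims=True))
    probs /= probs.sum(axis=-1, keepdims=True)
    top = np.argsort(-probs, axis=-1)[:, :TOP_K]      # chosen expert indices
    out = np.zeros_like(tokens)
    for i, token in enumerate(tokens):
        for e in top[i]:
            # Only the selected experts' weights are "active" for this token.
            out[i] += probs[i, e] * (token @ expert_w[e])
    return out

tokens = rng.standard_normal((4, HIDDEN))
print(moe_layer(tokens).shape)  # (4, 64): same output shape, few experts touched
```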

Maverick features early fusion for native multimodality and a 1 million token context window. It was trained on a curated mixture of public, licensed, and Meta-platform data, covering ~22 trillion tokens, with a knowledge cutoff in August 2024. Released on April 5, 2025 under the Llama 4 Community License, Maverick is suited for research and commercial applications requiring advanced multimodal understanding and high model throughput.
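
Because Maverick accepts interleaved text and image input, hosted deployments typically expose it through OpenAI-compatible chat endpoints. The snippet below is a minimal sketch of such a multimodal request; the base URL and model identifier are placeholder assumptions for whichever provider hosts the model, not values confirmed by this page.

```python
# Hypothetical multimodal request via an OpenAI-compatible endpoint.
# The base_url and model name below are placeholder assumptions.
from openai import OpenAI

client = OpenAI(
    base_url="https://example-provider.com/v1",  # placeholder provider endpoint
    api_key="YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="meta-llama/llama-4-maverick",  # placeholder model identifier
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Describe what is shown in this chart."},
                {"type": "image_url", "image_url": {"url": "https://example.com/chart.png"}},
            ],
        }
    ],
    max_tokens=512,
)
print(response.choices[0].message.content)
```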


Model details

Context window: 1,048,576 tokens
Max completion size: 87 tokens
Prompt cost per token: $0.00000015
Completion cost per token: $0.0000006
Accepts: text, image
Produces: text, code

Benchmark performance

Overall: 68 (17th)
Cost: 99 (2nd)
Logic: 75 (9th)
Speed: 91 (9th)
Scoring: 47 (10th)
Tool Use: 26 (8th)
Hallucination: 80 (8th)
Classification: 29 (3rd)
Structured Output: 33 (8th)

Pricing

Usage pricing (per token)
Prompt: $0.00000015
Completion: $0.0000006
Request: free
Image: free
Web Search: free
Internal Reasoning: free
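
At the listed per-token rates, the cost of a request is simply the product of token counts and prices. A small worked example follows, assuming a 20,000-token prompt and a 1,000-token completion (the token counts are illustrative, not taken from this page).

```python
# Worked cost example at the listed per-token rates.
PROMPT_RATE = 0.00000015      # USD per prompt token
COMPLETION_RATE = 0.0000006   # USD per completion token

prompt_tokens = 20_000        # assumed request size for illustration
completion_tokens = 1_000

cost = prompt_tokens * PROMPT_RATE + completion_tokens * COMPLETION_RATE
print(f"${cost:.6f}")  # $0.003600 -> a 20k-in / 1k-out request costs about a third of a cent
```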
