Llama 4 Maverick

Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, built on a mixture-of-experts (MoE) architecture with 128 experts and 17 billion active parameters per forward pass (400B total). It supports multilingual text and image input, and produces multilingual text and code output across 12 supported languages. Optimized for vision-language tasks, Maverick is instruction-tuned for assistant-like behavior, image reasoning, and general-purpose multimodal interaction.
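
A practical consequence of the MoE design is that only a small slice of the 400B total parameters is exercised for any given token. The sketch below illustrates generic top-k expert routing; the hidden size, number of active experts per token, and routing details are illustrative assumptions, not Maverick's published configuration.

```python
# Minimal mixture-of-experts routing sketch (illustrative only).
# Hidden size and top-k are assumptions for demonstration; they are
# NOT Maverick's published configuration.
import numpy as np

HIDDEN = 64        # toy hidden dimension
NUM_EXPERTS = 128  # matches the "128E" in the model name
TOP_K = 1          # route each token to a single expert (assumption)

rng = np.random.default_rng(0)
router_w = rng.standard_normal((HIDDEN, NUM_EXPERTS)) * 0.02
expert_w = rng.standard_normal((NUM_EXPERTS, HIDDEN, HIDDEN)) * 0.02

def moe_layer(tokens: np.ndarray) -> np.ndarray:
    """Route each token to its top-k experts and combine their outputs."""
    logits = tokens @ router_w                        # (n_tokens, NUM_EXPERTS)
    probs = np.exp(logits - logits.max(axis=-1, keepdims=True))
    probs /= probs.sum(axis=-1, keepdims=True)
    top = np.argsort(-probs, axis=-1)[:, :TOP_K]      # chosen expert indices
    out = np.zeros_like(tokens)
    for i, token in enumerate(tokens):
        for e in top[i]:
            # Only the selected experts' weights are "active" for this token.
            out[i] += probs[i, e] * (token @ expert_w[e])
    return out

tokens = rng.standard_normal((4, HIDDEN))
print(moe_layer(tokens).shape)  # (4, 64): same output shape, few experts touched
```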

Maverick features early fusion for native multimodality and a 1 million token context window. It was trained on a curated mixture of public, licensed, and Meta-platform data, covering ~22 trillion tokens, with a knowledge cutoff in August 2024. Released on April 5, 2025 under the Llama 4 Community License, Maverick is suited for research and commercial applications requiring advanced multimodal understanding and high model throughput.
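
Because Maverick accepts interleaved text and image input, hosted deployments typically expose it through OpenAI-compatible chat endpoints. The snippet below is a minimal sketch of such a multimodal request; the base URL and model identifier are placeholder assumptions for whichever provider hosts the model, not values confirmed by this page.

```python
# Hypothetical multimodal request via an OpenAI-compatible endpoint.
# The base_url and model name below are placeholder assumptions.
from openai import OpenAI

client = OpenAI(
    base_url="https://example-provider.com/v1",  # placeholder provider endpoint
    api_key="YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="meta-llama/llama-4-maverick",  # placeholder model identifier
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Describe what is shown in this chart."},
                {"type": "image_url", "image_url": {"url": "https://example.com/chart.png"}},
            ],
        }
    ],
    max_tokens=512,
)
print(response.choices[0].message.content)
```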


Model details

Context window: 1,048,576 tokens
Max completion size: 87 tokens
Prompt cost per token: $0.00000015
Completion cost per token: $0.0000006
Accepts: text, image
Produces: text, code

Benchmark performance

Overall: 68 (17th)
Cost: 99 (2nd)
Logic: 75 (9th)
Speed: 91 (9th)
Scoring: 47 (10th)
Tool Use: 26 (8th)
Hallucination: 80 (8th)
Classification: 29 (3rd)
Structured Output: 33 (8th)

Pricing

Usage pricing (per token)
Prompt: $0.00000015
Completion: $0.0000006
Request: free
Image: free
Web Search: free
Internal Reasoning: free
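
At the listed per-token rates, the cost of a request is simply the product of token counts and prices. A small worked example follows, assuming a 20,000-token prompt and a 1,000-token completion (the token counts are illustrative, not taken from this page).

```python
# Worked cost example at the listed per-token rates.
PROMPT_RATE = 0.00000015      # USD per prompt token
COMPLETION_RATE = 0.0000006   # USD per completion token

prompt_tokens = 20_000        # assumed request size for illustration
completion_tokens = 1_000

cost = prompt_tokens * PROMPT_RATE + completion_tokens * COMPLETION_RATE
print(f"${cost:.6f}")  # $0.003600 -> a 20k-in / 1k-out request costs about a third of a cent
```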
