Meta
Llama 3.3 70B Instruct

The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction-tuned generative model with 70B parameters (text in, text out). The instruction-tuned, text-only Llama 3.3 model is optimized for multilingual dialogue use cases and outperforms many of the available open-source and closed chat models on common industry benchmarks.

Supported languages: English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai.

Model Card

Model details

Context window: 131,072 tokens

Max completion size: 16 tokens

Prompt cost / 1K tokens: $0.00000013

Completion cost / 1K tokens: $0.00000038

Score: 100, placement: 1st

Classification placement: 5th

Structured Output placement: 5th

Pricing

Usage pricing
Prompt	$0.00000013
Completion	$0.00000038
Request	FREE
Image	FREE
Web Search	FREE
Internal Reasoning	FREE
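
As a rough illustration of how the listed rates translate into a per-request cost, here is a minimal Python sketch. It takes the per-1K-token unit from the model details at face value. The function name `estimate_cost` and the choice to count prompt plus completion tokens against the 131,072-token context window are assumptions made for illustration, not part of any provider API.

```python
# Rates and context window copied from this page. Applying the rates per
# 1,000 tokens follows the "Model details" labels and is an assumption.
PROMPT_RATE_PER_1K = 0.00000013      # USD per 1K prompt tokens
COMPLETION_RATE_PER_1K = 0.00000038  # USD per 1K completion tokens
CONTEXT_WINDOW = 131_072             # tokens


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper)."""
    if prompt_tokens < 0 or completion_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Assumption: prompt and completion share the same context window.
    if prompt_tokens + completion_tokens > CONTEXT_WINDOW:
        raise ValueError("request exceeds the context window")
    prompt_cost = (prompt_tokens / 1000) * PROMPT_RATE_PER_1K
    completion_cost = (completion_tokens / 1000) * COMPLETION_RATE_PER_1K
    return prompt_cost + completion_cost


if __name__ == "__main__":
    # Example: a 10,000-token prompt with a 500-token completion.
    print(f"${estimate_cost(10_000, 500):.10f}")
```

Request, image, web search, and internal reasoning charges are listed as free, so the sketch leaves them out.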

Benchmark performance

All scores have a maximum of 100 points.

Categories: Overall, Cost, Logic, Speed, Scoring, Tool Use, Hallucination, Classification, Structured Output, Pricing

Grok 4 Fast

Qwen3 VL 235B A22B Instruct

Grok 4.1 Fast

GPT-5.1 Chat

GPT-5.1-Codex

Claude Haiku 4.5
