Llama 3.3 70B Instruct

Meta Llama 3.3 is a multilingual large language model (LLM): a pretrained and instruction-tuned generative model with 70B parameters (text in, text out). The instruction-tuned, text-only model is optimized for multilingual dialogue and outperforms many available open-source and closed chat models on common industry benchmarks.

Supported languages: English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai.

Model Card

Model details

Context window: 131,072 tokens
Max completion size: 16 tokens
Prompt cost / 1K tokens: $0.00000013
Completion cost / 1K tokens: $0.00000038
Accepts: text
Produces: text
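
The limits above can be turned into a quick pre-flight check before sending a request. The following is a minimal sketch, not a provider API: `fits_budget` is a hypothetical helper, the two constants are copied from the listing as shown, and it assumes that prompt and completion tokens share the same context window, which may not hold for every provider.

```python
# Limits copied from the listing above.
CONTEXT_WINDOW = 131_072  # tokens
MAX_COMPLETION = 16       # tokens, as listed on the page


def fits_budget(prompt_tokens: int, completion_tokens: int) -> bool:
    """Return True if a request stays within the listed limits (hypothetical helper).

    Assumption: prompt and completion tokens both count against the context window.
    """
    if prompt_tokens < 0 or completion_tokens < 0:
        raise ValueError("token counts must be non-negative")
    within_completion_cap = completion_tokens <= MAX_COMPLETION
    within_context = prompt_tokens + completion_tokens <= CONTEXT_WINDOW
    return within_completion_cap and within_context
```

For example, `fits_budget(2_000, 16)` passes both checks, while any request asking for more than 16 completion tokens is rejected regardless of prompt length.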

Benchmark performance

Overall: score 65, placement 20th
Cost: score 99, placement 2nd
Logic: score 59, placement 15th
Speed: score 79, placement 20th
Scoring: score 55, placement 6th
Tool Use: score 6, placement 11th
Hallucination: score 100, placement 1st
Classification: score 10, placement 5th
Structured Output: score 67, placement 5th

Pricing

Usage pricing
Prompt: $0.00000013
Completion: $0.00000038
Request: FREE
Image: FREE
Web Search: FREE
Internal Reasoning: FREE
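
The prompt and completion rates can be combined into a per-request estimate. The sketch below is illustrative only: `estimate_cost` is a hypothetical helper, and it takes the page's "per 1K tokens" label at face value, so `TOKENS_PER_UNIT` should be adjusted if the provider quotes rates in a different unit. `Decimal` is used to avoid floating-point rounding on such small amounts.

```python
from decimal import Decimal

# Rates copied from the listing; the per-1K-token unit comes from the page label.
PROMPT_RATE = Decimal("0.00000013")      # USD per TOKENS_PER_UNIT prompt tokens
COMPLETION_RATE = Decimal("0.00000038")  # USD per TOKENS_PER_UNIT completion tokens
TOKENS_PER_UNIT = 1000                   # assumption: rates are quoted per 1,000 tokens


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    if prompt_tokens < 0 or completion_tokens < 0:
        raise ValueError("token counts must be non-negative")
    prompt_cost = PROMPT_RATE * prompt_tokens / TOKENS_PER_UNIT
    completion_cost = COMPLETION_RATE * completion_tokens / TOKENS_PER_UNIT
    return prompt_cost + completion_cost
```

Under these assumptions the completion rate is roughly three times the prompt rate, so long outputs dominate the bill faster than long prompts. Request, image, web search, and internal reasoning charges are listed as free and are therefore left out of the estimate.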

Best Overall scoring LLMs

1st: score 88
2nd: score 86
3rd: score 84
4th: score 82
4th: score 82
5th: score 80