Llama 3.1 8B

by Meta

Llama 3.1 8B is Meta's fastest and cheapest Llama model, operating at very high throughput for classification, extraction, and lightweight summarization tasks. At $0.05/M tokens with a 128K context window, it's one of the best-value models on the platform.

Pricing

Input$0.05 / M tokens

Output$0.08 / M tokens

Context128K tokens

Get API Key →View all pricing →

Llama 3.1 8B

temp: 0.7

Send a message to start the conversation.

Temperature

00.72

System Prompt

High Contrast

Get API Key to run live →

More from Meta

View all →

Text

Llama 3.3 70B

Llama 3.3 70B is Meta's most capable 70B model, delivering performance competitive with much larger models on instruction-following and coding tasks. It's fully open-source under a permissive commercial license, making it the default choice for open deployments.

$0.20 / M tokens

Text

Llama 3.1 405B

Llama 3.1 405B is Meta's largest open-source model and the only fully open model competitive with GPT-4 on a broad range of benchmarks. It's the foundation for many fine-tuned specialized models across the open-source community.

$2.00 / M tokens

Text

Llama 3.1 70B

Llama 3.1 70B is the production workhorse of the Llama 3.1 family, offering an excellent balance of capability and inference cost for RAG pipelines and chat applications. It's widely deployed in production and benefits from the largest open fine-tune ecosystem.

$0.35 / M tokens

Text

Llama 3.2 90B Vision

Llama 3.2 90B Vision is Meta's largest multimodal open model, enabling high-quality image understanding across documents, charts, and natural scenes. It maintains strong text capability while adding best-in-class open vision performance.

$0.90 / M tokens