Llama 3.1 8B
by Meta
Llama 3.1 8B is Meta's fastest and cheapest Llama model, operating at very high throughput for classification, extraction, and lightweight summarization tasks. At $0.05/M tokens with a 128K context window, it's one of the best-value models on the platform.
More from Meta
Llama 3.3 70B
Llama 3.3 70B is Meta's most capable 70B model, delivering performance competitive with much larger models on instruction-following and coding tasks. Its weights are openly available under the Llama 3.3 Community License, which permits commercial use subject to its terms, making it a common default for open deployments.
$0.20 / M tokens
Llama 3.1 405B
Llama 3.1 405B is Meta's largest open-weight model and one of the few openly available models competitive with GPT-4 on a broad range of benchmarks. It's the foundation for many fine-tuned specialized models across the open-source community.
$2.00 / M tokens
Llama 3.1 70B
Llama 3.1 70B is the production workhorse of the Llama 3.1 family, offering an excellent balance of capability and inference cost for RAG pipelines and chat applications. It's widely deployed in production and benefits from the largest open fine-tune ecosystem.
$0.35 / M tokens
Llama 3.2 90B Vision
Llama 3.2 90B Vision is Meta's largest multimodal open model, enabling high-quality image understanding across documents, charts, and natural scenes. It maintains strong text capability while adding best-in-class open vision performance.
$0.90 / M tokens
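To compare the listed prices for a given workload, the per-million-token rates can be turned into a quick cost estimate. The sketch below is illustrative only: the `PRICE_PER_M_TOKENS` table and the `estimate_cost` helper are hypothetical names, and it assumes a single flat rate per token, since the listings do not distinguish input from output pricing.

```python
# Prices in USD per million tokens, copied from the model listings.
# Assumption: one flat rate covers both input and output tokens.
PRICE_PER_M_TOKENS = {
    "Llama 3.1 8B": 0.05,
    "Llama 3.3 70B": 0.20,
    "Llama 3.1 70B": 0.35,
    "Llama 3.2 90B Vision": 0.90,
    "Llama 3.1 405B": 2.00,
}


def estimate_cost(model: str, tokens: int) -> float:
    """Return the estimated USD cost of processing `tokens` tokens."""
    if tokens < 0:
        raise ValueError("tokens must be non-negative")
    return PRICE_PER_M_TOKENS[model] * tokens / 1_000_000


if __name__ == "__main__":
    workload = 10_000_000  # hypothetical workload of 10M tokens
    # Print the models from cheapest to most expensive for this workload.
    for model in sorted(PRICE_PER_M_TOKENS, key=PRICE_PER_M_TOKENS.get):
        print(f"{model:<22} ${estimate_cost(model, workload):.2f}")
```

Actual billing may differ from this estimate if the platform prices input and output tokens separately.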