Llama 3.1 8B Instruct

by Meta

Specifications

Input
Output
Context window: 128K tokens
Released: Jul 2024

Performance

Speed: 60 t/s
TTFT: 494 ms
Latency: 218 ms
Intelligence: —
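
As a rough illustration of how the speed and TTFT figures combine, the sketch below estimates the wall-clock time for a streamed response. It assumes total time is approximately TTFT plus output tokens divided by throughput, which is a simplification; the page does not say how these figures were measured or how Latency relates to TTFT. The function name and parameters are hypothetical.

```python
def estimate_response_seconds(
    output_tokens: int,
    ttft_ms: float = 494.0,           # time to first token, from the listing
    tokens_per_second: float = 60.0,  # generation speed, from the listing
) -> float:
    """Approximate seconds until the last token arrives (assumed model: TTFT + tokens / speed)."""
    return ttft_ms / 1000.0 + output_tokens / tokens_per_second


if __name__ == "__main__":
    # A 300-token answer: 0.494 s to first token plus 300 / 60 = 5 s of generation.
    print(f"{estimate_response_seconds(300):.3f} s")
```

Under this assumed model, generation time dominates for anything longer than a few dozen tokens, so throughput matters more than TTFT for long answers.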

Pricing

Input: €0.15
Output: €0.15
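
The listing does not state the billing unit for these prices. The sketch below estimates a request's cost under the explicit assumption that each price applies per 1,000,000 tokens; if the provider bills per a different unit, change `tokens_per_price_unit`. All names here are hypothetical.

```python
def estimate_cost_eur(
    input_tokens: int,
    output_tokens: int,
    input_price_eur: float = 0.15,   # listed input price
    output_price_eur: float = 0.15,  # listed output price
    tokens_per_price_unit: int = 1_000_000,  # ASSUMPTION: unit is not stated on the page
) -> float:
    """Estimate cost in euros, assuming prices are quoted per `tokens_per_price_unit` tokens."""
    input_cost = input_tokens / tokens_per_price_unit * input_price_eur
    output_cost = output_tokens / tokens_per_price_unit * output_price_eur
    return input_cost + output_cost


if __name__ == "__main__":
    # 200,000 input tokens and 50,000 output tokens under the assumed unit.
    print(f"€{estimate_cost_eur(200_000, 50_000):.4f}")
```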

About this model

Meta Llama 3.1 8B Instruct is an efficient, multilingual, instruction-tuned model optimized for dialogue and assistant use cases. With 8 billion parameters and a 128K-token context window, it provides strong performance across general tasks, code generation, and multilingual understanding. It supports function calling and tool use, and its architecture uses Grouped-Query Attention. It suits deployments with limited compute resources that still need consistent quality across English and 7 additional languages, including German, French, Spanish, and Hindi.

Technical data

Capabilities
Input modalities
Output modalities
Reasoning: No

Knowledge horizon

Knowledge cutoff: Dec 2023

Released: Jul 2024

Today

Training to release: 7 mo
Since release: 24 mo
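
The month counts on this timeline can be reproduced with simple calendar arithmetic. The sketch below counts whole calendar months between two dates and treats each listed month as its first day. The "today" value of July 2026 is an assumption inferred from the 24-month figure, not something the page states.

```python
from datetime import date


def months_between(start: date, end: date) -> int:
    """Calendar months from start to end, ignoring the day of month."""
    return (end.year - start.year) * 12 + (end.month - start.month)


knowledge_cutoff = date(2023, 12, 1)  # Dec 2023, from the listing
released = date(2024, 7, 1)           # Jul 2024, from the listing
assumed_today = date(2026, 7, 1)      # ASSUMPTION: inferred from "24 mo"

if __name__ == "__main__":
    print(months_between(knowledge_cutoff, released))  # training to release
    print(months_between(released, assumed_today))     # since release
```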

See also

Kimi K2.6

Qwen 3.6 27B

Gemma 4 31B