Llama 3.1 8B Instruct
Specifications
- Input
- Output
- Context window
- 131K tokens
- Veröffentlicht
- Jul 2024
Performance
- Speed
- 43 t/s
- TTFT
- 128 ms
- Latency
- 282 ms
- Intelligence
- —
Pricing
- Eingabe
- €0.10 per 1M tokens
- Ausgabe
- €0.10 per 1M tokens
Über dieses Modell
Meta Llama 3.1 8B Instruct is an efficient multilingual instruction-tuned model optimized for dialogue and assistant use cases. With 8 billion parameters and 128K context length, it provides strong performance across general tasks, code generation, and multilingual understanding. Supports function calling and tool use with Grouped-Query Attention architecture. Ideal for deployment scenarios requiring lower compute resources while maintaining quality across English and 7 additional languages including German, French, Spanish, and Hindi.
Technische Daten
- Fähigkeiten
- Eingabe-Modalitäten
- Ausgabe-Modalitäten
- Reasoning
- No
Knowledge horizon
Wissensstand Dec 2023
Veröffentlicht Jul 2024
Today
Training to release 7 mo Since release 23 mo
See also
Modell zum Vergleich hinzufügen
Nach einem Modell suchen