Qwen3 Embedding 8B
Specifications
- Input
- Output
- Context window
- 41K tokens
- Released
- Jun 2025
Performance
- Speed
- 1862 t/s
- TTFT
- —
- Latency
- 410 ms
- Intelligence
- —
Pricing
- Input
- €0.01 per 1M tokens
- Output
- €0.00 per 1M tokens
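As a rough illustration of what the listed rates mean in practice, the sketch below estimates the cost of embedding a corpus. The function name `embedding_cost_eur` and the corpus size are hypothetical; only the two per-million-token rates come from the pricing above, and actual billing may round or meter differently.

```python
from decimal import Decimal

# Listed rates from the pricing above (EUR per 1M tokens).
INPUT_EUR_PER_MILLION = Decimal("0.01")
OUTPUT_EUR_PER_MILLION = Decimal("0.00")


def embedding_cost_eur(input_tokens: int, output_tokens: int = 0) -> Decimal:
    """Estimate the cost in EUR for a given number of tokens (hypothetical helper)."""
    million = Decimal(1_000_000)
    input_cost = Decimal(input_tokens) / million * INPUT_EUR_PER_MILLION
    output_cost = Decimal(output_tokens) / million * OUTPUT_EUR_PER_MILLION
    return input_cost + output_cost


# Assumed example corpus: 50,000 documents of roughly 800 tokens each.
corpus_tokens = 50_000 * 800
print(f"Estimated cost: EUR {embedding_cost_eur(corpus_tokens)}")
```

Decimal arithmetic is used here so that very small per-token rates do not pick up floating-point noise.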
About this model
Qwen3 Embedding 8B is a dense retrieval embedding model with 8 billion parameters, optimized for semantic search, text similarity, and feature extraction. It is trained on diverse multilingual data, which gives it strong cross-lingual retrieval capabilities. It supports a 41K-token context window for embedding long documents and extensive text passages. Typical uses include document retrieval, semantic search, clustering, and recommendation systems. The model is compatible with standard embedding frameworks and optimized for efficient inference in production deployments.
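To make the semantic search use case concrete, here is a minimal sketch of ranking documents by cosine similarity between embedding vectors. It assumes the embeddings have already been obtained from the model. The 3-dimensional vectors, the document IDs, and the helper names `cosine_similarity` and `rank_documents` are made up for illustration, and real embeddings have far more dimensions. No API call is shown because this page does not specify an endpoint.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # Treat a zero vector as having no similarity.
    return dot / (norm_a * norm_b)


def rank_documents(query_vec, doc_vecs):
    """Return (doc_id, score) pairs sorted from most to least similar."""
    scored = [
        (doc_id, cosine_similarity(query_vec, vec))
        for doc_id, vec in doc_vecs.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy vectors standing in for real embeddings (assumed values).
query = [1.0, 0.0, 0.0]
docs = {
    "doc_a": [0.9, 0.1, 0.0],   # points almost the same way as the query
    "doc_b": [0.0, 1.0, 0.0],   # orthogonal to the query
    "doc_c": [-1.0, 0.0, 0.0],  # opposite direction
}

ranking = rank_documents(query, docs)
for doc_id, score in ranking:
    print(doc_id, round(score, 3))
```

The same ranking function can serve clustering or recommendation pipelines, since those also reduce to comparing vectors by similarity.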
Technical data
- Capabilities
- Input modalities
- Output modalities
- Reasoning
- No
Knowledge horizon
Released Jun 2025
Today
Since release 12 mo