Qwen3 Embedding 8B
Specifications
- Input
- Output
- Context window
- 41K tokens
- Released
- Jun 2025
Performance
- Speed
- 1862 t/s
- TTFT
- —
- Latency
- —
- Intelligence
- —
Pricing
- Input
- €0.02 per 1M tokens
- Output
- €0.00 per 1M tokens
About this model
Qwen3 Embedding 8B is a dense retrieval embedding model with 8 billion parameters, optimized for semantic search, text similarity, and feature extraction. Trained on diverse multilingual data providing strong cross-lingual retrieval capabilities. Supports 262K context for embedding long documents and extensive text passages. Excels at document retrieval, semantic search, clustering, and recommendation systems. Compatible with standard embedding frameworks and optimized for production deployment with efficient inference.
Technical specifications
- Capabilities
- Input modalities
- Output modalities
- Reasoning
- No
Knowledge horizon
Released Jun 2025
Today
Since release 11 mo
See also
Add Model to Comparison
Search for a model to add