Command Palette
Search for a command to run

Qwen3 Embedding 8B

by Qwen

Specifications

Input
Output
Context window
41K tokens
Released
Jun 2025

Performance

Speed
1862 t/s
TTFT
Latency
Intelligence

Pricing

Input
€0.02
per 1M tokens
Output
€0.00
per 1M tokens

About this model

Qwen3 Embedding 8B is a dense retrieval embedding model with 8 billion parameters, optimized for semantic search, text similarity, and feature extraction. Trained on diverse multilingual data providing strong cross-lingual retrieval capabilities. Supports 262K context for embedding long documents and extensive text passages. Excels at document retrieval, semantic search, clustering, and recommendation systems. Compatible with standard embedding frameworks and optimized for production deployment with efficient inference.

Technical specifications

Capabilities
Input modalities
Output modalities
Reasoning
No

Knowledge horizon

Released Jun 2025
Today
Since release 11 mo

See also

Add Model to Comparison
Search for a model to add
Command Palette
Search for a command to run