BGE M3
Specifications
- Input
- Output
- Context window
- 8K tokens
- Released
- Jan 2024
Performance
- Speed
- 2121 t/s
- TTFT
- —
- Latency
- 359 ms
- Intelligence
- —
Pricing
- Input
- €0.01 per 1M tokens
- Output
- €0.00 per 1M tokens
About this model
BAAI BGE-M3 is a versatile multilingual embedding model supporting dense, sparse, and multi-vector retrieval in a unified architecture. Handles 100+ languages with strong cross-lingual capabilities and flexible retrieval modes for different use cases. Features hybrid retrieval combining dense embeddings for semantic similarity, sparse representations for lexical matching, and multi-vector approaches for fine-grained relevance. Ideal for multilingual search engines, hybrid retrieval systems, and complex information retrieval scenarios requiring multiple matching strategies.
Technical specifications
- Capabilities
- Input modalities
- Output modalities
- Reasoning
- No
Knowledge horizon
Released Jan 2024
Today
Since release 29 mo
See also
Add Model to Comparison
Search for a model to add