Qwen3 30B A3B Instruct
Specifications
- Input
- Output
- Context window
- 262K tokens
- Released
- Jul 2025
Performance
- Speed
- 176 t/s
- TTFT
- 346 ms
- Latency
- 111 ms
- Intelligence
- —
Pricing
- Input
- €0.10 per 1M tokens
- Output
- €0.30 per 1M tokens
About this model
Qwen3 30B A3B Instruct is a compact Mixture-of-Experts model with 30B total parameters and 3B activated per token, offering excellent efficiency for general-purpose tasks. Features 262K native context with extension to 1M tokens, strong multilingual capabilities, and enhanced instruction following. Balances performance and computational efficiency with support for tool calling, code generation, and logical reasoning. Ideal for deployment scenarios requiring lower resource usage while maintaining quality across diverse task types.
Technical specifications
- Capabilities
- Input modalities
- Output modalities
- Reasoning
- No
Knowledge horizon
Released Jul 2025
Today
Since release 11 mo
See also
Add Model to Comparison
Search for a model to add