Qwen3 30B A3B Instruct
Specifications
- Input
- Output
- Context window
- 262K tokens
- Released
- Apr 2025
Performance
- Speed
- 49 t/s
- TTFT
- 161 ms
- Latency
- —
- Intelligence
- —
Pricing
- Input
- €0.13 per 1M tokens
- Output
- €0.38 per 1M tokens
About this model
Qwen3 30B A3B Instruct is a compact Mixture-of-Experts model with 30B total parameters and 3B activated per token, offering excellent efficiency for general-purpose tasks. Features 262K native context with extension to 1M tokens, strong multilingual capabilities, and enhanced instruction following. Balances performance and computational efficiency with support for tool calling, code generation, and logical reasoning. Ideal for deployment scenarios requiring lower resource usage while maintaining quality across diverse task types.
Technical specifications
- Capabilities
- Input modalities
- Output modalities
- Reasoning
- No
Knowledge horizon
Knowledge cutoff Mar 2024
Released Apr 2025
Today
Training to release 13 mo Since release 13 mo
See also
Add Model to Comparison
Search for a model to add