Qwen3 235B A22B Instruct
Specifications
- Input
- Output
- Context window
- 262K tokens
- Released
- Apr 2025
Performance
- Speed
- 93 t/s
- TTFT
- 311 ms
- Latency
- —
- Intelligence
- —
Pricing
- Input
- €0.25 per 1M tokens
- Output
- €0.75 per 1M tokens
About this model
Qwen3 235B A22B Instruct is a Mixture-of-Experts model with 235B total parameters and 22B activated, featuring 128 experts with 8 activated per token. Native 262K context extended to 1M tokens via Dual Chunk Attention. Achieves SOTA: 83.0 MMLU-Pro, 70.3 AIME25, 41.8 ARC-AGI, 79.2 Arena-Hard v2, 51.8 LiveCodeBench, 70.9 BFCL-v3. Non-thinking mode focused on direct task execution with enhanced instruction following, logical reasoning, and long-tail knowledge across multiple languages. Dramatically more efficient than full 235B models.
Technical specifications
- Capabilities
- Input modalities
- Output modalities
- Reasoning
- No
Knowledge horizon
Knowledge cutoff Mar 2024
Released Apr 2025
Today
Training to release 13 mo Since release 13 mo
See also
Add Model to Comparison
Search for a model to add