Qwen 3.5 9B
Specifications
- Input
- Output
- Context window
- 262K tokens
- Released
- Mar 2026
Performance
- Speed
- 18 t/s
- TTFT
- 286 ms
- Latency
- —
- Intelligence
- —
Pricing
- Input
- €0.18 per 1M tokens
- Output
- €0.23 per 1M tokens
About this model
Qwen 3.5 9B is a 9B‑parameter multimodal large language model with a gated‑delta mixture‑of‑experts architecture and a vision encoder. It supports a native context window of 262,144 tokens and operates in a default thinking mode that can be disabled. The model achieves strong results such as 82.5% on MMLU‑Pro, 88.2% on C‑Eval, and 78.4% on MMMU benchmarks. It is released under the Apache 2.0 license.
Technical specifications
- Capabilities
- Input modalities
- Output modalities
- Reasoning
- Hybrid Default on
Knowledge horizon
Released Mar 2026
Today
Since release 2 mo
See also
Add Model to Comparison
Search for a model to add