Qwen 3.6 35B A3B
Specifications
- Input
- Output
- Context window
- 256K tokens
- Released
- Apr 2026
Performance
- Speed
- 68 t/s
- TTFT
- —
- Latency
- 96 ms
- Intelligence
- —
Pricing
- Input
- €0.25 per 1M tokens
- Output
- €1.50 per 1M tokens
About this model
Qwen 3.6 35B A3B is a 35 B parameter mixture‑of‑experts vision‑language model with a gated delta network architecture and a vision encoder. It supports a native context window of 262 K tokens (extendable to over 1 M tokens) and can be used for image‑text‑to‑text tasks. The model achieves strong performance on benchmarks, scoring 85.2 % on MMLU‑Pro, 81.4 % on MMMU, and 85.3 % on RealWorldQA. It is released under the Apache 2.0 license.
Technical specifications
- Capabilities
- Input modalities
- Output modalities
- Reasoning
- Hybrid Default on
Knowledge horizon
Released Apr 2026
Today
Since release 2 mo
See also
Add Model to Comparison
Search for a model to add