Qwen3.5 397B A17B
Specifications
- Input
- Output
- Context window: 256K tokens
- Released: Feb 2026
Performance
- Speed: 41 tokens/s
- TTFT (time to first token): 243 ms
- Latency: n/a
- Intelligence: n/a
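As a rough illustration of how the speed and TTFT figures combine, the sketch below estimates wall-clock time for a streamed response. It assumes the listed numbers hold steady for the whole generation, which real deployments rarely guarantee; the function name `estimate_response_seconds` is hypothetical and not part of any API.

```python
# Figures copied from the listing above; treat them as point-in-time measurements.
SPEED_TOKENS_PER_S = 41.0  # output speed in tokens per second
TTFT_MS = 243.0            # time to first token in milliseconds


def estimate_response_seconds(output_tokens: int) -> float:
    """Approximate total time: TTFT plus steady-state generation time.

    Assumption: throughput is constant at SPEED_TOKENS_PER_S.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return TTFT_MS / 1000.0 + output_tokens / SPEED_TOKENS_PER_S


if __name__ == "__main__":
    for n in (100, 500, 2000):
        print(f"{n} output tokens: ~{estimate_response_seconds(n):.1f} s")
```

Because thinking mode is on by default, any reasoning tokens the model emits before its answer would also count toward `output_tokens` in this estimate.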
Pricing
- Input: €0.75 per 1M tokens
- Output: €4.50 per 1M tokens
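To make the per-million-token prices concrete, here is a minimal cost sketch. It assumes simple linear billing at the listed rates, with no caching discounts, minimum charges, or tiered pricing; the helper name `estimate_cost_eur` is hypothetical.

```python
# Prices copied from the listing above, in euros per 1M tokens.
INPUT_EUR_PER_M = 0.75
OUTPUT_EUR_PER_M = 4.50


def estimate_cost_eur(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request under linear per-token billing (an assumption)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_EUR_PER_M + output_tokens * OUTPUT_EUR_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    cost = estimate_cost_eur(input_tokens=10_000, output_tokens=2_000)
    print(f"Estimated cost: €{cost:.4f}")
```

Output tokens cost six times as much as input tokens at these rates, so long generated answers dominate the bill even when prompts are large.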
About this model
Qwen3.5 397B A17B is a mixture-of-experts vision-language foundation model with 397B total parameters, of which about 17B are active per token (the "A17B" in its name). It combines a gated delta network architecture with a vision encoder. Its native context window is 262,144 tokens (256K), extendable to over 1 million. Thinking mode is on by default and can be disabled. Reported benchmark results include 87.8% on MMLU-Pro, 85.0% on MMMU, and 88.6% on MathVision. The model is released under the Apache 2.0 license.
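The "256K" shown in the specifications and the 262,144 figure in the description are the same number, since 256 × 1,024 = 262,144. The sketch below checks whether a request fits the native window. It assumes prompt and output tokens share one window, as is typical for decoder-only models but not confirmed by this listing; the function names are hypothetical.

```python
NATIVE_CONTEXT_TOKENS = 256 * 1024  # 262,144 tokens, the native window stated above


def fits_native_context(prompt_tokens: int, max_output_tokens: int) -> bool:
    """Return True if prompt plus requested output fits the native window.

    Assumption: input and output tokens count against the same window.
    """
    return prompt_tokens + max_output_tokens <= NATIVE_CONTEXT_TOKENS


def remaining_output_budget(prompt_tokens: int) -> int:
    """Tokens left for output after the prompt (never negative)."""
    return max(0, NATIVE_CONTEXT_TOKENS - prompt_tokens)


if __name__ == "__main__":
    prompt = 200_000
    print("Fits with 8,192 output tokens:", fits_native_context(prompt, 8_192))
    print("Remaining output budget:", remaining_output_budget(prompt))
```

Requests beyond this limit would need the extended-context configuration mentioned above, whose setup is not described in this listing.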
Technical specifications
- Capabilities
- Input modalities
- Output modalities
- Reasoning: Hybrid (on by default)
Knowledge horizon
- Released: Feb 2026
- Time since release: 3 months (as of page capture)