Qwen3.5 397B A17B

by Qwen

Specifications

Input
Output
Context window: 256K tokens
Released: Feb 2026

Performance

Speed: 41 t/s
TTFT: 243 ms
Latency
Intelligence
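
As a rough illustration of how the speed and TTFT figures combine, the sketch below estimates the wall-clock time of a response as the time to first token plus the output length divided by throughput. It assumes a steady decode rate and no queueing, which real deployments will not match exactly; the function name is hypothetical and not part of any API.

```python
SPEED_TOKENS_PER_S = 41.0  # "Speed" figure listed on this page
TTFT_S = 0.243             # "TTFT" figure listed on this page (243 ms)


def estimate_response_seconds(output_tokens: int,
                              speed: float = SPEED_TOKENS_PER_S,
                              ttft: float = TTFT_S) -> float:
    """Approximate response time: first-token delay plus steady decoding."""
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return ttft + output_tokens / speed


if __name__ == "__main__":
    # A 500-token answer at the listed figures.
    print(f"{estimate_response_seconds(500):.1f} s")
```

Under these assumptions a 500-token answer takes roughly 12.4 seconds, almost all of it decode time rather than first-token delay.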

Pricing

Input: €0.75 per 1M tokens
Output: €4.50 per 1M tokens
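
To make the per-token pricing concrete, here is a minimal sketch that turns token counts into a cost in euros using the two rates listed above. The function name is hypothetical, and it assumes billing is linear in token count with no minimums, caching discounts, or separate charges for thinking tokens.

```python
from decimal import Decimal

INPUT_EUR_PER_M = Decimal("0.75")   # input price per 1M tokens, from this page
OUTPUT_EUR_PER_M = Decimal("4.50")  # output price per 1M tokens, from this page
TOKENS_PER_M = Decimal(1_000_000)


def request_cost_eur(input_tokens: int, output_tokens: int) -> Decimal:
    """Cost of one request in euros, assuming purely linear pricing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (Decimal(input_tokens) * INPUT_EUR_PER_M
             + Decimal(output_tokens) * OUTPUT_EUR_PER_M)
    return total / TOKENS_PER_M


if __name__ == "__main__":
    cost = request_cost_eur(input_tokens=20_000, output_tokens=2_000)
    print(f"€{cost:.4f}")
```

For example, 20,000 input tokens and 2,000 output tokens come to €0.024. Because output is priced at six times the input rate, long generations dominate the bill.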

About this model

Qwen3.5 397B A17B is a 397B-parameter mixture-of-experts vision-language foundation model that combines a gated delta network architecture with a vision encoder. It supports a native context window of 262,144 tokens (the 256K figure listed above), extendable to over 1 million, and runs in thinking mode by default; thinking can be disabled. Reported benchmark results include 87.8% on MMLU-Pro, 85.0% on MMMU, and 88.6% on MathVision. The model is released under the Apache 2.0 license.
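
The 256K figure in the specifications and the 262,144-token figure here are the same number (256 × 1,024). The sketch below checks whether a request fits in the native window. It assumes that prompt tokens and generated tokens, including any thinking tokens, share one window, which is typical for this kind of model but is not stated on this page; the function name is hypothetical.

```python
NATIVE_CONTEXT_TOKENS = 256 * 1024  # 262,144 tokens, the native window


def fits_in_context(prompt_tokens: int,
                    max_output_tokens: int,
                    window: int = NATIVE_CONTEXT_TOKENS) -> bool:
    """True if prompt plus the output budget fits in the context window.

    Assumption: input and output tokens count against the same window.
    """
    if prompt_tokens < 0 or max_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return prompt_tokens + max_output_tokens <= window


if __name__ == "__main__":
    print(fits_in_context(250_000, 8_192))
    print(fits_in_context(260_000, 8_192))
```

With thinking mode on by default, it may be worth leaving extra headroom in the output budget, since reasoning text is generated before the final answer.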

Technical specifications

Capabilities
Input modalities
Output modalities
Reasoning: Hybrid, default on

Knowledge horizon

Released Feb 2026 (3 months since release)
