Qwen 3.5 9B

by Qwen

Specifications

Input
Output
Context window: 262K tokens
Released: Mar 2026

Performance

Speed: 18 t/s
TTFT: 286 ms
Latency: —
Intelligence: —

Pricing

Input: €0.18
Output: €0.23

About this model

Qwen 3.5 9B is a 9B‑parameter multimodal large language model with a gated‑delta mixture‑of‑experts architecture and a vision encoder. It supports a native context window of 262,144 tokens and operates in a default thinking mode that can be disabled. The model achieves strong results such as 82.5% on MMLU‑Pro, 88.2% on C‑Eval, and 78.4% on MMMU benchmarks. It is released under the Apache 2.0 license.

Technical specifications

Capabilities
Input modalities
Output modalities
Reasoning: Hybrid Default on

Knowledge horizon

Released Mar 2026

Today

Since release 2 mo

Qwen 3.5 9B

Specifications

Performance

Pricing

About this model

Technical specifications

Knowledge horizon

See also

Kimi K2.5

MiniMax M2.5

DeepSeek V4 Flash