
Qwen 3.5 9B

by Qwen

Specifications

Input
Output
Context window: 262K tokens
Released: Mar 2026

Performance

Speed: 18 t/s (tokens per second)
TTFT (time to first token): 286 ms
Latency
Intelligence
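
As a rough illustration of how the speed and TTFT figures combine, the sketch below estimates the wall-clock time of a response. It assumes constant throughput after the first token, which real deployments rarely hold exactly; the function name `estimate_response_seconds` is hypothetical and not part of any API.

```python
# Figures copied from the Performance section of this page.
TOKENS_PER_SECOND = 18.0   # "Speed: 18 t/s"
TTFT_SECONDS = 0.286       # "TTFT: 286 ms"


def estimate_response_seconds(output_tokens: int) -> float:
    """Estimate response time as TTFT plus steady-state generation time.

    Assumption: throughput stays constant at TOKENS_PER_SECOND once
    generation has started. Treat the result as an order-of-magnitude guide.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return TTFT_SECONDS + output_tokens / TOKENS_PER_SECOND


if __name__ == "__main__":
    for n in (100, 500, 2000):
        print(f"{n} output tokens: ~{estimate_response_seconds(n):.1f} s")
```

At these figures, generation time dominates TTFT for anything longer than a few tokens.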

Pricing

Input: €0.18 per 1M tokens
Output: €0.23 per 1M tokens
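
To make the per-million-token prices concrete, here is a minimal cost sketch. It uses only the two prices listed above; the function name `estimate_cost_eur` is hypothetical, and the sketch ignores any discounts, minimum charges, or rounding a provider might apply.

```python
# Prices copied from the Pricing section of this page (EUR per 1M tokens).
INPUT_EUR_PER_MILLION = 0.18
OUTPUT_EUR_PER_MILLION = 0.23


def estimate_cost_eur(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in EUR for one request or a batch of requests."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (input_tokens * INPUT_EUR_PER_MILLION
             + output_tokens * OUTPUT_EUR_PER_MILLION)
    return total / 1_000_000


if __name__ == "__main__":
    # Example workload: 200,000 input tokens and 50,000 output tokens.
    print(f"EUR {estimate_cost_eur(200_000, 50_000):.4f}")
```

For the example workload this comes to 0.036 EUR for input plus 0.0115 EUR for output.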

About this model

Qwen 3.5 9B is a 9B‑parameter multimodal large language model with a gated‑delta mixture‑of‑experts architecture and a vision encoder. It supports a native context window of 262,144 tokens and runs in thinking mode by default; thinking mode can be disabled. Reported benchmark scores include 82.5% on MMLU‑Pro, 88.2% on C‑Eval, and 78.4% on MMMU. The model is released under the Apache 2.0 license.
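
The 262,144-token figure equals 2^18. A small sketch of a context-budget check follows, under the assumption (not stated on this page) that prompt tokens and generated tokens, including any thinking tokens, share the same window. The helper names are hypothetical.

```python
# Native context window stated in the description above: 262,144 tokens (2**18).
CONTEXT_WINDOW_TOKENS = 262_144


def fits_in_context(prompt_tokens: int, max_output_tokens: int) -> bool:
    """Check whether a prompt plus its requested output fits in the window.

    Assumption: input and output tokens count against one shared window.
    """
    return prompt_tokens + max_output_tokens <= CONTEXT_WINDOW_TOKENS


def remaining_output_budget(prompt_tokens: int) -> int:
    """Return how many tokens remain for output after the prompt (never negative)."""
    return max(CONTEXT_WINDOW_TOKENS - prompt_tokens, 0)


if __name__ == "__main__":
    print(fits_in_context(250_000, 8_000))
    print(remaining_output_budget(200_000))
```

Token counts depend on the model's tokenizer, so the inputs here would need to come from an actual tokenization of the prompt rather than a character-based guess.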

Technical specifications

Capabilities
Input modalities
Output modalities
Reasoning: Hybrid (default on)

Knowledge horizon

Released: Mar 2026 (2 mo since release)
