Qwen3 30B A3B Instruct

by Qwen

Specifications

Input
Output
Context window: 262K tokens
Released: Jul 2025

Performance

Speed: 176 t/s
TTFT: 346 ms
Latency: 111 ms
Intelligence: —

Pricing

Input: €0.10
Output: €0.30

About this model

Qwen3 30B A3B Instruct is a compact Mixture-of-Experts model with 30B total parameters and 3B activated per token, offering excellent efficiency for general-purpose tasks. Features 262K native context with extension to 1M tokens, strong multilingual capabilities, and enhanced instruction following. Balances performance and computational efficiency with support for tool calling, code generation, and logical reasoning. Ideal for deployment scenarios requiring lower resource usage while maintaining quality across diverse task types.

Technical specifications

Capabilities
Input modalities
Output modalities
Reasoning: No

Knowledge horizon

Released Jul 2025

Today

Since release 11 mo

Qwen3 30B A3B Instruct

Specifications

Performance

Pricing

About this model

Technical specifications

Knowledge horizon

See also

Kimi K2.5

Qwen 3.6 27B

Gemma 4 31B