Qwen3 235B A22B Instruct

by Qwen

Specifications

Input
Output
Context window: 262K tokens
Released: Apr 2025

Performance

Speed: 93 t/s
TTFT (time to first token): 311 ms
Latency
Intelligence
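
As a rough reading of the speed and TTFT figures above, the wall-clock time of a streamed completion can be approximated as time to first token plus output tokens divided by throughput. The sketch below is illustrative only: the function name and the 500-token example are assumptions, and real latency varies with load and prompt length.

```python
def estimate_response_seconds(
    output_tokens: int,
    tokens_per_second: float = 93.0,  # "Speed" figure listed above
    ttft_ms: float = 311.0,           # "TTFT" figure listed above
) -> float:
    """Approximate total time for a streamed completion, in seconds."""
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    # Time until the first token arrives, plus time to stream the rest.
    return ttft_ms / 1000.0 + output_tokens / tokens_per_second


# Hypothetical 500-token answer.
print(round(estimate_response_seconds(500), 2))  # 5.69
```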

Pricing

Input: €0.25 per 1M tokens
Output: €0.75 per 1M tokens
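
To make the per-token pricing concrete, a request's cost can be estimated from its input and output token counts. This is a minimal sketch using the listed prices; the function name and the example token counts are assumptions, and actual billing may differ (rounding, minimum charges, and so on).

```python
INPUT_EUR_PER_MTOK = 0.25   # listed input price per 1M tokens
OUTPUT_EUR_PER_MTOK = 0.75  # listed output price per 1M tokens


def estimate_cost_eur(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in euros."""
    total = (input_tokens * INPUT_EUR_PER_MTOK
             + output_tokens * OUTPUT_EUR_PER_MTOK)
    return total / 1_000_000


# Hypothetical request: 200K input tokens, 50K output tokens.
print(f"{estimate_cost_eur(200_000, 50_000):.4f}")  # 0.0875
```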

About this model

Qwen3 235B A22B Instruct is a Mixture-of-Experts (MoE) model with 235B total parameters, of which 22B are activated per token. It uses 128 experts, 8 of which are activated per token. Its native 262K-token context window can be extended to 1M tokens via Dual Chunk Attention. The model is described as achieving state-of-the-art results, with reported scores of 83.0 on MMLU-Pro, 70.3 on AIME25, 41.8 on ARC-AGI, 79.2 on Arena-Hard v2, 51.8 on LiveCodeBench, and 70.9 on BFCL-v3. It runs in non-thinking mode, focused on direct task execution with enhanced instruction following, logical reasoning, and long-tail knowledge across multiple languages. Because only 22B of its parameters are active for each token, it needs far less compute per token than a dense 235B model.
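
The sparsity figures in the description can be turned into simple ratios. The sketch below uses only the numbers quoted above; the variable names are illustrative. The active-parameter share comes out higher than the active-expert share, presumably because non-expert components such as attention and embeddings run for every token, though the page does not break this down.

```python
TOTAL_PARAMS_B = 235    # total parameters, in billions
ACTIVE_PARAMS_B = 22    # parameters activated per token, in billions
NUM_EXPERTS = 128       # experts available
EXPERTS_PER_TOKEN = 8   # experts activated per token

# Share of all parameters that participate in processing one token.
active_param_fraction = ACTIVE_PARAMS_B / TOTAL_PARAMS_B
# Share of experts selected for one token.
active_expert_fraction = EXPERTS_PER_TOKEN / NUM_EXPERTS

print(f"{active_param_fraction:.1%}")   # 9.4%
print(f"{active_expert_fraction:.2%}")  # 6.25%
```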

Technical specifications

Capabilities
Input modalities
Output modalities
Reasoning: No

Knowledge horizon

Knowledge cutoff: Mar 2024
Released: Apr 2025
Training to release: 13 mo
Since release: 13 mo
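
The "training to release" gap follows directly from the two dates. A minimal sketch of that arithmetic, counting whole calendar months; the helper name is an assumption, and the "since release" figure would be computed the same way against the current date.

```python
from datetime import date


def months_between(start: date, end: date) -> int:
    """Number of calendar months from start to end (day of month ignored)."""
    return (end.year - start.year) * 12 + (end.month - start.month)


knowledge_cutoff = date(2024, 3, 1)  # Mar 2024
released = date(2025, 4, 1)          # Apr 2025

print(months_between(knowledge_cutoff, released))  # 13
```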
