Whisper Large V3

by OpenAI

Specifications

Input: Audio
Output: Text
Context window: —
Released: Nov 2023

Performance

Speed: —
TTFT: —
Latency: —
Intelligence: —

Pricing

Input: €0.00
Output: €0.00

About this model

OpenAI Whisper Large V3 is an automatic speech recognition (ASR) model with 1550M parameters that supports 99 languages. It achieves a 10-20% reduction in word error rate (WER) compared to V2 and was trained on 1M hours of weakly labeled audio plus 4M hours of pseudo-labeled audio. The input spectrogram uses 128 Mel frequency bins (up from 80 in V2), and the model adds Cantonese language support and improved robustness to accents and background noise. It supports speech transcription and speech-to-English translation, with sentence-level and word-level timestamps. Running it with torch.compile can give a speedup of up to 4.5x. It is suited to accessibility tools, multilingual transcription, and enterprise ASR applications.
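To make the WER figure concrete, here is a minimal sketch of how word error rate is commonly computed (word-level edit distance divided by the number of reference words) and what a relative reduction looks like. The function names, the sample sentences, and the 10.0% and 8.5% figures are illustrative assumptions, not measurements of Whisper; the sketch also assumes the quoted 10-20% is a relative reduction.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference word count."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion (reference word missing)
                curr[j - 1] + 1,     # insertion (extra hypothesis word)
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


def relative_reduction(old_wer: float, new_wer: float) -> float:
    """Fraction by which new_wer improves on old_wer."""
    return (old_wer - new_wer) / old_wer


# Hypothetical transcripts, for illustration only.
reference = "the quick brown fox jumps over the lazy dog"
hypothesis = "the quick brown box jumps over lazy dog"

# One substitution (fox -> box) and one deletion (the): 2 errors over 9 words.
sample_wer = word_error_rate(reference, hypothesis)

# Made-up numbers: going from 10.0% to 8.5% WER is a 15% relative reduction.
sample_reduction = relative_reduction(0.10, 0.085)
```

Under this reading, a 10-20% reduction means the error count shrinks by that fraction of its previous value, not that WER drops by 10-20 percentage points.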

Technical specifications

Capabilities
Input modalities: Audio
Output modalities: Text
Reasoning: No

Knowledge horizon

Released: Nov 2023
Since release: 32 months

See also

Kimi K2.6

Qwen 3.6 27B

Gemma 4 31B