Whisper Large V3

von OpenAI

Specifications

Input
Output
Context window: —
Veröffentlicht: Nov 2023

Performance

Speed: —
TTFT: —
Latency: —
Intelligence: —

Pricing

Eingabe: €0.00
Ausgabe: €0.00

Über dieses Modell

OpenAI Whisper Large V3 is a state-of-the-art automatic speech recognition model with 1550M parameters supporting 99 languages. Achieves 10-20% WER reduction compared to V2, trained on 1M hours weakly labeled + 4M hours pseudo-labeled audio. Features 128 Mel frequency bins (increased from 80), improved robustness to accents and background noise, and new Cantonese language support. Supports speech transcription and speech-to-English translation with sentence and word-level timestamps. Optimized with torch.compile for 4.5x speedup. Ideal for accessibility tools, multilingual transcription, and enterprise ASR applications.

Technische Daten

Fähigkeiten
Eingabe-Modalitäten
Ausgabe-Modalitäten
Reasoning: No

Knowledge horizon

Veröffentlicht Nov 2023

Today

Since release 32 mo

Whisper Large V3

Specifications

Performance

Pricing

Über dieses Modell

Technische Daten

Knowledge horizon

See also

Kimi K2.6

Qwen 3.6 27B

Gemma 4 31B