Nemotron 3 Nano 30B A3B

von NVIDIA

Specifications

Input
Output
Context window: 128K tokens
Veröffentlicht: Dec 2025

Performance

Speed: 222 t/s
TTFT: 512 ms
Latency: 258 ms
Intelligence: —

Pricing

Eingabe: €0.06
Ausgabe: €0.24

Über dieses Modell

NVIDIA Nemotron 3 Nano is a highly efficient hybrid Mamba-Transformer MoE model with 30B total / 3.5B active parameters. Features 128K context window extensible to 1M tokens. Excels at agentic AI, reasoning, and tool calling tasks. Trained on 25T tokens with state-of-the-art efficiency. Supports English, German, French, Spanish, Italian, and Japanese. Open weights with commercial license.

Technische Daten

Fähigkeiten
Eingabe-Modalitäten
Ausgabe-Modalitäten
Reasoning: Hybrid Standard on

Knowledge horizon

Wissensstand Jun 2025

Veröffentlicht Dec 2025

Today

Training to release 6 mo Since release 7 mo

Nemotron 3 Nano 30B A3B

Specifications

Performance

Pricing

Über dieses Modell

Technische Daten

Knowledge horizon

See also

Kimi K2.6

Qwen 3.6 27B

Gemma 4 31B