Nemotron 3 Nano 30B A3B
Specifications
- Input
- Output
- Context window
- 128K tokens
- Released
- Dec 2025
Performance
- Speed
- 291 t/s
- TTFT
- 291 ms
- Latency
- 128 ms
- Intelligence
- —
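The speed and TTFT figures above can be combined into a rough end-to-end estimate for a streamed response. This is a minimal sketch under a stated assumption: total time is approximated as time to first token plus output tokens divided by generation speed. The function name `estimated_response_seconds` is hypothetical, and real timings vary with load, prompt length, and provider.

```python
# Figures taken from the Performance list above.
TTFT_SECONDS = 0.291        # time to first token (291 ms)
SPEED_TOKENS_PER_S = 291.0  # generation speed (291 t/s)


def estimated_response_seconds(output_tokens: int) -> float:
    """Rough estimate: TTFT plus generation time for the output tokens.

    Assumption (not stated on the page): the two figures simply add up.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return TTFT_SECONDS + output_tokens / SPEED_TOKENS_PER_S


if __name__ == "__main__":
    for n in (100, 1_000, 10_000):
        print(f"{n:>6} output tokens: ~{estimated_response_seconds(n):.2f} s")
```

Under this approximation, generation time dominates for anything beyond a few hundred output tokens, so the TTFT matters mostly for short, interactive replies.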
Pricing
- Input
- €0.06 per 1M tokens
- Output
- €0.24 per 1M tokens
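As a worked example of the per-token pricing above, the cost of a single request can be sketched as follows. The function name `request_cost_eur` is hypothetical, and the sketch assumes billing is strictly linear in token counts with no minimum charge, caching discount, or rounding, none of which the page specifies.

```python
# Prices taken from the Pricing list above, in EUR per 1M tokens.
INPUT_EUR_PER_MILLION = 0.06
OUTPUT_EUR_PER_MILLION = 0.24


def request_cost_eur(input_tokens: int, output_tokens: int) -> float:
    """Cost of one request in EUR, assuming purely linear per-token billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (input_tokens * INPUT_EUR_PER_MILLION
             + output_tokens * OUTPUT_EUR_PER_MILLION)
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 10,000-token prompt with a 2,000-token reply.
    print(f"{request_cost_eur(10_000, 2_000):.5f} EUR")
```

Output tokens cost four times as much as input tokens here, so long generations tend to drive the bill more than long prompts do.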
About this model
NVIDIA Nemotron 3 Nano is a highly efficient hybrid Mamba-Transformer mixture-of-experts (MoE) model with 30B total and 3.5B active parameters. It has a 128K-token context window, extensible to 1M tokens, and excels at agentic AI, reasoning, and tool-calling tasks. It was trained on 25T tokens and supports English, German, French, Spanish, Italian, and Japanese. The weights are open and licensed for commercial use.
Technical data
- Capabilities
- Input modalities
- Output modalities
- Reasoning
- Hybrid (on by default)
Knowledge horizon
Knowledge cutoff: Jun 2025
Released: Dec 2025
Training to release: 6 mo
Since release: 6 mo