← All Models

NVIDIA: Nemotron 3 Nano Omni (free)

🇺🇸 NVIDIA · Nemotron 3

Input Price Free per million tokens
Output Price Free per million tokens
Context Window 256K tokens Output limit: 66K
OpenRouter Route Price Please verify with official pricing pages
Use this model via OpenRouter →

Overview

NVIDIA: Nemotron 3 Nano Omni (free) is a large language model API from NVIDIA, part of its Nemotron 3 model family. It is currently routed free of charge on OpenRouter, which makes it well suited to prototyping and high-volume experimentation where per-token cost would otherwise dominate. A large 256K-token context window (≈384 pages of text) lets it take in whole books, large codebases, or lengthy transcripts in a single call. Beyond plain text it also accepts Audio, Image, Video input, so it can be applied to multimodal tasks rather than text alone. On Artificial Analysis's Intelligence Index it scores 15 (F grade), a useful proxy for its general reasoning strength relative to the other models tracked here. All prices on this page reflect OpenRouter's routed rates and are re-synced automatically every day; confirm against the provider's official pricing before committing to production.

Dimension Unit Price (USD) Price (TWD) Effective From
Input per 1M tokens Free
Output per 1M tokens Free

Provider
NVIDIA
Model Family
Nemotron 3
Version String
nvidia/nemotron-3-nano-omni-30b-a3b-reasoning:free
Status
Active
Modality
Text, Audio, Image, Video
Context Window
256,000 tokens
Output Limit
65,536 tokens

Index Metrics

Cross-domain capability indexes evaluated by Artificial Analysis — Artificial Analysis

Intelligence Index 15 F Measured: 2026-07-01

Benchmark Scores

Data source: Artificial Analysis

AA-LCR 35.7% D Measured: 2026-07-01
GPQA Diamond 46.9% C Measured: 2026-07-01
HLE 5.3% D Measured: 2026-07-01
IFBench 63.2% B Measured: 2026-07-01
MMMU Pro 53.2% C Measured: 2026-07-01
Non-Hallucination 16.9% Measured: 2026-07-01
Omniscience Accuracy 14.8% Measured: 2026-07-01
SciCode 27.8% C Measured: 2026-07-01
Tau2 45.3% Measured: 2026-07-01
TerminalBench 8.3% Measured: 2026-07-01

Performance Metrics

Real-world benchmarks, updated every 72 hours by Artificial Analysis — Artificial Analysis

First Token Latency 1.0s Measured: 2026-07-01
Output Speed 300 t/s Measured: 2026-07-01
Response Time 9.3s Measured: 2026-07-01

90-Day Price Trend

Input / Output price (USD per 1M tokens)

Past 90 days of records; every price change is shown here

Date Dimension Price (USD) Source
Output Free OpenRouter
Input Free OpenRouter
Output Free OpenRouter
Input Free OpenRouter
Output Free OpenRouter
Input Free OpenRouter
Output Free OpenRouter
Input Free OpenRouter
Output Free OpenRouter
Input Free OpenRouter
Output Free OpenRouter
Input Free OpenRouter
Output Free OpenRouter
Input Free OpenRouter
Output Free OpenRouter
Input Free OpenRouter
Output Free OpenRouter
Input Free OpenRouter
Output Free OpenRouter
Input Free OpenRouter
Output Free OpenRouter
Input Free OpenRouter
Output Free OpenRouter
Input Free OpenRouter
Output Free OpenRouter
Input Free OpenRouter
Output Free OpenRouter
Input Free OpenRouter
Output Free OpenRouter
Input Free OpenRouter
Output Free OpenRouter
Input Free OpenRouter
Output Free OpenRouter
Input Free OpenRouter
Output Free OpenRouter
Input Free OpenRouter
Output Free OpenRouter
Input Free OpenRouter
Output Free OpenRouter
Input Free OpenRouter
Output Free OpenRouter
Input Free OpenRouter
Output Free OpenRouter
Input Free OpenRouter
Output Free OpenRouter
Input Free OpenRouter
Output Free OpenRouter
Input Free OpenRouter
Output Free OpenRouter
Input Free OpenRouter

Key Insights

Key data points from this page for quick reference and citation.

  • NVIDIA: Nemotron 3 Nano Omni (free) Input price: $0/M tokens
  • NVIDIA: Nemotron 3 Nano Omni (free) Output price: $0/M tokens
  • Context window: 256,000 tokens
  • Provider: NVIDIA
  • Model family: Nemotron 3
  • Modalities: Text, Audio, Image, Video
  • Data source: OpenRouter, updated daily