← All Models

NVIDIA: Nemotron Nano 9B V2

🇺🇸 NVIDIA · Nemotron Nano

Input Price $0.040 per million tokens NT$1.3
Output Price $0.160 per million tokens NT$5.1
Context Window 131K tokens Output limit: 16K
OpenRouter Route Price Please verify with official pricing pages
Use this model via OpenRouter →

Overview

NVIDIA: Nemotron Nano 9B V2 is a large language model API from NVIDIA, part of its Nemotron Nano model family. At $0.040 per million input tokens and $0.160 per million output tokens, it sits in the budget tier — among the cheaper options for high-throughput or cost-sensitive workloads. Output tokens cost about 4× as much as input, so prompt-heavy workloads run noticeably cheaper than generation-heavy ones. The 131K-token context window — around 197 pages of text — comfortably handles long documents, multi-file code, or extended conversations. On Artificial Analysis's Intelligence Index it scores 9 (F grade), a useful proxy for its general reasoning strength relative to the other models tracked here. All prices on this page reflect OpenRouter's routed rates and are re-synced automatically every day; confirm against the provider's official pricing before committing to production.

Dimension Unit Price (USD) Price (TWD) Effective From
Input per 1M tokens $0.040 NT$1.3
Output per 1M tokens $0.160 NT$5.1

Provider
NVIDIA
Model Family
Nemotron Nano
Version String
nvidia/nemotron-nano-9b-v2
Status
Active
Modality
Text
Context Window
131,072 tokens
Output Limit
16,384 tokens

Index Metrics

Cross-domain capability indexes evaluated by Artificial Analysis — Artificial Analysis

Intelligence Index 9 F Measured: 2026-07-01

Benchmark Scores

Data source: Artificial Analysis

AA-LCR 21.0% F Measured: 2026-07-01
GPQA Diamond 57.0% C Measured: 2026-07-01
HLE 4.6% D Measured: 2026-07-01
IFBench 27.6% F Measured: 2026-07-01
Non-Hallucination 39.0% Measured: 2026-07-01
Omniscience Accuracy 11.3% Measured: 2026-07-01
SciCode 22.0% C Measured: 2026-07-01
Tau2 21.9% Measured: 2026-07-01
TerminalBench 1.5% Measured: 2026-07-01

Performance Metrics

Real-world benchmarks, updated every 72 hours by Artificial Analysis — Artificial Analysis

First Token Latency 8.5s Measured: 2026-07-01
Output Speed 78 t/s Measured: 2026-07-01
Response Time 40.6s Measured: 2026-07-01

90-Day Price Trend

Input / Output price (USD per 1M tokens)

Past 90 days of records; every price change is shown here

Date Dimension Price (USD) Source
Output $0.160 OpenRouter
Input $0.040 OpenRouter
Output $0.160 OpenRouter
Input $0.040 OpenRouter
Output $0.160 OpenRouter
Input $0.040 OpenRouter
Output $0.160 OpenRouter
Input $0.040 OpenRouter
Output $0.160 OpenRouter
Input $0.040 OpenRouter
Output $0.160 OpenRouter
Input $0.040 OpenRouter
Output $0.160 OpenRouter
Input $0.040 OpenRouter
Output $0.160 OpenRouter
Input $0.040 OpenRouter
Output $0.160 OpenRouter
Input $0.040 OpenRouter
Output $0.160 OpenRouter
Input $0.040 OpenRouter
Output $0.160 OpenRouter
Input $0.040 OpenRouter
Output $0.160 OpenRouter
Input $0.040 OpenRouter
Output $0.160 OpenRouter
Input $0.040 OpenRouter
Output $0.160 OpenRouter
Input $0.040 OpenRouter
Output $0.160 OpenRouter
Input $0.040 OpenRouter
Output $0.160 OpenRouter
Input $0.040 OpenRouter
Output $0.160 OpenRouter
Input $0.040 OpenRouter
Output $0.160 OpenRouter
Input $0.040 OpenRouter
Output $0.160 OpenRouter
Input $0.040 OpenRouter
Output $0.160 OpenRouter
Input $0.040 OpenRouter
Output $0.160 OpenRouter
Input $0.040 OpenRouter
Output $0.160 OpenRouter
Input $0.040 OpenRouter
Output $0.160 OpenRouter
Input $0.040 OpenRouter
Output $0.160 OpenRouter
Input $0.040 OpenRouter
Output $0.160 OpenRouter
Input $0.040 OpenRouter

Key Insights

Key data points from this page for quick reference and citation.

  • NVIDIA: Nemotron Nano 9B V2 Input price: $0.04/M tokens
  • NVIDIA: Nemotron Nano 9B V2 Output price: $0.16/M tokens
  • Context window: 131,072 tokens
  • Provider: NVIDIA
  • Model family: Nemotron Nano
  • Modalities: Text
  • Data source: OpenRouter, updated daily