← All Models

NVIDIA: Nemotron Nano 12B 2 VL

🇺🇸 NVIDIA · Nemotron Nano

Input Price $0.200 per million tokens NT$6.4
Output Price $0.600 per million tokens NT$19.2
Context Window 131K tokens Output limit: 16K
OpenRouter Route Price Please verify with official pricing pages
Use this model via OpenRouter →

Overview

NVIDIA: Nemotron Nano 12B 2 VL is a large language model API from NVIDIA, part of its Nemotron Nano model family. At $0.200 per million input tokens and $0.600 per million output tokens, it sits in the budget tier — among the cheaper options for high-throughput or cost-sensitive workloads. Output tokens cost about 3× as much as input, so prompt-heavy workloads run noticeably cheaper than generation-heavy ones. The 131K-token context window — around 197 pages of text — comfortably handles long documents, multi-file code, or extended conversations. Beyond plain text it also accepts Image, Video input, so it can be applied to multimodal tasks rather than text alone. On Artificial Analysis's Intelligence Index it scores 9 (F grade), a useful proxy for its general reasoning strength relative to the other models tracked here. All prices on this page reflect OpenRouter's routed rates and are re-synced automatically every day; confirm against the provider's official pricing before committing to production.

Dimension Unit Price (USD) Price (TWD) Effective From
Input per 1M tokens $0.200 NT$6.4
Output per 1M tokens $0.600 NT$19.2

Provider
NVIDIA
Model Family
Nemotron Nano
Version String
nvidia/nemotron-nano-12b-v2-vl
Status
Active
Modality
Image, Text, Video
Context Window
131,072 tokens
Output Limit
16,384 tokens

Index Metrics

Cross-domain capability indexes evaluated by Artificial Analysis — Artificial Analysis

Intelligence Index 9 F Measured: 2026-07-01

Benchmark Scores

Data source: Artificial Analysis

AA-LCR 40.0% D Measured: 2026-07-01
GPQA Diamond 57.2% C Measured: 2026-07-01
HLE 5.3% D Measured: 2026-07-01
IFBench 31.9% D Measured: 2026-07-01
MMMU Pro 52.9% C Measured: 2026-07-01
Non-Hallucination 9.3% Measured: 2026-07-01
Omniscience Accuracy 13.8% Measured: 2026-07-01
SciCode 26.2% C Measured: 2026-07-01
Tau2 21.3% Measured: 2026-07-01
TerminalBench 4.5% Measured: 2026-07-01

Performance Metrics

Real-world benchmarks, updated every 72 hours by Artificial Analysis — Artificial Analysis

First Token Latency 0.4s Measured: 2026-07-01
Output Speed 291 t/s Measured: 2026-07-01
Response Time 9.0s Measured: 2026-07-01

90-Day Price Trend

Input / Output price (USD per 1M tokens)

Past 90 days of records; every price change is shown here

Date Dimension Price (USD) Source
Output $0.600 OpenRouter
Input $0.200 OpenRouter
Output $0.600 OpenRouter
Input $0.200 OpenRouter
Output $0.600 OpenRouter
Input $0.200 OpenRouter
Output $0.600 OpenRouter
Input $0.200 OpenRouter
Output $0.600 OpenRouter
Input $0.200 OpenRouter
Output $0.600 OpenRouter
Input $0.200 OpenRouter
Output $0.600 OpenRouter
Input $0.200 OpenRouter
Output $0.600 OpenRouter
Input $0.200 OpenRouter
Output $0.600 OpenRouter
Input $0.200 OpenRouter
Output $0.600 OpenRouter
Input $0.200 OpenRouter
Output $0.600 OpenRouter
Input $0.200 OpenRouter
Output $0.600 OpenRouter
Input $0.200 OpenRouter
Output $0.600 OpenRouter
Input $0.200 OpenRouter
Output $0.600 OpenRouter
Input $0.200 OpenRouter
Output $0.600 OpenRouter
Input $0.200 OpenRouter
Output $0.600 OpenRouter
Input $0.200 OpenRouter

Key Insights

Key data points from this page for quick reference and citation.

  • NVIDIA: Nemotron Nano 12B 2 VL Input price: $0.2/M tokens
  • NVIDIA: Nemotron Nano 12B 2 VL Output price: $0.6/M tokens
  • Context window: 131,072 tokens
  • Provider: NVIDIA
  • Model family: Nemotron Nano
  • Modalities: Image, Text, Video
  • Data source: OpenRouter, updated daily