What is the input price of NVIDIA: Llama 3.1 Nemotron 70B Instruct?

The input price of NVIDIA: Llama 3.1 Nemotron 70B Instruct is $1.20 per 1M tokens.

What is the output price of NVIDIA: Llama 3.1 Nemotron 70B Instruct?

The output price of NVIDIA: Llama 3.1 Nemotron 70B Instruct is $1.20 per 1M tokens.

What is the context window of NVIDIA: Llama 3.1 Nemotron 70B Instruct?

The context window of NVIDIA: Llama 3.1 Nemotron 70B Instruct is 131,072 tokens.

Which company provides NVIDIA: Llama 3.1 Nemotron 70B Instruct?

NVIDIA: Llama 3.1 Nemotron 70B Instruct is provided by NVIDIA.

NVIDIA: Llama 3.1 Nemotron 70B Instruct — Input Price, Context Window

Name: NVIDIA: Llama 3.1 Nemotron 70B Instruct
Brand: NVIDIA
Price: 1.20 USD per 1M tokens (input and output)

Input Price: $1.20 per 1M tokens (NT$38.4)
Output Price: $1.20 per 1M tokens (NT$38.4)
Context Window: 131K tokens (output limit: 16K tokens)
Price Source: OpenRouter route price; please verify with official pricing pages.

Overview

NVIDIA: Llama 3.1 Nemotron 70B Instruct is a large language model API from NVIDIA, part of the Llama 3.1 model family. At $1.20 per million input tokens and $1.20 per million output tokens, it sits in the mid-range on price, balancing capability against running cost. Its 131K-token context window (around 197 pages of text) can hold long documents, multi-file code, or extended conversations, though a single response is capped at 16K output tokens. On Artificial Analysis's Intelligence Index it scores 8 (F grade); the index serves as a proxy for general reasoning strength relative to the other models tracked here. All prices on this page reflect OpenRouter's routed rates and are re-synced automatically every day, so confirm them against the provider's official pricing before committing to production.

Full Pricing Dimensions

Dimension	Unit	Price (USD)	Price (TWD)	Effective From
Input	per 1M tokens	$1.20	NT$38.4	2026-05-03
Output	per 1M tokens	$1.20	NT$38.4	2026-05-03
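The per-token rates above translate into a per-request cost with simple arithmetic. The sketch below is illustrative only: the function name `estimate_cost_usd` is hypothetical, and the TWD conversion rate is an assumption derived from the table itself (NT$38.4 divided by $1.20), not an official exchange rate.

```python
from decimal import Decimal

# Rates copied from the pricing table (USD per 1M tokens).
INPUT_USD_PER_M = Decimal("1.20")
OUTPUT_USD_PER_M = Decimal("1.20")

# Assumption: the exchange rate implied by the table (NT$38.4 / $1.20).
TWD_PER_USD = Decimal("38.4") / Decimal("1.20")


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request at the listed rates."""
    million = Decimal(1_000_000)
    return (
        Decimal(input_tokens) * INPUT_USD_PER_M
        + Decimal(output_tokens) * OUTPUT_USD_PER_M
    ) / million


if __name__ == "__main__":
    # Example: a 100,000-token prompt with a 2,000-token reply.
    cost = estimate_cost_usd(100_000, 2_000)
    print(f"USD {cost:.4f}, TWD {cost * TWD_PER_USD:.4f}")
```

`Decimal` is used instead of `float` so that small per-request amounts add up without binary rounding drift when summed over many requests.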

Model Information

Provider: NVIDIA
Model Family: Llama 3.1
Version String: nvidia/llama-3.1-nemotron-70b-instruct
Status: Active
Modality: Text
Context Window: 131,072 tokens
Output Limit: 16,384 tokens
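The two limits above interact when sizing a request. A minimal sketch follows, assuming that prompt and output tokens share the same context window (the page does not state this explicitly); `max_output_budget` is a hypothetical helper name.

```python
CONTEXT_WINDOW = 131_072  # tokens, from the model information on this page
OUTPUT_LIMIT = 16_384     # tokens, from the model information on this page


def max_output_budget(prompt_tokens: int) -> int:
    """Largest output-token budget that fits after a prompt of the given size.

    Assumption: prompt and output tokens count against the same window.
    Returns 0 when the prompt alone fills or exceeds the window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    # The budget is capped by both the output limit and the remaining window.
    return max(0, min(OUTPUT_LIMIT, remaining))


if __name__ == "__main__":
    for prompt in (1_000, 120_000, 131_072):
        print(prompt, max_output_budget(prompt))
```

Under this assumption, the output limit is the binding constraint for short prompts, and the context window takes over once the prompt exceeds 114,688 tokens (131,072 minus 16,384).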

Index Metrics

Cross-domain capability indexes evaluated by Artificial Analysis.

Index	Score	Grade	Measured
Intelligence Index	8	F	2026-07-01

Benchmark Scores

Data source: Artificial Analysis

Benchmark	Score	Grade	Measured
AA-LCR	7.0%	F	2026-07-01
GPQA Diamond	46.5%	C	2026-07-01
HLE	4.6%	D	2026-07-01
Non-Hallucination	31.2%	-	2026-07-01
Omniscience Accuracy	16.4%	-	2026-07-01
SciCode	23.3%	C	2026-07-01
Tau2	23.1%	-	2026-07-01
TerminalBench	4.5%	-	2026-07-01

Performance Metrics

Real-world benchmarks from Artificial Analysis, updated every 72 hours.

Metric	Value	Measured
First Token Latency	5.0 s	2026-07-01
Output Speed	292 tokens/s	2026-07-01
Response Time	6.7 s	2026-07-01
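The three figures above are related by a simple latency model: total time is roughly the first-token latency plus the time to stream the remaining tokens. The sketch below assumes that model. The 500-token response length is also an assumption, chosen because it reproduces the listed 6.7 s; the page does not state the length used.

```python
FIRST_TOKEN_LATENCY_S = 5.0  # seconds, from the metrics on this page
OUTPUT_SPEED_TPS = 292.0     # tokens per second, from the metrics on this page


def estimated_response_time(output_tokens: int) -> float:
    """Approximate end-to-end time: first-token latency plus streaming time."""
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return FIRST_TOKEN_LATENCY_S + output_tokens / OUTPUT_SPEED_TPS


if __name__ == "__main__":
    # Assumed 500-token response; compare with the listed response time.
    print(f"{estimated_response_time(500):.1f} s")
```

With these numbers the first-token latency dominates: streaming 500 tokens at 292 tokens/s takes under two seconds, so shortening the output saves little wall-clock time.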

90-Day Price Trend

Input / Output price (USD per 1M tokens)

Past 90 days of records; every price change is shown here

Date	Dimension	Price (USD)	Source
2026-05-07	Output	$1.20	OpenRouter
2026-05-07	Input	$1.20	OpenRouter
2026-05-06	Output	$1.20	OpenRouter
2026-05-06	Input	$1.20	OpenRouter
2026-05-05	Output	$1.20	OpenRouter
2026-05-05	Input	$1.20	OpenRouter
2026-05-04	Output	$1.20	OpenRouter
2026-05-04	Input	$1.20	OpenRouter
2026-05-03	Output	$1.20	OpenRouter
2026-05-03	Input	$1.20	OpenRouter
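A daily-synced feed can log the same price many times, so finding actual price changes means collapsing repeated snapshots first. The following is a minimal sketch of that step; `PriceRecord` and `price_changes` are hypothetical names, and the sample rows are copied from the table above.

```python
from typing import Dict, Iterable, List, NamedTuple


class PriceRecord(NamedTuple):
    date: str        # ISO date, e.g. "2026-05-03"
    dimension: str   # "Input" or "Output"
    price_usd: str   # price per 1M tokens, kept as text to avoid float issues


def price_changes(records: Iterable[PriceRecord]) -> List[PriceRecord]:
    """Return the first record per dimension plus every later price change.

    Exact duplicate snapshots are dropped. ISO dates sort correctly as text.
    Limitation: two different prices for one dimension on the same date
    cannot be ordered, because the records carry no timestamp.
    """
    last_price: Dict[str, str] = {}
    changes: List[PriceRecord] = []
    for rec in sorted(set(records), key=lambda r: (r.date, r.dimension)):
        if last_price.get(rec.dimension) != rec.price_usd:
            changes.append(rec)
            last_price[rec.dimension] = rec.price_usd
    return changes


if __name__ == "__main__":
    rows = [
        PriceRecord("2026-05-07", "Output", "1.20"),
        PriceRecord("2026-05-07", "Input", "1.20"),
        PriceRecord("2026-05-07", "Output", "1.20"),
        PriceRecord("2026-05-03", "Output", "1.20"),
        PriceRecord("2026-05-03", "Input", "1.20"),
    ]
    for rec in price_changes(rows):
        print(rec.date, rec.dimension, rec.price_usd)
```

Applied to the records on this page, only the earliest entry for each dimension survives, which matches the flat $1.20 rate across the listed dates.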

Key Insights

Key data points from this page for quick reference and citation.

NVIDIA: Llama 3.1 Nemotron 70B Instruct input price: $1.20 per 1M tokens
NVIDIA: Llama 3.1 Nemotron 70B Instruct output price: $1.20 per 1M tokens
Context window: 131,072 tokens
Provider: NVIDIA
Model family: Llama 3.1
Modalities: Text
Data source: OpenRouter, updated daily
