NVIDIA: Llama 3.1 Nemotron 70B Instruct
🇺🇸 NVIDIA · Llama 3.1
Overview
NVIDIA: Llama 3.1 Nemotron 70B Instruct is a large language model API from NVIDIA, part of its Llama 3.1 model family. Priced at $1.20 per million input tokens and $1.20 per million output tokens, it occupies the mid-range, balancing capability against running cost. The 131K-token context window — around 197 pages of text — comfortably handles long documents, multi-file code, or extended conversations. On Artificial Analysis's Intelligence Index it scores 8 (F grade), a useful proxy for its general reasoning strength relative to the other models tracked here. All prices on this page reflect OpenRouter's routed rates and are re-synced automatically every day; confirm against the provider's official pricing before committing to production.
| Dimension | Unit | Price (USD) |
|---|---|---|
| Input | per 1M tokens | $1.20 |
| Output | per 1M tokens | $1.20 |
- Provider
- NVIDIA
- Model Family
- Llama 3.1
- Version String
- nvidia/llama-3.1-nemotron-70b-instruct
- Status
- Active
- Modality
- Text
- Context Window
- 131,072 tokens
- Output Limit
- 16,384 tokens
Index Metrics
Cross-domain capability indexes evaluated by Artificial Analysis — Artificial Analysis
Benchmark Scores
Data source: Artificial Analysis
Performance Metrics
Real-world benchmarks, updated every 72 hours by Artificial Analysis — Artificial Analysis
90-Day Price Trend
Input / Output price (USD per 1M tokens)
Past 90 days of records; every price change is shown here
| Date | Dimension | Price (USD) | Source |
|---|---|---|---|
| Output | $1.20 | OpenRouter | |
| Input | $1.20 | OpenRouter | |
| Output | $1.20 | OpenRouter | |
| Input | $1.20 | OpenRouter | |
| Output | $1.20 | OpenRouter | |
| Input | $1.20 | OpenRouter | |
| Output | $1.20 | OpenRouter | |
| Input | $1.20 | OpenRouter | |
| Output | $1.20 | OpenRouter | |
| Input | $1.20 | OpenRouter | |
| Output | $1.20 | OpenRouter | |
| Input | $1.20 | OpenRouter | |
| Output | $1.20 | OpenRouter | |
| Input | $1.20 | OpenRouter | |
| Output | $1.20 | OpenRouter | |
| Input | $1.20 | OpenRouter | |
| Output | $1.20 | OpenRouter | |
| Input | $1.20 | OpenRouter | |
| Output | $1.20 | OpenRouter | |
| Input | $1.20 | OpenRouter | |
| Output | $1.20 | OpenRouter | |
| Input | $1.20 | OpenRouter | |
| Output | $1.20 | OpenRouter | |
| Input | $1.20 | OpenRouter | |
| Output | $1.20 | OpenRouter | |
| Input | $1.20 | OpenRouter | |
| Output | $1.20 | OpenRouter | |
| Input | $1.20 | OpenRouter | |
| Output | $1.20 | OpenRouter | |
| Input | $1.20 | OpenRouter | |
| Output | $1.20 | OpenRouter | |
| Input | $1.20 | OpenRouter | |
| Output | $1.20 | OpenRouter | |
| Input | $1.20 | OpenRouter | |
| Output | $1.20 | OpenRouter | |
| Input | $1.20 | OpenRouter |
Key Insights
Key data points from this page for quick reference and citation.
- NVIDIA: Llama 3.1 Nemotron 70B Instruct Input price: $1.2/M tokens
- NVIDIA: Llama 3.1 Nemotron 70B Instruct Output price: $1.2/M tokens
- Context window: 131,072 tokens
- Provider: NVIDIA
- Model family: Llama 3.1
- Modalities: Text
- Data source: OpenRouter, updated daily