Meta: Llama 3.1 8B Instruct
🇺🇸 Meta · Llama 3.1
Overview
Meta: Llama 3.1 8B Instruct is a large language model API from Meta, part of its Llama 3.1 model family. At $0.020 per million input tokens and $0.030 per million output tokens, it sits in the budget tier — among the cheaper options for high-throughput or cost-sensitive workloads. Output tokens cost about 2× as much as input, so prompt-heavy workloads run noticeably cheaper than generation-heavy ones. The 131K-token context window — around 197 pages of text — comfortably handles long documents, multi-file code, or extended conversations. On Artificial Analysis's Intelligence Index it scores 8 (F grade), a useful proxy for its general reasoning strength relative to the other models tracked here. All prices on this page reflect OpenRouter's routed rates and are re-synced automatically every day; confirm against the provider's official pricing before committing to production.
| Dimension | Unit | Price (USD) |
|---|---|---|
| Input | per 1M tokens | $0.020 |
| Output | per 1M tokens | $0.030 |
- Provider
- Meta
- Model Family
- Llama 3.1
- Version String
- meta-llama/llama-3.1-8b-instruct
- Status
- Active
- Modality
- Text
- Context Window
- 131,072 tokens
- Output Limit
- 16,384 tokens
Index Metrics
Cross-domain capability indexes evaluated by Artificial Analysis — Artificial Analysis
Benchmark Scores
Data source: Artificial Analysis
Performance Metrics
Real-world benchmarks, updated every 72 hours by Artificial Analysis — Artificial Analysis
Use Case Analysis
Model strengths, weaknesses, and ideal use cases based on benchmarks and official documentation
Strength
- Exceptional Efficiency MMLU (CoT) 73.0%, GSM8K 84.5%. Best-in-class for 8B parameter models. Benchmark
- Runs on Consumer Hardware Fits on single RTX 4090 (24GB VRAM) or even smaller GPUs with quantization. Official
Best For
- Personal/Small Business AI Can run locally on personal computers; no cloud dependency or API costs. Official
- Fine-tuning Base Model Excellent starting point for domain-specific fine-tuning with limited compute. Official
Not Recommended
- Complex Reasoning Tasks Use 70B or 405B for multi-step logic, advanced math, or scientific reasoning. Official
90-Day Price Trend
Input / Output price (USD per 1M tokens)
Past 90 days of records; every price change is shown here
| Date | Dimension | Price (USD) | Source |
|---|---|---|---|
| Output | $0.030 | OpenRouter | |
| Input | $0.020 | OpenRouter | |
| Output | $0.030 | OpenRouter | |
| Input | $0.020 | OpenRouter | |
| Output | $0.030 | OpenRouter | |
| Input | $0.020 | OpenRouter | |
| Output | $0.030 | OpenRouter | |
| Input | $0.020 | OpenRouter | |
| Output | $0.030 | OpenRouter | |
| Input | $0.020 | OpenRouter | |
| Output | $0.030 | OpenRouter | |
| Input | $0.020 | OpenRouter | |
| Output | $0.030 | OpenRouter | |
| Input | $0.020 | OpenRouter | |
| Output | $0.030 | OpenRouter | |
| Input | $0.020 | OpenRouter | |
| Output | $0.030 | OpenRouter | |
| Input | $0.020 | OpenRouter | |
| Output | $0.030 | OpenRouter | |
| Input | $0.020 | OpenRouter | |
| Output | $0.030 | OpenRouter | |
| Input | $0.020 | OpenRouter | |
| Output | $0.030 | OpenRouter | |
| Input | $0.020 | OpenRouter | |
| Output | $0.030 | OpenRouter | |
| Input | $0.020 | OpenRouter | |
| Output | $0.030 | OpenRouter | |
| Input | $0.020 | OpenRouter | |
| Output | $0.030 | OpenRouter | |
| Input | $0.020 | OpenRouter | |
| Output | $0.030 | OpenRouter | |
| Input | $0.020 | OpenRouter | |
| Output | $0.030 | OpenRouter | |
| Input | $0.020 | OpenRouter | |
| Output | $0.030 | OpenRouter | |
| Input | $0.020 | OpenRouter | |
| Output | $0.030 | OpenRouter | |
| Input | $0.020 | OpenRouter | |
| Output | $0.030 | OpenRouter | |
| Input | $0.020 | OpenRouter | |
| Output | $0.030 | OpenRouter | |
| Input | $0.020 | OpenRouter | |
| Output | $0.030 | OpenRouter | |
| Input | $0.020 | OpenRouter | |
| Output | $0.030 | OpenRouter | |
| Input | $0.020 | OpenRouter | |
| Output | $0.030 | OpenRouter | |
| Input | $0.020 | OpenRouter | |
| Output | $0.030 | OpenRouter | |
| Input | $0.020 | OpenRouter |
Key Insights
Key data points from this page for quick reference and citation.
- Meta: Llama 3.1 8B Instruct Input price: $0.02/M tokens
- Meta: Llama 3.1 8B Instruct Output price: $0.03/M tokens
- Context window: 131,072 tokens
- Provider: Meta
- Model family: Llama 3.1
- Modalities: Text
- Data source: OpenRouter, updated daily