← All Models

Xiaomi: MiMo-V2-Flash

🇨🇳 Xiaomi · MiMo-V2-Flash

Input Price $0.100 per million tokens NT$3.2
Output Price $0.300 per million tokens NT$9.6
Context Window 262K tokens Output limit: 66K
OpenRouter Route Price Please verify with official pricing pages
Use this model via OpenRouter →

Overview

Xiaomi: MiMo-V2-Flash is a large language model API from Xiaomi, part of its MiMo-V2-Flash model family. At $0.100 per million input tokens and $0.300 per million output tokens, it sits in the budget tier — among the cheaper options for high-throughput or cost-sensitive workloads. Output tokens cost about 3× as much as input, so prompt-heavy workloads run noticeably cheaper than generation-heavy ones. A large 262K-token context window (≈393 pages of text) lets it take in whole books, large codebases, or lengthy transcripts in a single call. On Artificial Analysis's Intelligence Index it scores 33 (C grade), a useful proxy for its general reasoning strength relative to the other models tracked here. All prices on this page reflect OpenRouter's routed rates and are re-synced automatically every day; confirm against the provider's official pricing before committing to production.

Dimension Unit Price (USD) Price (TWD) Effective From
Input per 1M tokens $0.100 NT$3.2
Output per 1M tokens $0.300 NT$9.6
Cached Input per 1M tokens $0.010 NT$0.32

Provider
小米 (Xiaomi)
Model Family
MiMo-V2-Flash
Version String
xiaomi/mimo-v2-flash
Status
Active
Modality
Text
Context Window
262,144 tokens
Output Limit
65,536 tokens

Index Metrics

Cross-domain capability indexes evaluated by Artificial Analysis — Artificial Analysis

Intelligence Index 33 C Measured: 2026-07-01

Benchmark Scores

Data source: Artificial Analysis

AA-LCR 64.3% B Measured: 2026-07-01
GPQA Diamond 83.5% A Measured: 2026-07-01
HLE 20.0% A Measured: 2026-07-01
IFBench 71.8% B Measured: 2026-07-01
Non-Hallucination 51.6% Measured: 2026-07-01
Omniscience Accuracy 20.2% Measured: 2026-07-01
SciCode 38.3% B Measured: 2026-07-01
Tau2 93.3% Measured: 2026-07-01
TerminalBench 31.1% Measured: 2026-07-01

Performance Metrics

Real-world benchmarks, updated every 72 hours by Artificial Analysis — Artificial Analysis

First Token Latency 2.6s Measured: 2026-07-01
Output Speed 94 t/s Measured: 2026-07-01
Response Time 29.2s Measured: 2026-07-01

Use Case Analysis

Model strengths, weaknesses, and ideal use cases based on benchmarks and official documentation

Strength

  • Efficient MoE Architecture 309B total parameters with only 15B active, reducing inference costs by 6x via hybrid attention. Official
  • Multi-Token Prediction Triples output speed during inference with lightweight MTP module. Official
  • Agentic Excellence Achieves #1 on SWE-Bench Verified among open-source models. Official

Best For

  • Software Engineering SWE-Bench Verified #1 performance for complex coding tasks. Official
  • Cost-Sensitive Deployments Excellent value with 150 tokens/sec inference speed at low cost. Official

90-Day Price Trend

Input / Output price (USD per 1M tokens)

Past 90 days of records; every price change is shown here

Date Dimension Price (USD) Source
Cached Input $0.010 OpenRouter
Output $0.300 OpenRouter
Input $0.100 OpenRouter
Cached Input $0.010 OpenRouter
Output $0.300 OpenRouter
Input $0.100 OpenRouter
Cached Input $0.010 OpenRouter
Output $0.300 OpenRouter
Input $0.100 OpenRouter
Cached Input $0.010 OpenRouter
Output $0.300 OpenRouter
Input $0.100 OpenRouter
Cached Input $0.010 OpenRouter
Output $0.300 OpenRouter
Input $0.100 OpenRouter
Cached Input $0.010 OpenRouter
Output $0.300 OpenRouter
Input $0.100 OpenRouter
Cached Input $0.010 OpenRouter
Output $0.300 OpenRouter
Input $0.100 OpenRouter
Cached Input $0.010 OpenRouter
Output $0.300 OpenRouter
Input $0.100 OpenRouter
Cached Input $0.010 OpenRouter
Output $0.300 OpenRouter
Input $0.100 OpenRouter
Cached Input $0.010 OpenRouter
Output $0.300 OpenRouter
Input $0.100 OpenRouter
Cached Input $0.010 OpenRouter
Output $0.300 OpenRouter
Input $0.100 OpenRouter
Cached Input $0.010 OpenRouter
Output $0.300 OpenRouter
Input $0.100 OpenRouter
Cached Input $0.010 OpenRouter
Output $0.300 OpenRouter
Input $0.100 OpenRouter
Cached Input $0.010 OpenRouter
Output $0.300 OpenRouter
Input $0.100 OpenRouter
Cached Input $0.010 OpenRouter
Output $0.300 OpenRouter
Input $0.100 OpenRouter
Cached Input $0.010 OpenRouter
Output $0.300 OpenRouter
Input $0.100 OpenRouter
Cached Input $0.010 OpenRouter
Output $0.300 OpenRouter

Key Insights

Key data points from this page for quick reference and citation.

  • Xiaomi: MiMo-V2-Flash Input price: $0.1/M tokens
  • Xiaomi: MiMo-V2-Flash Output price: $0.3/M tokens
  • Context window: 262,144 tokens
  • Provider: Xiaomi
  • Model family: MiMo-V2-Flash
  • Modalities: Text
  • Data source: OpenRouter, updated daily