← All Models

Qwen: Qwen3 VL 8B Thinking

🇨🇳 Qwen (Alibaba) · Qwen3 VL

Input Price $0.117 per million tokens NT$3.7
Output Price $1.36 per million tokens NT$43.7
Context Window 256K tokens Output limit: 33K
OpenRouter Route Price Please verify with official pricing pages
Use this model via OpenRouter →

Overview

Qwen: Qwen3 VL 8B Thinking is a large language model API from Qwen (Alibaba), part of its Qwen3 VL model family. Priced at $0.117 per million input tokens and $1.36 per million output tokens, it occupies the mid-range, balancing capability against running cost. Output tokens cost about 12× as much as input, so prompt-heavy workloads run noticeably cheaper than generation-heavy ones. A large 256K-token context window (≈384 pages of text) lets it take in whole books, large codebases, or lengthy transcripts in a single call. Beyond plain text it also accepts Image input, so it can be applied to multimodal tasks rather than text alone. On Artificial Analysis's Intelligence Index it scores 11 (F grade), a useful proxy for its general reasoning strength relative to the other models tracked here. All prices on this page reflect OpenRouter's routed rates and are re-synced automatically every day; confirm against the provider's official pricing before committing to production.

Dimension Unit Price (USD) Price (TWD) Effective From
Input per 1M tokens $0.117 NT$3.7
Output per 1M tokens $1.36 NT$43.7

Provider
通義千問(阿里) (Qwen (Alibaba))
Model Family
Qwen3 VL
Version String
qwen/qwen3-vl-8b-thinking
Status
Active
Modality
Image, Text
Context Window
256,000 tokens
Output Limit
32,768 tokens

Index Metrics

Cross-domain capability indexes evaluated by Artificial Analysis — Artificial Analysis

Intelligence Index 11 F Measured: 2026-07-01

Benchmark Scores

Data source: Artificial Analysis

AA-LCR 31.0% D Measured: 2026-07-01
GPQA Diamond 57.9% C Measured: 2026-07-01
HLE 3.3% D Measured: 2026-07-01
IFBench 39.9% D Measured: 2026-07-01
MMMU Pro 56.6% C Measured: 2026-07-01
Non-Hallucination 9.3% Measured: 2026-07-01
Omniscience Accuracy 20.1% Measured: 2026-07-01
SciCode 21.9% C Measured: 2026-07-01
Tau2 22.5% Measured: 2026-07-01
TerminalBench 3.8% Measured: 2026-07-01

Performance Metrics

Real-world benchmarks, updated every 72 hours by Artificial Analysis — Artificial Analysis

First Token Latency 2.4s Measured: 2026-07-01
Output Speed 115 t/s Measured: 2026-07-01
Response Time 24.2s Measured: 2026-07-01

90-Day Price Trend

Input / Output price (USD per 1M tokens)

Past 90 days of records; every price change is shown here

Date Dimension Price (USD) Source
Output $1.36 OpenRouter
Input $0.117 OpenRouter
Output $1.36 OpenRouter
Input $0.117 OpenRouter
Output $1.36 OpenRouter
Input $0.117 OpenRouter
Output $1.36 OpenRouter
Input $0.117 OpenRouter
Output $1.36 OpenRouter
Input $0.117 OpenRouter
Output $1.36 OpenRouter
Input $0.117 OpenRouter
Output $1.36 OpenRouter
Input $0.117 OpenRouter
Output $1.36 OpenRouter
Input $0.117 OpenRouter
Output $1.36 OpenRouter
Input $0.117 OpenRouter
Output $1.36 OpenRouter
Input $0.117 OpenRouter
Output $1.36 OpenRouter
Input $0.117 OpenRouter
Output $1.36 OpenRouter
Input $0.117 OpenRouter
Output $1.36 OpenRouter
Input $0.117 OpenRouter
Output $1.36 OpenRouter
Input $0.117 OpenRouter
Output $1.36 OpenRouter
Input $0.117 OpenRouter
Output $1.36 OpenRouter
Input $0.117 OpenRouter
Output $1.36 OpenRouter
Input $0.117 OpenRouter
Output $1.36 OpenRouter
Input $0.117 OpenRouter
Output $1.36 OpenRouter
Input $0.117 OpenRouter
Output $1.36 OpenRouter
Input $0.117 OpenRouter
Output $1.36 OpenRouter
Input $0.117 OpenRouter
Output $1.36 OpenRouter
Input $0.117 OpenRouter
Output $1.36 OpenRouter
Input $0.117 OpenRouter
Output $1.36 OpenRouter
Input $0.117 OpenRouter
Output $1.36 OpenRouter
Input $0.117 OpenRouter

Key Insights

Key data points from this page for quick reference and citation.

  • Qwen: Qwen3 VL 8B Thinking Input price: $0.117/M tokens
  • Qwen: Qwen3 VL 8B Thinking Output price: $1.365/M tokens
  • Context window: 256,000 tokens
  • Provider: Qwen (Alibaba)
  • Model family: Qwen3 VL
  • Modalities: Image, Text
  • Data source: OpenRouter, updated daily