Qwen VL Plus

Qwen's enhanced large visual language model. It is significantly upgraded for detail recognition and text recognition, and it accepts image input at ultra-high resolutions of up to millions of pixels and at extreme aspect ratios. It performs strongly across a broad range of visual tasks.

Input / 1M tokens: $0.137
Output / 1M tokens: $0.410
Context window: 131K tokens
Provider: Qwen
Cached input / 1M: $0.027
Knowledge cutoff: 2025-03-31
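
As a rough illustration of how the per-million-token prices above translate into a per-request cost, here is a minimal sketch. The function name `estimate_cost` and its parameters are hypothetical, and the billing rule is an assumption: cached input tokens are treated as a subset of the input tokens and billed at the cached rate instead of the regular input rate. Check the provider's billing documentation before relying on it.

```python
# Prices in USD per 1M tokens, copied from the listing above.
INPUT_PER_M = 0.137
OUTPUT_PER_M = 0.410
CACHED_INPUT_PER_M = 0.027


def estimate_cost(input_tokens, output_tokens, cached_input_tokens=0):
    """Estimate the USD cost of one request.

    Assumption (not confirmed by the listing): cached_input_tokens is the
    part of input_tokens served from cache, billed at the cached rate.
    """
    if min(input_tokens, output_tokens, cached_input_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_input_tokens > input_tokens:
        raise ValueError("cached_input_tokens cannot exceed input_tokens")

    uncached_tokens = input_tokens - cached_input_tokens
    total_per_million = (
        uncached_tokens * INPUT_PER_M
        + cached_input_tokens * CACHED_INPUT_PER_M
        + output_tokens * OUTPUT_PER_M
    )
    return total_per_million / 1_000_000


if __name__ == "__main__":
    # 10,000 input tokens (2,000 of them cached) and 500 output tokens.
    cost = estimate_cost(10_000, 500, cached_input_tokens=2_000)
    print(f"${cost:.6f}")  # prints $0.001355
```

Under these assumptions, a request with 1M uncached input tokens and 1M output tokens would cost about $0.547.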

Performance

Median streaming throughput and first-token latency measured by Artificial Analysis.

Output tokens / sec: —
Time to first token: —