Qwen
Qwen VL Plus
Qwen's Enhanced Large Visual Language Model. Significantly upgraded for detailed recognition capabilities and text recognition abilities, supporting ultra-high pixel resolutions up to millions of pixels and extreme aspect ratios for image input. It delivers significant performance across a broad range of visual tasks.
- Input / 1M tokens
- $0.137
- Output / 1M tokens
- $0.410
- Context window
- 131K tokens
- Provider
- Qwen
- Cached input / 1M
- $0.027
- Knowledge cutoff
- 2025-03-31
Performance
Median streaming throughput and first-token latency measured by Artificial Analysis.
- Output tokens / sec
- —
- Time to first token
- —