Prime Intellect

INTELLECT-3

INTELLECT-3 is a 106B-parameter Mixture-of-Experts (MoE) model with 12B active parameters, post-trained from GLM-4.5-Air-Base using supervised fine-tuning (SFT) followed by large-scale reinforcement learning (RL). It is reported to deliver state-of-the-art performance for its size across math, code, science, and general reasoning, outperforming many larger frontier models. It is designed for multi-step problem solving and maintains high accuracy on structured tasks, while the MoE architecture keeps inference efficient because only a fraction of the parameters is active per token.

Input / 1M tokens: $0.20
Output / 1M tokens: $1.10
Context window: 131K tokens
Provider: Prime Intellect
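
As a rough illustration of how the per-token prices and context window above combine, the following sketch estimates the cost of a single request. The function names `estimate_cost` and `fits_context` are hypothetical helpers, not part of any provider API. Two assumptions are made here: "131K" is read as 131,000 tokens (the exact limit may differ), and input and output tokens are assumed to share the same context window.

```python
from decimal import Decimal

# Prices taken from the listing above, in USD per 1M tokens.
INPUT_PRICE_PER_M = Decimal("0.20")
OUTPUT_PRICE_PER_M = Decimal("1.10")

# Assumption: "131K" is read as 131,000 tokens; the exact limit may differ.
CONTEXT_WINDOW = 131_000


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    million = Decimal(1_000_000)
    input_cost = Decimal(input_tokens) * INPUT_PRICE_PER_M / million
    output_cost = Decimal(output_tokens) * OUTPUT_PRICE_PER_M / million
    return input_cost + output_cost


def fits_context(input_tokens: int, output_tokens: int) -> bool:
    """Check the request against the context window.

    Assumption: prompt and completion tokens count toward the same window.
    """
    return input_tokens + output_tokens <= CONTEXT_WINDOW


if __name__ == "__main__":
    # Example: a 100,000-token prompt with a 5,000-token completion.
    print(estimate_cost(100_000, 5_000))  # 0.02 + 0.0055 USD
    print(fits_context(100_000, 5_000))
```

`Decimal` is used instead of floats so that small per-request amounts add up without binary rounding noise when many requests are summed.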

Performance

Median streaming throughput and first-token latency measured by Artificial Analysis.

Output tokens / sec: —
Time to first token: —

Benchmarks

Intelligence, coding, and math indexes plus the underlying evaluation scores.

Intelligence Index: 22
Coding Index: 19
Math Index: 88
MMLU-Pro: 82.2%
GPQA: 76.1%
HLE: 12.1%
LiveCodeBench: 77.7%
SciCode: 39.1%
MATH-500: —
AIME: —

Benchmarks via Artificial Analysis