| 1 | Qwen: Qwen3 235B A22B Instruct 2507 | 75.3 | $0.09 | 880.7 |
| 2 | Meta: Llama 3.1 8B Instruct | 25.9 | $0.03 | 740.0 |
| 3 | Meta: Llama 3 8B Instruct | 25.9 | $0.04 | 740.0 |
| 4 | OpenAI: gpt-oss-20b | 61.1 | $0.09 | 718.8 |
| 5 | OpenAI: gpt-oss-120b | 78.2 | $0.11 | 683.0 |
| 6 | NVIDIA: Nemotron Nano 9B V2 | 55.7 | $0.10 | 557.0 |
| 7 | Qwen: Qwen-Turbo | 41.0 | $0.08 | 504.6 |
| 8 | Qwen: Qwen3 30B A3B | 72.6 | $0.18 | 403.3 |
| 9 | Qwen: Qwen3 30B A3B Instruct 2507 | 70.7 | $0.20 | 362.6 |
| 10 | Reka Flash 3 | 52.9 | $0.15 | 352.7 |
| 11 | Xiaomi: MiMo-V2-Flash | 65.6 | $0.19 | 345.3 |
| 12 | Qwen: Qwen3 32B | 53.5 | $0.16 | 334.4 |
| 13 | Google: Gemma 3n 4B | 29.6 | $0.09 | 328.9 |
| 14 | NVIDIA: Nemotron 3 Nano 30B A3B | 39.9 | $0.12 | 319.2 |
| 15 | Qwen: Qwen3 14B | 47.0 | $0.15 | 313.3 |
| 16 | Qwen: Qwen3 Coder 30B A3B Instruct | 51.6 | $0.17 | 303.5 |
| 17 | OpenAI: GPT-5 Nano | 67.6 | $0.22 | 300.4 |
| 18 | NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 | 74.8 | $0.25 | 299.2 |
| 19 | Qwen: Qwen3 30B A3B Thinking 2507 | 70.7 | $0.24 | 294.6 |
| 20 | Google: Gemini 2.0 Flash Lite | 53.5 | $0.19 | 285.3 |
| 21 | Google: Gemini 2.5 Flash Lite Preview 09-2025 | 70.9 | $0.25 | 283.6 |
| 22 | Mistral: Devstral Small 1.1 | 53.2 | $0.20 | 266.0 |
| 23 | Qwen: Qwen3 VL 32B Instruct | 66.8 | $0.26 | 256.9 |
| 24 | DeepSeek: DeepSeek V3.2 | 75.1 | $0.32 | 238.4 |
| 25 | Meta: Llama 3.3 70B Instruct | 49.8 | $0.21 | 237.1 |
| 26 | Nous: Hermes 3 70B Instruct | 69.9 | $0.30 | 233.0 |
| 27 | DeepSeek: R1 Distill Qwen 32B | 61.5 | $0.29 | 212.1 |
| 28 | OpenAI: GPT-4.1 Nano | 51.2 | $0.25 | 204.8 |
| 29 | Qwen: Qwen3 8B | 45.2 | $0.22 | 200.9 |
| 30 | Qwen: Qwen3 VL 8B Instruct | 57.9 | $0.29 | 199.7 |
| 31 | xAI: Grok 3 Mini | 79.1 | $0.40 | 197.7 |
| 32 | Qwen: Qwen3 VL 30B A3B Instruct | 62.0 | $0.33 | 190.8 |
| 33 | Google: Gemini 2.5 Flash Lite | 47.4 | $0.25 | 189.6 |
| 34 | xAI: Grok 4 Fast | 63.7 | $0.35 | 182.0 |
| 35 | xAI: Grok 4.1 Fast | 63.7 | $0.35 | 182.0 |
| 36 | AllenAI: Olmo 3 32B Think | 59.1 | $0.33 | 181.8 |
| 37 | Qwen: Qwen3 Next 80B A3B Thinking | 75.9 | $0.44 | 173.0 |
| 38 | Meta: Llama 3.2 1B Instruct | 19.6 | $0.11 | 172.7 |
| 39 | Qwen: QwQ 32B | 59.3 | $0.36 | 162.5 |
| 40 | DeepSeek: DeepSeek V3.1 Terminus | 75.1 | $0.50 | 150.2 |
| 41 | Upstage: Solar Pro 3 | 56.1 | $0.38 | 149.6 |
| 42 | Z.ai: GLM 4.5 Air | 73.3 | $0.49 | 149.6 |
| 43 | Qwen: Qwen3 VL 235B A22B Instruct | 77.2 | $0.54 | 143.0 |
| 44 | MiniMax: MiniMax M2.1 | 83.0 | $0.62 | 133.9 |
| 45 | Meta: Llama 3.2 3B Instruct | 25.5 | $0.20 | 130.4 |
| 46 | MiniMax: MiniMax M2.5 | 83.0 | $0.65 | 127.7 |
| 47 | Qwen: Qwen3 Next 80B A3B Instruct | 75.9 | $0.60 | 127.6 |
| 48 | MiniMax: MiniMax M2 | 77.7 | $0.63 | 123.8 |
| 49 | Mistral: Mistral 7B Instruct v0.1 | 17.7 | $0.15 | 118.0 |
| 50 | Baidu: ERNIE 4.5 300B A47B | 81.1 | $0.69 | 117.5 |
| 51 | Prime Intellect: INTELLECT-3 | 76.1 | $0.65 | 117.1 |
| 52 | OpenAI: GPT-4o-mini | 42.6 | $0.38 | 113.6 |
| 53 | MiniMax: MiniMax M2.7 | 83.0 | $0.75 | 110.7 |
| 54 | NVIDIA: Nemotron Nano 12B 2 VL | 43.9 | $0.40 | 109.8 |
| 55 | DeepSeek: DeepSeek V3.2 Speciale | 87.1 | $0.80 | 108.9 |
| 56 | Mistral: Saba | 42.4 | $0.40 | 106.0 |
| 57 | Kwaipilot: KAT-Coder-Pro V2 | 76.4 | $0.75 | 101.9 |
| 58 | Qwen: Qwen2.5 VL 72B Instruct | 49.1 | $0.50 | 98.2 |
| 59 | Qwen: Qwen3 235B A22B Thinking 2507 | 79.0 | $0.82 | 96.1 |
| 60 | Z.ai: GLM 4.6V | 56.6 | $0.60 | 94.3 |
| 61 | Meta: Llama 3.2 11B Vision Instruct | 22.1 | $0.24 | 90.2 |
| 62 | xAI: Grok Code Fast 1 | 72.7 | $0.85 | 85.5 |
| 63 | Qwen: Qwen3 VL 30B A3B Thinking | 72.0 | $0.84 | 85.2 |
| 64 | Qwen: Qwen3 VL 8B Thinking | 57.9 | $0.74 | 78.1 |
| 65 | OpenAI: GPT-5 Mini | 82.8 | $1.13 | 73.6 |
| 66 | Nous: Hermes 3 405B Instruct | 72.7 | $1.00 | 72.7 |
| 67 | OpenAI: GPT-5.1-Codex-Mini | 81.3 | $1.13 | 72.3 |
| 68 | OpenAI: GPT-4.1 Mini | 66.4 | $1.00 | 66.4 |
| 69 | MoonshotAI: Kimi K2 0905 | 76.7 | $1.20 | 63.9 |
| 70 | MoonshotAI: Kimi K2.5 | 76.6 | $1.22 | 62.8 |
| 71 | OpenAI: GPT-5.4 Nano | 42.8 | $0.72 | 59.0 |
| 72 | Google: Gemini 2.5 Flash | 81.2 | $1.40 | 58.0 |
| 73 | Z.ai: GLM 4.5V | 68.4 | $1.20 | 57.0 |
| 74 | Mistral: Mixtral 8x7B Instruct | 29.2 | $0.54 | 54.1 |
| 75 | MoonshotAI: Kimi K2 Thinking | 83.8 | $1.55 | 54.1 |
| 76 | Qwen: Qwen3 VL 235B A22B Thinking | 77.2 | $1.43 | 54.0 |
| 77 | Qwen: Qwen3 235B A22B | 61.3 | $1.14 | 53.9 |
| 78 | MiniMax: MiniMax M1 | 69.7 | $1.30 | 53.6 |
| 79 | DeepSeek: R1 Distill Llama 70B | 40.2 | $0.75 | 53.6 |
| 80 | MoonshotAI: Kimi K2 0711 | 76.6 | $1.44 | 53.4 |
| 81 | DeepSeek: R1 | 81.3 | $1.60 | 50.8 |
| 82 | Qwen2.5 Coder 32B Instruct | 41.7 | $0.83 | 50.2 |
| 83 | Mistral: Mistral Medium 3.1 | 58.8 | $1.20 | 49.0 |
| 84 | Mistral: Mistral Medium 3 | 58.8 | $1.20 | 49.0 |
| 85 | Perplexity: Sonar | 47.1 | $1.00 | 47.1 |
| 86 | Mistral: Devstral Medium | 49.2 | $1.20 | 41.0 |
| 87 | NVIDIA: Llama 3.1 Nemotron 70B Instruct | 46.5 | $1.20 | 38.8 |
| 88 | Qwen: Qwen3 Max Thinking | 77.6 | $2.34 | 33.2 |
| 89 | OpenAI: GPT-3.5 Turbo | 29.7 | $1.00 | 29.7 |
| 90 | OpenAI: o4 Mini | 78.4 | $2.75 | 28.5 |
| 91 | MoonshotAI: Kimi K2.6 | 76.6 | $2.70 | 28.4 |
| 92 | OpenAI: o3 Mini High | 77.3 | $2.75 | 28.1 |
| 93 | OpenAI: o3 Mini | 74.8 | $2.75 | 27.2 |
| 94 | OpenAI: GPT-5.4 Mini | 66.4 | $2.63 | 25.3 |
| 95 | Qwen: Qwen3 Max | 58.7 | $2.34 | 25.1 |
| 96 | Anthropic: Claude Haiku 4.5 | 67.2 | $3.00 | 22.4 |
| 97 | Mistral Large | 68.0 | $4.00 | 17.0 |
| 98 | OpenAI: o3 | 82.7 | $5.00 | 16.5 |
| 99 | OpenAI: GPT-5 Codex | 86.0 | $5.63 | 15.3 |
| 100 | OpenAI: GPT-5.1-Codex | 86.0 | $5.63 | 15.3 |
| 101 | OpenAI: GPT-5 | 85.4 | $5.63 | 15.2 |
| 102 | Google: Gemini 2.5 Pro | 84.4 | $5.63 | 15.0 |
| 103 | Google: Gemini 2.5 Pro Preview 05-06 | 82.2 | $5.63 | 14.6 |
| 104 | Mistral: Pixtral Large 2411 | 50.5 | $4.00 | 12.6 |
| 105 | Mistral Large 2407 | 47.2 | $4.00 | 11.8 |
| 106 | OpenAI: GPT-5.2-Codex | 86.0 | $7.88 | 10.9 |
| 107 | OpenAI: GPT-4o | 54.3 | $6.25 | 8.7 |
| 108 | Anthropic: Claude 3.7 Sonnet (thinking) | 77.2 | $9.00 | 8.6 |
| 109 | Cohere: Command A | 52.7 | $6.25 | 8.4 |
| 110 | AI21: Jamba Large 1.7 | 39.0 | $5.00 | 7.8 |
| 111 | Anthropic: Claude Sonnet 4 | 59.9 | $9.00 | 6.7 |
| 112 | Anthropic: Claude Sonnet 4.5 | 59.9 | $9.00 | 6.7 |
| 113 | Anthropic: Claude Sonnet 4.6 | 59.9 | $9.00 | 6.7 |
| 114 | Perplexity: Sonar Pro | 57.8 | $9.00 | 6.4 |
| 115 | OpenAI: GPT-4o (2024-05-13) | 52.6 | $10.00 | 5.3 |
| 116 | xAI: Grok 3 Beta | 47.1 | $9.00 | 5.2 |
| 117 | Cohere: Command R+ (08-2024) | 32.3 | $6.25 | 5.2 |
| 118 | Anthropic: Claude Opus 4.7 | 48.9 | $15.00 | 3.3 |
| 119 | Anthropic: Claude Opus 4.6 | 48.9 | $15.00 | 3.3 |
| 120 | Anthropic: Claude Opus 4.5 | 48.9 | $15.00 | 3.3 |
| 121 | OpenAI: o1 | 74.7 | $37.50 | 2.0 |
| 122 | Anthropic: Claude Opus 4 | 48.9 | $45.00 | 1.1 |
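
The last column in the rows above appears to be the score divided by the price. The following is a minimal sketch for re-deriving that figure from a row, assuming the columns are rank, model, score, price in USD, and score per dollar. The header row is not part of this section, so the field names `score`, `price_usd`, and `ratio` are hypothetical labels, as are the helpers `parse_row` and `score_per_dollar`.

```python
from typing import NamedTuple


class Row(NamedTuple):
    # Field names are assumed; the table's header row is not shown in this section.
    rank: int
    model: str
    score: float
    price_usd: float
    ratio: float


def parse_row(line: str) -> Row:
    """Split one pipe-delimited table row into typed fields."""
    cells = [c.strip() for c in line.strip().strip("|").split("|")]
    rank, model, score, price, ratio = cells
    return Row(int(rank), model, float(score), float(price.lstrip("$")), float(ratio))


def score_per_dollar(row: Row) -> float:
    """Recompute the ratio from the displayed score and price, to one decimal."""
    return round(row.score / row.price_usd, 1)


# Three rows copied verbatim from the table above.
sample = [
    "| 66 | Nous: Hermes 3 405B Instruct | 72.7 | $1.00 | 72.7 |",
    "| 121 | OpenAI: o1 | 74.7 | $37.50 | 2.0 |",
    "| 122 | Anthropic: Claude Opus 4 | 48.9 | $45.00 | 1.1 |",
]
rows = [parse_row(line) for line in sample]

for r in rows:
    # Print the listed ratio next to the recomputed one for comparison.
    print(f"{r.model}: listed {r.ratio}, recomputed {score_per_dollar(r)}")
```

For rows with low prices, the recomputed value can differ from the listed one. Rank 1, for example, gives 75.3 / 0.09 ≈ 836.7 against a listed 880.7, which suggests the listed ratios were computed from prices with more precision than the two decimals displayed.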