AI Model Picker

Model Suggestions

Best for General Use (Balanced)

Grok 4.3 (high)

xAI·4/30/26

Score: 62.7

Pricing (per 1M)

Input: $1.25
Output: $2.50

Speed

Output t/s: 171.9
TTFT: 20.82s

Benchmarks

Intelligence: 53.2
Coding: 41
GPQA: 90.1
HLE: 35.0
SCICODE: 47.3
IFBENCH: 81.3
LCR: 64.3
TERMINALBENCH HARD: 37.9
TAU2: 97.7

Gemini 3.5 Flash (high)

Google·5/19/26

Score: 62.0

Pricing (per 1M)

Input: $1.50
Output: $9.00

Speed

Output t/s: 213.6
TTFT: 11.73s

Benchmarks

Intelligence: 55.3
Coding: 45
GPQA: 92.2
HLE: 41.0
SCICODE: 53.1
IFBENCH: 76.3
LCR: 69.3
TERMINALBENCH HARD: 40.9
TAU2: 95.3

Gemini 3.5 Flash (medium)

Google·5/19/26

Score: 61.6

Pricing (per 1M)

Input: $1.50
Output: $9.00

Speed

Output t/s: 204.2
TTFT: 10.78s

Benchmarks

Intelligence: 54.8
Coding: 43.9
GPQA: 92.1
HLE: 39.9
SCICODE: 53.0
IFBENCH: 74.6
LCR: 71.0
TERMINALBENCH HARD: 39.4
TAU2: 95.6
#4 Qwen3.7 Max · Alibaba · Score: 60.6
#5 Nemotron 3 Ultra 550B A55B (Reasoning) · NVIDIA · Score: 59.9
#6 Gemini 3 Flash Preview (Reasoning) · Google · Score: 59.9

Best for Coding Tools

Claude Fable 5 (Adaptive Reasoning, Max Effort, Opus 4.8 Fallback)

Anthropic·5/27/26

Score: 57.4

Pricing (per 1M)

Input: $10.00
Output: $50.00

Speed

Output t/s: 62.8
TTFT: 61.80s

Benchmarks

Intelligence: 64.9
Coding: 62
GPQA: 92.6
HLE: 53.3
SCICODE: 60.2
IFBENCH: 63.5
LCR: 70.0
TERMINALBENCH HARD: 62.9
TAU2: 98.5

GPT-5.4 (xhigh)

OpenAI·3/5/26

Score: 56.0

Pricing (per 1M)

Input: $2.50
Output: $15.00

Speed

Output t/s: 94.2
TTFT: 165.75s

Benchmarks

Intelligence: 56.8
Coding: 57.2
GPQA: 92.0
HLE: 41.6
SCICODE: 56.6
IFBENCH: 73.9
LCR: 74.0
TERMINALBENCH HARD: 57.6
TAU2: 87.1

Gemini 3.1 Pro Preview

Google·2/19/26

Score: 55.9

Pricing (per 1M)

Input: $2.00
Output: $12.00

Speed

Output t/s: 129.2
TTFT: 19.76s

Benchmarks

Intelligence: 57.2
Coding: 55.5
GPQA: 94.1
HLE: 44.7
SCICODE: 58.9
IFBENCH: 77.1
LCR: 72.7
TERMINALBENCH HARD: 53.8
TAU2: 95.6
#4 GPT-5.4 mini (xhigh) · OpenAI · Score: 54.6
#5 GPT-5.5 (xhigh) · OpenAI · Score: 54.6
#6 GPT-5.5 (high) · OpenAI · Score: 54.0

Best Value (Cost Efficient)

MiniMax-M3

MiniMax·5/31/26

Score: 67.2

Pricing (per 1M)

Input: $0.30
Output: $1.20

Speed

Output t/s: 47.1
TTFT: 2.38s

Benchmarks

Intelligence: 54.7
Coding: 43.4
GPQA: 92.9
HLE: 37.1
SCICODE: 45.4
IFBENCH: 82.9
LCR: 74.0
TERMINALBENCH HARD: 42.4
TAU2: 88.9

MiMo-V2.5-Pro

Xiaomi·4/22/26

Score: 66.3

Pricing (per 1M)

Input: $0.43
Output: $0.87

Speed

Output t/s: 38.7
TTFT: 2.05s

Benchmarks

Intelligence: 53.8
Coding: 45.5
GPQA: 86.6
HLE: 33.8
SCICODE: 50.2
IFBENCH: 79.9
LCR: 73.3
TERMINALBENCH HARD: 43.2
TAU2: 94.2

Qwen3.7 Plus

Alibaba·6/3/26

Score: 66.2

Pricing (per 1M)

Input: $0.40
Output: $1.16

Speed

Output t/s: 52.7
TTFT: 1.27s

Benchmarks

Intelligence: 53.3
Coding: 46.5
GPQA: 90.0
HLE: 33.4
SCICODE: 45.5
IFBENCH: 78.0
LCR: 65.0
TERMINALBENCH HARD: 47.0
TAU2: 93.0
#4 MiMo-V2.5 · Xiaomi · Score: 65.8
#5 Grok 4.3 (high) · xAI · Score: 65.7
#6 DeepSeek V4 Flash (Reasoning, High Effort) · DeepSeek · Score: 65.6

Smart & Fast (Intelligence > Speed > Price)

Gemini 3.5 Flash (high)

Google·5/19/26

Score: 60.3

Pricing (per 1M)

Input: $1.50
Output: $9.00

Speed

Output t/s: 213.6
TTFT: 11.73s

Benchmarks

Intelligence: 55.3
Coding: 45
GPQA: 92.2
HLE: 41.0
SCICODE: 53.1
IFBENCH: 76.3
LCR: 69.3
TERMINALBENCH HARD: 40.9
TAU2: 95.3

Gemini 3.5 Flash (medium)

Google·5/19/26

Score: 59.9

Pricing (per 1M)

Input: $1.50
Output: $9.00

Speed

Output t/s: 204.2
TTFT: 10.78s

Benchmarks

Intelligence: 54.8
Coding: 43.9
GPQA: 92.1
HLE: 39.9
SCICODE: 53.0
IFBENCH: 74.6
LCR: 71.0
TERMINALBENCH HARD: 39.4
TAU2: 95.6

Qwen3.7 Max

Alibaba·5/21/26

Score: 59.7

Pricing (per 1M)

Input: $2.50
Output: $7.50

Speed

Output t/s: 169.1
TTFT: 1.58s

Benchmarks

Intelligence: 56.6
Coding: 50.1
GPQA: 92.3
HLE: 38.1
SCICODE: 48.8
IFBENCH: 80.5
LCR: 69.0
TERMINALBENCH HARD: 50.8
TAU2: 94.7
#4 Claude Fable 5 (Adaptive Reasoning, Max Effort, Opus 4.8 Fallback) · Anthropic · Score: 58.3
#5 Grok 4.3 (high) · xAI · Score: 58.0
#6 Gemini 3.1 Pro Preview · Google · Score: 57.8
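
Reading the cards: a rough sketch of how the pricing and speed figures above could be turned into a per-request estimate. It assumes "per 1M" means US dollars per one million tokens, that TTFT is the wait in seconds before the first output token, and that output then streams at the listed tokens-per-second rate. The ModelCard class and both function names are hypothetical helpers for illustration, not part of any provider's API, and the estimate covers only the token counts passed in.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelCard:
    """Pricing and speed fields copied from one card (hypothetical container)."""
    name: str
    input_usd_per_1m: float     # "Input" under "Pricing (per 1M)"
    output_usd_per_1m: float    # "Output" under "Pricing (per 1M)"
    output_tokens_per_s: float  # "Output t/s"
    ttft_s: float               # "TTFT", assumed to be time to first token


def request_cost_usd(card: ModelCard, input_tokens: int, output_tokens: int) -> float:
    """Estimated cost of one request, assuming prices are per one million tokens."""
    total = (input_tokens * card.input_usd_per_1m
             + output_tokens * card.output_usd_per_1m)
    return total / 1_000_000


def response_time_s(card: ModelCard, output_tokens: int) -> float:
    """Estimated wall-clock time: TTFT plus streaming at a constant output rate."""
    return card.ttft_s + output_tokens / card.output_tokens_per_s


# Figures taken from the cards above.
MINIMAX_M3 = ModelCard("MiniMax-M3", 0.30, 1.20, 47.1, 2.38)
GEMINI_35_FLASH_HIGH = ModelCard("Gemini 3.5 Flash (high)", 1.50, 9.00, 213.6, 11.73)

if __name__ == "__main__":
    for card in (MINIMAX_M3, GEMINI_35_FLASH_HIGH):
        cost = request_cost_usd(card, input_tokens=10_000, output_tokens=1_000)
        seconds = response_time_s(card, output_tokens=1_000)
        print(f"{card.name}: ${cost:.4f}, about {seconds:.1f}s")
```

Under these assumptions, a request with 10,000 input tokens and 1,000 output tokens comes to about $0.0042 and 23.6 seconds on MiniMax-M3, versus about $0.024 and 16.4 seconds on Gemini 3.5 Flash (high). On a response of that length, the lower TTFT does not make up for the slower output rate.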