Model Comparison Leaderboard
Category
OpenAI
ChatGPT 4o
Google
Imagen 3.0
Reve
Reve Image (Halfmoon)
Recraft
Recraft V3
Ideogram
Ideogram V2
Minimax
MiniMax Image-01
Black Forest Labs
Flux 1.1 Pro Ultra
Midjourney
Midjourney V6.1
Midjourney
Midjourney v7
OpenAI
DALL-E 3
XAI
Grok 2 Image
Overall Score
(100 examples)
Refusals: 11
Refusals: 3
Avg: 7.36 / 10
Avg: 7.32 / 10
Avg: 7.20 / 10
Avg: 7.16 / 10
Refusals: 1
Avg: 6.99 / 10
Refusals: 2
Avg: 6.60 / 10
Refusals: 2
Avg: 6.51 / 10
Avg: 5.39 / 10
Refusals: 1
(10 examples)
Avg Score: 7.20
Avg Score: 7.20
7.75 / 10
Refusals: 2
6.70 / 10
7.10 / 10
7.30 / 10
7.20 / 10
6.70 / 10
7.60 / 10
4.90 / 10
(10 examples)
Avg Score: 7.96
Avg Score: 7.96
Refusals: 3
7.70 / 10
8.20 / 10
8.00 / 10
7.80 / 10
7.40 / 10
5.60 / 10
(10 examples)
Avg Score: 6.97
Avg Score: 6.97
7.33 / 10
Refusals: 1
Refusals: 2
6.00 / 10
7.90 / 10
7.56 / 10
Refusals: 1
6.70 / 10
6.20 / 10
5.70 / 10
4.67 / 10
Refusals: 1
(10 examples)
Avg Score: 7.12
Avg Score: 7.12
Refusals: 3
7.30 / 10
6.60 / 10
6.60 / 10
6.90 / 10
7.62 / 10
Refusals: 2
7.50 / 10
Refusals: 2
6.00 / 10
5.40 / 10
(10 examples)
Avg Score: 6.59
Avg Score: 6.59
5.70 / 10
7.00 / 10
5.50 / 10
7.20 / 10
4.90 / 10
6.00 / 10
6.80 / 10
5.80 / 10
(10 examples)
Avg Score: 7.55
Avg Score: 7.55
7.70 / 10
8.00 / 10
8.10 / 10
7.80 / 10
6.90 / 10
5.60 / 10
5.10 / 10
(10 examples)
Avg Score: 7.73
Avg Score: 7.73
Refusals: 1
Refusals: 1
7.80 / 10
8.00 / 10
7.80 / 10
7.20 / 10
7.20 / 10
8.20 / 10
5.70 / 10
6.50 / 10
(10 examples)
Avg Score: 7.28
Avg Score: 7.28
7.60 / 10
7.10 / 10
6.70 / 10
6.90 / 10
7.50 / 10
7.90 / 10
6.50 / 10
5.30 / 10
(10 examples)
Avg Score: 6.92
Avg Score: 6.92
6.60 / 10
7.50 / 10
7.30 / 10
6.60 / 10
7.10 / 10
4.30 / 10
6.60 / 10
5.80 / 10
(10 examples)
Avg Score: 5.36
Avg Score: 5.36
Refusals: 1
5.60 / 10
5.40 / 10
5.20 / 10
4.80 / 10
4.80 / 10
4.50 / 10
5.50 / 10
4.80 / 10