Image Battle | AI Image Comparison

AI Image Battle Gallery

Battle Category:

Toggle Models:

Prompt

Google

Nano Banana Pro

Avg: 8.80 / 10

Refusals: 0

OpenAI

GPT Image 1.5

Avg: 7.70 / 10

Refusals: 0

Ideogram

Ideogram 3.0 (Quality)

Avg: 6.90 / 10

Refusals: 0

Reve

Reve Image (Halfmoon)

Avg: 6.60 / 10

Refusals: 0

XAI

Grok Imagine

Avg: 6.60 / 10

Refusals: 0

Black Forest Labs

FLUX.1 Kontext Max

Avg: 6.57 / 10

Refusals: 3

Google

Imagen 4.0 Ultra

Avg: 6.30 / 10

Refusals: 0

OpenAI

ChatGPT 4o

Avg: 6.22 / 10

Refusals: 1

Bytedance

Seedream 4.0

Avg: 6.20 / 10

Refusals: 0

Google

Nano Banana (2.5 Flash)

Avg: 6.20 / 10

Refusals: 0

Black Forest Labs

Flux 2 Pro

Avg: 6.10 / 10

Refusals: 0

Bytedance

Seedream 4.5

Avg: 6.00 / 10

Refusals: 0

Alibaba

Z-Image Turbo

Avg: 5.80 / 10

Refusals: 0

Google

Imagen 3.0

Avg: 5.70 / 10

Refusals: 0

Recraft

Recraft V3

Avg: 5.60 / 10

Refusals: 0

Minimax

MiniMax Image-01

Avg: 5.40 / 10

Refusals: 0

XAI

Grok 2 Image

Avg: 5.30 / 10

Refusals: 0

Midjourney

Midjourney V6.1

Avg: 5.20 / 10

Refusals: 0

OpenAI

DALL-E 3

Avg: 5.20 / 10

Refusals: 0

Bytedance

Seedream 3.0

Avg: 5.10 / 10

Refusals: 0

Ideogram

Ideogram V2

Avg: 5.10 / 10

Refusals: 0

Black Forest Labs

Flux 1.1 Pro Ultra

Avg: 5.00 / 10

Refusals: 0

Midjourney

Midjourney v7

Avg: 4.30 / 10

Refusals: 0

Prompt:

Rain-slicked Singapore street, 3 AM. A lone elderly hawker cleans his cart under one flickering fluorescent light. Steam rises gently. Low-angle shot, photorealistic

Description:

Realistic reflections, elderly figure realism, detailed hawker cart, subtle steam effects, cultural authenticity. Validates: Photorealism, low-angle accuracy, complex lighting, narrative mood, texture realism.

Nano Banana Pro

20.8s

Score: 8 / 10

GPT Image 1.5

35.2s

Score: 9 / 10

Ideogram 3.0 (Quality)

11.2s

Score: 8 / 10

Reve Image (Halfmoon)

8.0s

Score: 9 / 10

Grok Imagine

3.9s

Score: 8 / 10

FLUX.1 Kontext Max

14.4s

Score: 6 / 10

Imagen 4.0 Ultra

13.8s

Score: 5 / 10

ChatGPT 4o

5.0s

Score: 2 / 10

Seedream 4.0

13.5s

Score: 5 / 10

Nano Banana (2.5 Flash)

8.1s

Score: 8 / 10

Flux 2 Pro

12.7s

Score: 4 / 10

Seedream 4.5

17.8s

Score: 5 / 10

Z-Image Turbo

6.6s

Score: 7 / 10

Imagen 3.0

9.5s

Score: 4 / 10

Recraft V3

14.4s

Score: 3 / 10

MiniMax Image-01

31.8s

Score: 7 / 10

Grok 2 Image

11.9s

Score: 8 / 10

Midjourney V6.1

47.7s

Score: 6 / 10

DALL-E 3

21.6s

Score: 4 / 10

Seedream 3.0

8.4s

Score: 7 / 10

Ideogram V2

20.6s

Score: 6 / 10

Flux 1.1 Pro Ultra

20.1s

Score: 4 / 10

Midjourney v7

45.0s

Score: 5 / 10

Prompt:

A realistic astronaut being ridden by a horse, photorealistic depiction in outer space with accurate lighting and proportions

Description:

Photorealistic style with accurate lighting/proportions. Tests logical/spatial coherence for an absurd, reversed scenario (zero-gravity, spacesuit/horse anatomy).

Nano Banana Pro

20.5s

Score: 9 / 10

GPT Image 1.5

37.5s

Score: 8 / 10

Ideogram 3.0 (Quality)

14.3s

Score: 3 / 10

Reve Image (Halfmoon)

94.1s

Score: 2 / 10

Grok Imagine

4.3s

Score: 4 / 10

FLUX.1 Kontext Max

14.3s

Score: 3 / 10

Imagen 4.0 Ultra

10.1s

Score: 4 / 10

Generation failed

I can’t create that image because it would violate our content policies.

Seedream 4.0

12.7s

Score: 3 / 10

Nano Banana (2.5 Flash)

7.2s

Score: 3 / 10

Flux 2 Pro

18.2s

Score: 9 / 10

Seedream 4.5

17.7s

Score: 4 / 10

Z-Image Turbo

6.6s

Score: 3 / 10

Imagen 3.0

10.9s

Score: 3 / 10

Recraft V3

13.6s

Score: 4 / 10

MiniMax Image-01

42.4s

Score: 3 / 10

Grok 2 Image

11.9s

Score: 4 / 10

Midjourney V6.1

34.0s

Score: 3 / 10

DALL-E 3

24.7s

Score: 4 / 10

Seedream 3.0

8.5s

Score: 3 / 10

Ideogram V2

20.7s

Score: 4 / 10

Flux 1.1 Pro Ultra

14.2s

Score: 3 / 10

Midjourney v7

44.9s

Score: 5 / 10

Prompt:

Photorealistic aerial photograph clearly showing the edge of the Earth from space, capturing realistic curvature, atmosphere, and sunlight reflections

Description:

Photorealistic aerial photography style. Tests geographical accuracy, perspective realism, depiction of Earth's curvature, atmospheric layers, and space lighting.

Nano Banana Pro

14.4s

Score: 9 / 10

GPT Image 1.5

16.9s

Score: 8 / 10

Ideogram 3.0 (Quality)

13.3s

Score: 10 / 10

Reve Image (Halfmoon)

46.7s

Score: 9 / 10

Grok Imagine

3.3s

Score: 8 / 10

FLUX.1 Kontext Max

12.8s

Score: 9 / 10

Imagen 4.0 Ultra

9.6s

Score: 8 / 10

ChatGPT 4o

5.0s

Score: 8 / 10

Seedream 4.0

12.3s

Score: 7 / 10

Nano Banana (2.5 Flash)

8.8s

Score: 5 / 10

Flux 2 Pro

13.0s

Score: 9 / 10

Seedream 4.5

21.4s

Score: 6 / 10

Z-Image Turbo

7.3s

Score: 9 / 10

Imagen 3.0

11.0s

Score: 9 / 10

Recraft V3

8.7s

Score: 10 / 10

MiniMax Image-01

37.1s

Score: 6 / 10

Grok 2 Image

11.2s

Score: 7 / 10

Midjourney V6.1

56.7s

Score: 8 / 10

DALL-E 3

17.6s

Score: 6 / 10

Seedream 3.0

13.5s

Score: 4 / 10

Ideogram V2

21.1s

Score: 6 / 10

Flux 1.1 Pro Ultra

14.4s

Score: 6 / 10

Midjourney v7

44.6s

Score: 6 / 10

Prompt:

A famous cartoon character (e.g. Homer Simpson) rendered fully photorealistically as if a real human being, accurately preserving recognizable facial features and proportions

Description:

Photorealistic portrait style. Tests combining realism (human anatomy, skin textures) while preserving recognizable cartoon features/proportions.

Nano Banana Pro

16.7s

Score: 8 / 10

GPT Image 1.5

17.3s

Score: 4 / 10

Ideogram 3.0 (Quality)

11.4s

Score: 5 / 10

Reve Image (Halfmoon)

91.5s

Score: 5 / 10

Grok Imagine

4.3s

Score: 3 / 10

Generation failed

Image generation failed for replicate / black-forest-labs/flux-kontext-max (fallback disabled)

Imagen 4.0 Ultra

13.6s

Score: 4 / 10

ChatGPT 4o

5.0s

Score: 3 / 10

Seedream 4.0

12.6s

Score: 5 / 10

Nano Banana (2.5 Flash)

8.5s

Score: 2 / 10

Flux 2 Pro

17.7s

Score: 4 / 10

Seedream 4.5

15.1s

Score: 6 / 10

Z-Image Turbo

6.4s

Score: 5 / 10

Imagen 3.0

9.7s

Score: 5 / 10

Recraft V3

8.4s

Score: 4 / 10

MiniMax Image-01

43.3s

Score: 6 / 10

Grok 2 Image

11.8s

Score: 3 / 10

Midjourney V6.1

35.6s

Score: 6 / 10

DALL-E 3

20.3s

Score: 3 / 10

Seedream 3.0

8.4s

Score: 5 / 10

Ideogram V2

19.9s

Score: 3 / 10

Flux 1.1 Pro Ultra

12.6s

Score: 5 / 10

Midjourney v7

44.7s

Score: 5 / 10

Prompt:

Photorealistic image of a robot painting a realistic self-portrait (i.e. the robot) on canvas, mimicking Van Gogh’s art style; clear, realistic metallic textures and painting details visible

Description:

Photorealistic style with detailed textures. Tests artistic style emulation (Van Gogh), recursive creativity, realism of metallic textures, and believable painting action/details.

Nano Banana Pro

17.5s

Score: 9 / 10

GPT Image 1.5

36.4s

Score: 10 / 10

Ideogram 3.0 (Quality)

10.9s

Score: 6 / 10

Reve Image (Halfmoon)

9.1s

Score: 9 / 10

Grok Imagine

4.7s

Score: 9 / 10

FLUX.1 Kontext Max

12.7s

Score: 6 / 10

Imagen 4.0 Ultra

11.8s

Score: 9 / 10

ChatGPT 4o

5.0s

Score: 6 / 10

Seedream 4.0

13.7s

Score: 6 / 10

Nano Banana (2.5 Flash)

8.6s

Score: 6 / 10

Flux 2 Pro

13.1s

Score: 6 / 10

Seedream 4.5

27.6s

Score: 9 / 10

Z-Image Turbo

7.7s

Score: 5 / 10

Imagen 3.0

10.6s

Score: 6 / 10

Recraft V3

12.8s

Score: 6 / 10

MiniMax Image-01

36.4s

Score: 6 / 10

Grok 2 Image

13.0s

Score: 6 / 10

Midjourney V6.1

56.2s

Score: 6 / 10

DALL-E 3

20.0s

Score: 7 / 10

Seedream 3.0

7.9s

Score: 6 / 10

Ideogram V2

22.1s

Score: 6 / 10

Flux 1.1 Pro Ultra

14.2s

Score: 6 / 10

Midjourney v7

44.7s

Score: 3 / 10

Prompt:

Photorealistic depiction of a man wearing a clearly visible black OpenAI-branded T-shirt. He is standing at the front of a university lecture hall, writing complex mathematics and AI-related equations across a large, dusty chalkboard filled with notation

Description:

Photorealistic, corporate tech style. Tests realistic text/handwriting generation (equations), accurate human anatomy/collaboration poses, and clear clothing branding.

Nano Banana Pro

17.2s

Score: 8 / 10

GPT Image 1.5

39.0s

Score: 5 / 10

Ideogram 3.0 (Quality)

12.4s

Score: 9 / 10

Reve Image (Halfmoon)

47.9s

Score: 9 / 10

Grok Imagine

4.8s

Score: 9 / 10

FLUX.1 Kontext Max

14.8s

Score: 9 / 10

Imagen 4.0 Ultra

12.0s

Score: 5 / 10

ChatGPT 4o

5.0s

Score: 4 / 10

Seedream 4.0

13.6s

Score: 9 / 10

Nano Banana (2.5 Flash)

7.1s

Score: 8 / 10

Flux 2 Pro

17.2s

Score: 4 / 10

Seedream 4.5

20.4s

Score: 5 / 10

Z-Image Turbo

7.0s

Score: 8 / 10

Imagen 3.0

6.1s

Score: 6 / 10

Recraft V3

13.1s

Score: 6 / 10

MiniMax Image-01

31.1s

Score: 5 / 10

Grok 2 Image

12.3s

Score: 5 / 10

Midjourney V6.1

34.0s

Score: 4 / 10

DALL-E 3

20.1s

Score: 6 / 10

Seedream 3.0

14.3s

Score: 5 / 10

Ideogram V2

21.5s

Score: 6 / 10

Flux 1.1 Pro Ultra

13.8s

Score: 8 / 10

Midjourney v7

44.8s

Score: 4 / 10

Prompt:

Pixel art cityscape of San Francisco in the iconic SimCity 2000 style, isometric view, detailed skyscrapers, residential areas, clearly identifiable Golden Gate Bridge, Coit Tower, Transamerica Pyramid, surrounded by the classic SimCity 2000 UI elements

Description:

Pixel art, isometric, SimCity 2000 style. Tests detailed pixel-art accuracy, recognizable landmark rendering, and nostalgic game UI element replication.

Nano Banana Pro

24.3s

Score: 9 / 10

GPT Image 1.5

40.9s

Score: 10 / 10

Ideogram 3.0 (Quality)

10.7s

Score: 8 / 10

Reve Image (Halfmoon)

51.0s

Score: 4 / 10

Grok Imagine

5.2s

Score: 8 / 10

Generation failed

Image generation failed for replicate / black-forest-labs/flux-kontext-max (fallback disabled)

Imagen 4.0 Ultra

11.7s

Score: 9 / 10

ChatGPT 4o

5.0s

Score: 9 / 10

Seedream 4.0

13.7s

Score: 5 / 10

Nano Banana (2.5 Flash)

8.3s

Score: 9 / 10

Flux 2 Pro

18.3s

Score: 5 / 10

Seedream 4.5

20.6s

Score: 8 / 10

Z-Image Turbo

6.9s

Score: 4 / 10

Imagen 3.0

4.8s

Score: 5 / 10

Recraft V3

14.4s

Score: 4 / 10

MiniMax Image-01

29.7s

Score: 5 / 10

Grok 2 Image

11.7s

Score: 3 / 10

Midjourney V6.1

44.0s

Score: 6 / 10

DALL-E 3

21.1s

Score: 6 / 10

Seedream 3.0

14.0s

Score: 7 / 10

Ideogram V2

21.1s

Score: 4 / 10

Flux 1.1 Pro Ultra

13.9s

Score: 5 / 10

Midjourney v7

44.3s

Score: 5 / 10

Prompt:

Vintage Apple II computer with green monochrome CRT screen, displaying 'END OF WORLD PROCEDURES' in green text. Two external floppy drives stacked on the right, labeled disk II with rainbow Apple logos. Beige casing, black background, retro aesthetic.

Description:

Authentic green CRT glow, precise vintage Apple II details, accurate floppy drive labels and logos, realistic retro textures, correct typography and screen curvature

Nano Banana Pro

16.5s

Score: 9 / 10

GPT Image 1.5

32.9s

Score: 8 / 10

Ideogram 3.0 (Quality)

11.9s

Score: 6 / 10

Reve Image (Halfmoon)

8.1s

Score: 8 / 10

Grok Imagine

3.8s

Score: 6 / 10

Generation failed

Image generation failed for replicate / black-forest-labs/flux-kontext-max (fallback disabled)

Imagen 4.0 Ultra

10.9s

Score: 6 / 10

ChatGPT 4o

5.0s

Score: 6 / 10

Seedream 4.0

14.6s

Score: 9 / 10

Nano Banana (2.5 Flash)

7.0s

Score: 8 / 10

Flux 2 Pro

11.9s

Score: 9 / 10

Seedream 4.5

17.9s

Score: 5 / 10

Z-Image Turbo

6.6s

Score: 6 / 10

Imagen 3.0

6.0s

Score: 5 / 10

Recraft V3

14.5s

Score: 6 / 10

MiniMax Image-01

36.5s

Score: 3 / 10

Grok 2 Image

11.5s

Score: 4 / 10

Midjourney V6.1

56.7s

Score: 2 / 10

DALL-E 3

19.2s

Score: 7 / 10

Seedream 3.0

14.1s

Score: 5 / 10

Ideogram V2

20.5s

Score: 7 / 10

Flux 1.1 Pro Ultra

13.8s

Score: 5 / 10

Midjourney v7

44.6s

Score: 3 / 10

Prompt:

Photorealistic close-up portrait of a person clearly performing the American Sign Language gesture for "thank you," hand positioned visibly in front of the chest, clear expression on face, neutral background

Description:

Photorealistic portrait style. Tests anatomical correctness, accuracy of a specific ASL gesture, clear facial expression, and communicative context.

Nano Banana Pro

20.7s

Score: 9 / 10

GPT Image 1.5

15.7s

Score: 6 / 10

Ideogram 3.0 (Quality)

11.4s

Score: 5 / 10

Reve Image (Halfmoon)

42.6s

Score: 4 / 10

Grok Imagine

3.4s

Score: 4 / 10

FLUX.1 Kontext Max

13.8s

Score: 5 / 10

Imagen 4.0 Ultra

11.8s

Score: 4 / 10

ChatGPT 4o

5.0s

Score: 9 / 10

Seedream 4.0

14.1s

Score: 4 / 10

Nano Banana (2.5 Flash)

6.9s

Score: 5 / 10

Flux 2 Pro

18.3s

Score: 4 / 10

Seedream 4.5

14.8s

Score: 4 / 10

Z-Image Turbo

6.6s

Score: 2 / 10

Imagen 3.0

5.5s

Score: 5 / 10

Recraft V3

13.2s

Score: 4 / 10

MiniMax Image-01

30.4s

Score: 5 / 10

Grok 2 Image

11.7s

Score: 4 / 10

Midjourney V6.1

44.8s

Score: 2 / 10

DALL-E 3

17.1s

Score: 3 / 10

Seedream 3.0

8.0s

Score: 4 / 10

Ideogram V2

20.1s

Score: 3 / 10

Flux 1.1 Pro Ultra

19.9s

Score: 3 / 10

Midjourney v7

44.5s

Score: 4 / 10

Prompt:

Photorealistic daytime street photograph, clearly showing a man standing still on a busy urban street corner holding a rectangular cardboard sign clearly facing camera, handwritten bold black marker text clearly readable as "AGI has arrived!", background blurred with realistic pedestrians and cityscape

Description:

Photorealistic, urban photography style. Tests text clarity/readability within an image, realistic handwriting, depth-of-field effects, and compositional coherence in a dynamic scene.

Nano Banana Pro

19.4s

Score: 10 / 10

GPT Image 1.5

36.7s

Score: 9 / 10

Ideogram 3.0 (Quality)

12.1s

Score: 9 / 10

Reve Image (Halfmoon)

9.2s

Score: 7 / 10

Grok Imagine

12.9s

Score: 7 / 10

FLUX.1 Kontext Max

14.0s

Score: 8 / 10

Imagen 4.0 Ultra

10.2s

Score: 9 / 10

ChatGPT 4o

5.0s

Score: 9 / 10

Seedream 4.0

13.7s

Score: 9 / 10

Nano Banana (2.5 Flash)

8.8s

Score: 8 / 10

Flux 2 Pro

13.3s

Score: 7 / 10

Seedream 4.5

12.8s

Score: 8 / 10

Z-Image Turbo

6.7s

Score: 9 / 10

Imagen 3.0

5.5s

Score: 9 / 10

Recraft V3

13.8s

Score: 9 / 10

MiniMax Image-01

31.6s

Score: 8 / 10

Grok 2 Image

12.3s

Score: 9 / 10

Midjourney V6.1

33.7s

Score: 9 / 10

DALL-E 3

18.1s

Score: 6 / 10

Seedream 3.0

13.5s

Score: 5 / 10

Ideogram V2

21.0s

Score: 6 / 10

Flux 1.1 Pro Ultra

13.2s

Score: 5 / 10

Midjourney v7

45.6s

Score: 3 / 10

Summary for Ultra Hard

The Ultra Hard category lived up to its name, serving as a brutal stress test for modern AI models. While many models excel at standard photorealism, this dataset revealed significant gaps in logical reasoning and spatial intelligence.

Key Findings

Logic is the new frontier: The prompt A realistic astronaut being ridden by a horse caused a massive failure rate. Most models ignored the syntax and defaulted to the common trope of a human riding a horse. Only a select few, like Nano Banana Pro and Flux 2 Pro, successfully reversed the roles.
Text is solving itself: Models like GPT Image 1.5 and Ideogram 3.0 (Quality) are now handling complex text integration (handwriting on cardboard, chalkboards) with near-perfect accuracy.
Anatomy remains tricky: The ASL 'thank you' gesture stumped almost everyone, with models confusing it for "thinking" poses or the "OK" sign. Nano Banana Pro and ChatGPT 4o were rare successes here.

Top Performers

🏆 GPT Image 1.5: Demonstrated the highest consistency across logic, text, and texture.
🥈 Nano Banana Pro: Showed surprising ingenuity in interpreting difficult logical prompts.
🥉 ChatGPT 4o: Excellent at stylistic mimicry (SimCity) and general adherence.

Deep Dive: Breaking the Models

This dataset highlighted three distinct "intelligence gaps" in current image generation technology.

1. The "Training Data Inertia" Problem

Models struggle to generate images that contradict their training frequency.

The Test: A realistic astronaut being ridden by a horse.
The Result: Models like DALL-E 3, Midjourney V6.1, and Recraft V3 failed completely, generating an astronaut riding a horse. They prioritized the statistical correlation of "riding" over the grammatical structure of the prompt.
The Exception: Nano Banana Pro not only followed the prompt but creatively solved the physics by putting the horse in a space suit, earning a high realism score.

2. The Style Transfer Trap

When asked to make a cartoon character "photorealistic as a real human," most models fail to abandon the cartoon's color palette.

The Test: Homer Simpson photorealistic.
The Result: Most models (e.g., Flux 1.1 Pro Ultra, Ideogram V2) simply created a high-res 3D render of a yellow character.
The Exception: Nano Banana Pro successfully translated the character into human skin tones while keeping the features recognizable.

3. Text & Interface Accuracy

Generating UI elements and specific text styles remains a hurdle.

The Test: Pixel art cityscape... SimCity 2000 style.
The Result: Many models ignored the "UI elements" instruction or generated gibberish text.
The Exception: ChatGPT 4o and GPT Image 1.5 generated perfect replicas of the game interface, including legible menu text.

Model Recommendations by Scenario

Based on the performance in the Ultra Hard category, here are the best models for specific high-difficulty tasks:

🧠 Best for Complex Logic & Reasoning

Winner: GPT Image 1.5

Why: It actually "reads" the prompt. Whether it's a robot painting a self-portrait or a horse riding a human, this model adheres to the sentence structure rather than just keywords.
Alternative: Nano Banana Pro (Excellent at interpreting physical interactions in absurd scenarios).

✍️ Best for Text Integration

Winner: Ideogram 3.0 (Quality)

Why: consistently handles chalkboard equations and cardboard signs without spelling errors.
See: OpenAI-branded T-shirt for clear text handling.

🎨 Best for Stylized & Retro Art

Winner: ChatGPT 4o

Why: It nailed the SimCity 2000 prompt, perfectly replicating the pixel art style, isometric view, and specific UI layout where others failed.

📷 Best for High-Fidelity Photorealism

Winner: Recraft V3

Why: While it struggled with logic, its texture work on the Edge of the Earth prompt was rated a perfect 10 for looking like a genuine ISS photograph.
Alternative: Reve Image (Halfmoon) also scored highly on texture-heavy prompts like the Singapore Hawker.

AI Image Battle Gallery

Summary for Ultra Hard

Key Findings

Top Performers

Deep Dive: Breaking the Models

1. The "Training Data Inertia" Problem

2. The Style Transfer Trap

3. Text & Interface Accuracy

Model Recommendations by Scenario

🧠 Best for Complex Logic & Reasoning

✍️ Best for Text Integration

🎨 Best for Stylized & Retro Art

📷 Best for High-Fidelity Photorealism

Image Evaluation