xai

Grok Imagine 1.0

xAI's Grok Imagine 1.0 text-to-video model focused on exploratory creative generation and rapid concept iteration.
Rank: 5
Cost: $0.00/min
Total Score: 59

Scores

Physics: 40.0
Prompt Adherence: 67.0
Animation: 72.6
2D Animation: 72.0
3D Animation: 85.0
Anime Animation: 61.0
Cinematography: 61.0
Human: 66.0
Hands: 60.0
Animals: 54.0
Objects: 55.0
Logic + Consistency: 67.0
Scene Consistency: 55.0
Text Fidelity: 69.0
Actor Performance: 64.0
Total Score: 59.0

Evaluation Summary

Good for
  • Value/cost ratio
  • Animation
  • Prompt adherence
Bad for
  • Physics accuracy
  • Rapid motion (stuttering, inconsistent speed)
Summary

Grok Imagine is probably the best-value model if you are not paying through the API. It scores well on prompt adherence (67.0) and was one of the top animation models (72.6). Given xAI's generous tier, it holds its own against frontier models such as Veo 3 and Kling at a fraction of the cost. It can still struggle with physics (40.0), and in rapid motion you sometimes see stuttering or inconsistent speed-ups and slow-downs.

Examples

A pianist's hands playing Chopin—arched wrists, curved fingers, striking keys with controlled force. We see both hands in frame, moving independently.
A futuristic city skyline at sunset, cinematic drone shot
Macro shot: a tattoo artist's gloved hands at work, needle buzzing against skin, other hand stretching the canvas taut. Ink, blood, precision. Five fingers per hand, always.
A woman practices yoga at sunrise on a cliff overlooking the ocean, flowing from warrior pose into a deep lunge. Wind catches her hair.
a man sits next to a woman at a bar they both sip at their drinks. They both exchange a flirty glance and smile.
Luxury car commercial: chrome, carbon fiber, leather interior. The camera caresses every surface. Reflections are accurate. The badge gleams.
Single continuous shot at a bright outdoor skatepark. A female skater in a red beanie and black hoodie rolls toward camera on a skateboard with a bold checkerboard deck graphic. The camera tracks alongside her smoothly. She performs one clean kickflip and lands it, continuing forward. The beanie, hoodie, and checkerboard graphic remain stable without flicker, and the board does not morph mid-air.
Action scene, cinematic and dynamic: A female lead in a dark tactical jacket sprints through a rain-soaked museum hall at night. The hall has three distinct landmarks: (1) a huge dinosaur skeleton on the left, (2) a glass display case with a glowing blue gem centered in the background, and (3) a marble staircase on the right.
Sports anime climax japanese anime style: a female beach volleyball player performs a slow motion volleyball spike. The rotation of the ball. Fingers stretching. The opponent's pupils dilating.

Compare Models

Seedance 1.5 Pro: Score 53, Cost/min $12.00
Sora 2 Pro: Score 50, Cost/min $12.00
Veo 2 (Google): Score 55, Cost/min $30.00
Veo 3 (Google): Score 60, Cost/min $12.00
Veo 3 Fast: Score 48, Cost/min $6.00
All Comparison Pages
Rank | Model | Provider | Score | Compare Page
#1 | Seedance 2.0 Pro (Top 5) | ByteDance | 73 | Grok Imagine 1.0 vs Seedance 2.0 Pro
#2 | Kling 3 Pro (Top 5) | Kling | 62 | Grok Imagine 1.0 vs Kling 3 Pro
#3 | Kling 2.6 (Top 5) | Kling | 60 | Grok Imagine 1.0 vs Kling 2.6
#3 | Veo 3 (Top 5) | Google | 60 | Grok Imagine 1.0 vs Veo 3
#6 | Veo 3.1 | Google | 57 | Grok Imagine 1.0 vs Veo 3.1
#7 | PixVerse v5.5 | PixVerse | 55 | Grok Imagine 1.0 vs PixVerse v5.5
#7 | Veo 2 | Google | 55 | Grok Imagine 1.0 vs Veo 2
#9 | Grok 2025 | xAI | 54 | Grok Imagine 1.0 vs Grok 2025
#10 | Seedance 1.5 Pro | ByteDance | 53 | Grok Imagine 1.0 vs Seedance 1.5 Pro
#11 | Veo 3.1 Fast | Google | 52 | Grok Imagine 1.0 vs Veo 3.1 Fast
#12 | Sora 2 Pro | OpenAI | 50 | Grok Imagine 1.0 vs Sora 2 Pro
#13 | Veo 3 Fast | Google | 48 | Grok Imagine 1.0 vs Veo 3 Fast
#14 | Vidu Q2 | Vidu | 45 | Grok Imagine 1.0 vs Vidu Q2
#15 | LTX-2 19B | Lightricks | 43 | Grok Imagine 1.0 vs LTX-2 19B
#16 | Gen-4.5 | Runway | 40 | Grok Imagine 1.0 vs Gen-4.5
#17 | Pika v2.2 Text-to-Video | Pika | 26 | Grok Imagine 1.0 vs Pika v2.2 Text-to-Video
#18 | Infinity Star | FoundationVision | 25 | Grok Imagine 1.0 vs Infinity Star