Scores
Physics: 40.0
Prompt Adherence: 67.0
Animation
2D Animation: 72.0
3D Animation: 85.0
Anime Animation: 61.0
Cinematography: 61.0
Taste: 0.0
Capabilities: 0.0
Human: 66.0
Hands: 60.0
Animals: 54.0
Objects: 55.0
Logic + Consistency: 67.0
Scene Consistency: 55.0
Text Fidelity: 69.0
Actor Performance: 64.0
Total Score: 59.0
Evaluation Summary
Good for
- Value/cost ratio
- Animation
- Prompt adherence
Bad for
- Physics accuracy
- Rapid motion (stuttering, inconsistent speed)
Summary
Grok Imagine is probably the best-value model if you're not paying through the API. It scores well on prompt adherence (67.0) and is one of the top animation models, with 85.0 in 3D animation and 72.0 in 2D. Given xAI's generous tier, it holds its own against frontier models like Veo 3 and Kling at a fraction of the cost. It can still struggle with physics (40.0), and rapid motion sometimes shows stuttering or inconsistent speed-ups and slow-downs.
Examples
A pianist's hands playing Chopin—arched wrists, curved fingers, striking keys with controlled force. We see both hands in frame, moving independently.
A futuristic city skyline at sunset, cinematic drone shot
Macro shot: a tattoo artist's gloved hands at work, needle buzzing against skin, other hand stretching the canvas taut. Ink, blood, precision. Five fingers per hand, always.
A woman practices yoga at sunrise on a cliff overlooking the ocean, flowing from warrior pose into a deep lunge. Wind catches her hair.
a man sits next to a woman at a bar they both sip at their drinks. They both exchange a flirty glance and smile.
Luxury car commercial: chrome, carbon fiber, leather interior. The camera caresses every surface. Reflections are accurate. The badge gleams.
Single continuous shot at a bright outdoor skatepark. A female skater in a red beanie and black hoodie rolls toward camera on a skateboard with a bold checkerboard deck graphic. The camera tracks alongside her smoothly. She performs one clean kickflip and lands it, continuing forward. The beanie, hoodie, and checkerboard graphic remain stable without flicker, and the board does not morph mid-air.
Action scene, cinematic and dynamic: A female lead in a dark tactical jacket sprints through a rain-soaked museum hall at night. The hall has three distinct landmarks: (1) a huge dinosaur skeleton on the left, (2) a glass display case with a glowing blue gem centered in the background, and (3) a marble staircase on the right.
Sports anime climax Japanese anime style: a female beach volleyball player performs a slow motion volleyball spike. The rotation of the ball. Fingers stretching. The opponent's pupils dilating.



