Grok Imagine Text To Video

Grok Imagine is xAI’s fast, creative text-to-video model that generates cinematic clips from 6 to 30 seconds with smooth motion, expressive lighting, and ambient audio. It turns a written idea into a visually rich video.

Grok guide (Verified)

xAI's Grok Imagine. Newer entrant with a distinct humour-forward aesthetic. Useful for meme work and pop-culture-aware content.

Strengths
  • Distinctive aesthetic — feels different from the Veo / Kling crowd.
  • Good at irreverent / humorous beats.
  • Generous on what it'll generate (less restrictive than Veo).
Weaknesses
  • Quality is a tier below the established flagships.
  • Documentation is sparse — prompting is more trial-and-error.
Parameters
  • aspect_ratio
    string
    Aspect ratio of the generated video.
    options: 16:9, 4:3, 3:2, 1:1, 2:3, 3:4, 9:16
    default: 16:9
  • duration
    int
    Video duration in seconds.
    default: 6
    range: 1 to 15
  • resolution
    string
    Resolution of the output video.
    options: 480p, 720p
    default: 720p
You'll need
  • A text prompt
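
The parameters and defaults listed above can be checked before a request is sent. The following is a minimal sketch, not the service's actual client: the helper name `build_video_request` and the shape of the returned payload are assumptions made for illustration, and the duration range is assumed to be inclusive at both ends. Only the parameter names, allowed values, and defaults come from this page.

```python
from typing import Any

# Allowed values and defaults copied from the parameter list on this page.
ASPECT_RATIOS = ("16:9", "4:3", "3:2", "1:1", "2:3", "3:4", "9:16")
RESOLUTIONS = ("480p", "720p")
DURATION_RANGE = (1, 15)  # seconds; assumed inclusive at both ends


def build_video_request(
    prompt: str,
    aspect_ratio: str = "16:9",
    duration: int = 6,
    resolution: str = "720p",
) -> dict[str, Any]:
    """Validate the inputs and return a payload dict.

    Hypothetical helper: the real request format is not documented here.
    """
    if not isinstance(prompt, str) or not prompt.strip():
        raise ValueError("a non-empty text prompt is required")
    if aspect_ratio not in ASPECT_RATIOS:
        raise ValueError(f"aspect_ratio must be one of {ASPECT_RATIOS}")
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(duration, bool) or not isinstance(duration, int):
        raise TypeError("duration must be an int (seconds)")
    low, high = DURATION_RANGE
    if not low <= duration <= high:
        raise ValueError(f"duration must be between {low} and {high} seconds")
    if resolution not in RESOLUTIONS:
        raise ValueError(f"resolution must be one of {RESOLUTIONS}")
    return {
        "prompt": prompt.strip(),
        "aspect_ratio": aspect_ratio,
        "duration": duration,
        "resolution": resolution,
    }
```

Calling `build_video_request("a raccoon DJ at a rooftop party", aspect_ratio="9:16", duration=8)` would return a dict with those values plus the default `720p` resolution, while a duration outside the listed range raises `ValueError` before anything is sent.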