Image → Videogrokgrok-imagine-image-to-video
Grok Imagine Image To Video
Grok Imagine is xAI’s multimodal image-to-video model, capable of animating still images into cinematic videos from 6 to 30 seconds with synchronized ambient audio. It focuses on realism, fluid motion, and expressive lighting transitions while maintaining high generation speed.
Grok guideVerified
xAI's Grok Imagine. Newer entrant with a distinct humour-forward aesthetic. Useful for meme work and pop-culture-aware content.
Strengths
- Distinctive aesthetic — feels different from the Veo / Kling crowd.
- Good at irreverent / humorous beats.
- Generous on what it'll generate (less restrictive than Veo).
Weaknesses
- Quality is a tier below the established flagships.
- Documentation is sparse — prompting is more trial-and-error.
Parameters
- aspect_ratiostringAspect ratio of the generated video.auto16:99:16default: auto
- durationintVideo duration in seconds.default: 6range: 1 … 15
- image_urlstringURL of the input image for video generation.
- resolutionstringResolution of the output video.480p720pdefault: 720p
More from grok
Grok Imagine Text To Image
grok-imagine-text-to-image
Grok Imagine Text To Image Quality
grok-imagine-text-to-image-quality
Grok Imagine Image To Image
grok-imagine-image-to-image
Grok Imagine Edit
grok-imagine-image-edit
Grok Imagine Pro Edit
grok-imagine-image-edit-quality
Grok Imagine Extend
grok-imagine-extend