Image → Video / vidu-q2 / vidu-q2-reference-to-video-pro
Vidu Q2 Pro (Reference → Video)
Vidu Q2 Pro reference-to-video. Takes up to 7 reference images (or 4 images + 2 reference videos) plus a prompt to drive the scene. Pricing is a flat per-video rate at 540p and 720p; 1080p is billed per second.
Vidu guide (Verified)
Shengshu Tech's video model. Reference-to-video pioneer — designed around blending multiple input images into a consistent moving subject.
Strengths
- Best multi-image reference-to-video in the field: multiple references (up to 7 on this endpoint) blend cleanly.
- Subject consistency across shots when references are well-chosen.
- Strong start-and-end-frame mode (vidu-q3-pro-first-last-frames).
Weaknesses
- Pure text-to-video quality is mid-pack.
- Camera motion is less controllable than Runway / Kling.
Best for
- Character / costume consistency across multiple shots
- Product variations — same item shot from different angles
- Start+end frame animations where you control both poles
Prompting tips
- Provide reference images that match the desired LIGHTING and ANGLE — Vidu blends literally.
- Describe each @image in the prompt ("@image1 from above", "@image2 close-up") for precise blending.
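The tagging tip above can be sketched as a small prompt builder. This is an illustration only: `build_prompt` is a hypothetical helper, and it assumes that `@image1`, `@image2`, ... are numbered from 1 in the same order as the entries in `reference_image_urls`, which the page does not state explicitly.

```python
def build_prompt(scene: str, image_notes: list[str]) -> str:
    """Compose a prompt that describes each reference image by its @imageN tag.

    Hypothetical helper. Assumes @imageN is 1-based and follows the order
    of the reference images sent with the request.
    """
    if not image_notes:
        raise ValueError("provide a note for at least one reference image")
    # Pair every note with its tag: "@image1 from above", "@image2 close-up", ...
    tagged = [f"@image{i} {note}" for i, note in enumerate(image_notes, start=1)]
    return f"{scene} References: {'; '.join(tagged)}."


prompt = build_prompt(
    "A chef plates a dessert in a sunlit kitchen.",
    ["from above", "close-up"],
)
print(prompt)
```

Keeping one note per image makes it easy to check that every reference is described and that the lighting and angle wording matches the images actually supplied.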
Parameters
- reference_image_urls (array): Up to 7 reference images (4 if reference videos are also used).
- reference_video_urls (array): Up to 2 reference videos for motion / editing.
- duration (int): No description. Default: 4. Range: 1 … 8.
- resolution (string): No description. Options: 540p, 720p, 1080p. Default: 720p.
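As a rough sketch, the limits listed above could be checked on the client before a request is sent. The parameter names, defaults, and ranges come from this page; the `prompt` field name, the `build_payload` and `billing_mode` helpers, and the overall payload shape are assumptions for illustration, not the documented API.

```python
MAX_IMAGES = 7              # image limit when no reference videos are used
MAX_IMAGES_WITH_VIDEO = 4   # image limit when reference videos are also used
MAX_VIDEOS = 2
RESOLUTIONS = ("540p", "720p", "1080p")


def build_payload(prompt, reference_image_urls, reference_video_urls=(),
                  duration=4, resolution="720p"):
    """Validate the inputs against the listed limits and return a payload dict.

    Hypothetical helper: the "prompt" key is an assumed field name.
    """
    images = list(reference_image_urls)
    videos = list(reference_video_urls)
    if len(videos) > MAX_VIDEOS:
        raise ValueError(f"at most {MAX_VIDEOS} reference videos are allowed")
    limit = MAX_IMAGES_WITH_VIDEO if videos else MAX_IMAGES
    if len(images) > limit:
        raise ValueError(f"at most {limit} reference images are allowed here")
    if not 1 <= duration <= 8:
        raise ValueError("duration must be in the range 1 to 8")
    if resolution not in RESOLUTIONS:
        raise ValueError(f"resolution must be one of {RESOLUTIONS}")
    payload = {
        "prompt": prompt,
        "reference_image_urls": images,
        "duration": duration,
        "resolution": resolution,
    }
    if videos:
        payload["reference_video_urls"] = videos
    return payload


def billing_mode(resolution):
    """Return which pricing rule applies: flat at 540p/720p, per second at 1080p."""
    return "per_second" if resolution == "1080p" else "flat_per_video"


payload = build_payload("A dancer spins.", ["https://example.com/a.png"])
print(payload, billing_mode(payload["resolution"]))
```

Failing early on the client avoids spending a request on a combination the endpoint would reject, such as 5 images together with a reference video.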
More from vidu-q2
Vidu Q2 Text To Image
vidu-q2-text-to-image
Vidu Q2 Reference To Image
vidu-q2-reference-to-image
Vidu Q2 Pro Text To Video
vidu-q2-pro-text-to-video
Vidu Q2 Turbo Text To Video
vidu-q2-turbo-text-to-video
Vidu Q2 Pro Image To Video
vidu-q2-pro-image-to-video
Vidu Q2 Pro Start End Video
vidu-q2-pro-start-end-video