97+ MODELS LIVE — TRY THE AUDIO LAB
STUDIO
Imagewan2.7wan2.7-text-to-image

Wan2.7 Text To Image

Alibaba WAN 2.7 Text-to-Image generates high-quality images from text prompts with thinking mode for enhanced image quality.

Wan guideVerified

Alibaba's open-source video model line (Wan 2.1 → 2.7). Strong prompt adherence; the open-source pedigree means heavy community use and well-documented prompting patterns.

Strengths
  • Best-in-class prompt adherence — does what you ask, not what it thinks you want.
  • Wide variant family covers most needs (T2V, I2V, reference, motion control, lipsync).
  • Wan 2.5 and 2.6 catch up to closed-source quality at lower cost.
  • Wan 2.2 Spicy variants for adult creative work.
Weaknesses
  • Older versions (2.1, 2.2) look dated next to current flagship.
  • Stylization quality lags behind Kling and Hailuo.
Best for
  • Precise prompt-driven scene construction
  • Hybrid pipelines where Wan does the heavy lifting and another model polishes
Prompting tips
  • Treat Wan like a brief — itemize what's in frame, the action, the camera.
  • Wan does NOT need flowery language; plain descriptive prose works better.
Parameters
  • aspect_ratio
    string
    The aspect ratio of the generated image
    1:14:33:416:99:1621:99:213:22:3
    default: 1:1
  • thinking_mode
    boolean
    Enable thinking mode for enhanced reasoning and better image quality. Increases generation time.
    default: true
You'll need
  • A text prompt
Try now

More from wan2.7