Text to Video¶
Generate an MP4 from a prompt. OmniRT wraps video tasks in the same request contract as image tasks; the artifact is always exported via imageio-ffmpeg.
Minimal example¶
Key parameters¶
| Parameter | Type | Default | Notes |
|---|---|---|---|
prompt |
str |
required | text prompt |
num_frames |
int? |
model default | typically 81 for Wan2.2, 49 for CogVideoX, 129 for Hunyuan |
fps |
int? |
model default | output frames per second |
negative_prompt |
str? |
None |
negative prompt (if supported) |
preset |
fast/balanced/quality/low-vram |
balanced |
see Presets |
num_inference_steps |
int? |
preset | explicit denoise override |
guidance_scale |
float? |
preset | CFG |
width / height |
int? |
model default | video models impose strict aspect / divisor constraints |
seed |
int? |
random | reproducibility |
Supported models¶
- High-quality:
wan2.2-t2v-14b(24 GB+),hunyuan-video(48 GB+) - Mid-tier:
cogvideox-2b,cogvideox-5b - Experimental / roadmap: see Roadmap
Full list: omnirt models --task text2video.
Common recipes¶
Troubleshooting¶
Warning
- MP4 encode fails — make sure
imageio-ffmpegis installed via runtime extras (pip install '.[runtime]') - OOM — video is the most memory-intensive task; drop
num_framesfirst, thenwidth/height, thenpreset=low-vram - Temporal flicker / inconsistency — raise
num_inference_stepsor switch topreset=quality; some models are scheduler-sensitive (see Architecture) - Ascend falls back to eager on video models — expected; recorded in
RunReport.backend_timeline. See Ascend Backend.