OmniRT supported models
This page is generated from the live registry by scripts/generate_models_doc.py.
Update the registry, not this file.
Models are organized by the digital-human production chain rather than by a generic multimodal taxonomy.
Core avatar rendering
Registry id
Task
Tier
Maturity
Realtime
Summary
longcat-video-avatar-1.5
audio2video
core
beta
no
LongCat-Video-Avatar 1.5 audio-driven avatar generation via external CUDA or Ascend checkouts.
soulx-flashhead-1.3b
audio2video
core
beta
no
SoulX-FlashHead low-latency talking-head generation via image plus audio.
soulx-flashtalk-14b
audio2video
core
beta
yes
SoulX-FlashTalk talking-head avatar generation via image plus audio on CUDA or Ascend.
soulx-liveact-14b
audio2video
core
beta
no
SoulX-LiveAct long-form audio-driven avatar video generation on Ascend.
Voice generation
Registry id
Task
Tier
Maturity
Realtime
Summary
cosyvoice3-triton-trtllm
text2audio
core
beta
no
CosyVoice3 text-to-audio generation through a Triton-compatible service endpoint.
indextts
text2audio
adjacent
beta
yes
IndexTTS-2 resident text-to-audio service for OpenTalking TTS, with segment streaming and experimental token-window streaming through serve-text2audio.
soulx-podcast-1.7b
text2audio
core
beta
no
SoulX-Podcast text-to-audio generation through a FastAPI service endpoint.
Voice understanding roadmap
Registry id
Task
Tier
Maturity
Realtime
Summary
sensevoice-small
audio2text
core
beta
no
SenseVoice offline audio transcription for digital-human voice understanding.
Avatar asset generation
Registry id
Task
Tier
Maturity
Realtime
Summary
chronoedit
edit
experimental
beta
no
ChronoEdit physically-consistent image editing pipeline.
flux-canny
edit
adjacent
beta
no
Flux canny-guided structured image generation pipeline.
flux-depth
edit
adjacent
beta
no
Flux depth-guided structured image generation pipeline.
flux-kontext
edit
adjacent
beta
no
Flux Kontext image editing pipeline.
qwen-image-edit
edit
adjacent
beta
no
Qwen-Image single-image editing pipeline.
qwen-image-edit-plus
edit
adjacent
beta
no
Qwen-Image multi-reference editing pipeline.
qwen-image-layered
edit
adjacent
beta
no
Qwen-Image layered decomposition pipeline.
sdxl-refiner-1.0
image2image
adjacent
beta
no
SDXL refiner image-to-image pipeline for second-stage refinement passes.
flux-fill
inpaint
adjacent
beta
no
Flux Fill inpainting and outpainting pipeline.
bria-3.2
text2image
experimental
beta
no
Bria 3.2 commercial-ready text-to-image pipeline.
flux-dev
text2image
adjacent
stable
no
Flux 1 dev text-to-image pipeline.
flux-schnell
text2image
experimental
stable
no
Flux 1 schnell low-step text-to-image pipeline.
flux2.dev
text2image
adjacent
beta
no
Flux 2 dev text-to-image pipeline.
glm-image
text2image
experimental
beta
no
GLM-Image instruction-following text-to-image pipeline.
hidream-i1
text2image
experimental
beta
no
HiDream-I1 modern text-to-image pipeline.
hunyuan-image-2.1
text2image
experimental
beta
no
Hunyuan Image 2.1 text-to-image pipeline.
kolors
text2image
experimental
beta
no
Kolors multilingual text-to-image pipeline.
lumina-t2x
text2image
experimental
beta
no
Lumina-T2X text-to-image pipeline via the LuminaPipeline runtime.
omnigen
text2image
experimental
beta
no
OmniGen text-to-image generation path.
ovis-image
text2image
experimental
beta
no
Ovis-Image text-heavy generation pipeline.
pixart-sigma
text2image
experimental
beta
no
PixArt-Sigma high-resolution text-to-image pipeline.
qwen-image
text2image
adjacent
beta
no
Qwen-Image multilingual text-to-image pipeline.
sana-1.6b
text2image
experimental
beta
no
Sana 1.6B efficient text-to-image pipeline.
sd15
text2image
experimental
beta
no
Stable Diffusion 1.5 baseline text-to-image pipeline.
sd21
text2image
experimental
beta
no
Stable Diffusion 2.1 text-to-image pipeline.
sd3-medium
text2image
experimental
beta
no
Stable Diffusion 3 Medium text-to-image pipeline.
sd3.5-large
text2image
experimental
beta
no
Stable Diffusion 3.5 Large text-to-image pipeline.
sd3.5-large-turbo
text2image
experimental
beta
no
Stable Diffusion 3.5 Large Turbo text-to-image pipeline.
sdxl-base-1.0
text2image
adjacent
stable
no
SDXL base text-to-image pipeline with LoRA support.
sdxl-turbo
text2image
experimental
beta
no
SDXL Turbo low-latency text-to-image pipeline.
Video and idle assets
Registry id
Task
Tier
Maturity
Realtime
Summary
helios-i2v
image2video
experimental
beta
no
Helios image-to-video pipeline.
hunyuan-video-1.5-i2v
image2video
experimental
beta
no
HunyuanVideo 1.5 image-to-video pipeline.
kandinsky5-i2v
image2video
experimental
beta
no
Kandinsky 5 Pro image-to-video pipeline.
ltx2-i2v
image2video
experimental
beta
no
LTX image-to-video pipeline.
svd
image2video
adjacent
stable
no
Stable Video Diffusion base image-to-video pipeline.
svd-xt
image2video
adjacent
stable
no
Stable Video Diffusion XT image-to-video pipeline.
wan2.1-i2v-14b
image2video
experimental
beta
no
Wan 2.1 image-to-video pipeline.
wan2.2-i2v-14b
image2video
adjacent
beta
no
Wan 2.2 image-to-video pipeline.
animate-diff-sdxl
text2video
adjacent
beta
no
AnimateDiff SDXL text-to-video pipeline.
cogvideox-2b
text2video
experimental
beta
no
CogVideoX 2B text-to-video pipeline.
cogvideox-5b
text2video
experimental
beta
no
CogVideoX 5B text-to-video pipeline.
helios-t2v
text2video
experimental
beta
no
Helios text-to-video pipeline.
hunyuan-video
text2video
experimental
beta
no
HunyuanVideo text-to-video pipeline.
hunyuan-video-1.5-t2v
text2video
experimental
beta
no
HunyuanVideo 1.5 text-to-video pipeline.
kandinsky5-t2v
text2video
experimental
beta
no
Kandinsky 5 Pro text-to-video pipeline.
ltx-video
text2video
experimental
beta
no
LTX-Video text-to-video pipeline.
mochi
text2video
experimental
beta
no
Mochi text-to-video pipeline.
sana-video
text2video
experimental
beta
no
Sana-Video efficient text-to-video pipeline.
skyreels-v2
text2video
experimental
beta
no
SkyReels-V2 text-to-video pipeline.
wan2.1-t2v-14b
text2video
experimental
beta
no
Wan 2.1 text-to-video pipeline.
wan2.2-t2v-14b
text2video
adjacent
beta
no
Wan 2.2 text-to-video pipeline.
Aliases
Alias
Canonical id
flux2-dev
flux2.dev