Model Onboarding¶

This guide is the shortest path for onboarding a new Diffusers-style model into omnirt.

1. Write the pipeline class¶

Create src/omnirt/models/<model_name>/pipeline.py, subclass BasePipeline, and implement the five stages:

from omnirt.core.base_pipeline import BasePipeline
from omnirt.core.registry import ModelCapabilities, register_model


@register_model(
    id="my-model",
    task="text2image",
    default_backend="auto",
    resource_hint={"min_vram_gb": 12, "dtype": "fp16"},
    capabilities=ModelCapabilities(
        required_inputs=("prompt",),
        optional_inputs=("negative_prompt",),
        supported_config=(
            "model_path", "scheduler", "height", "width",
            "num_inference_steps", "guidance_scale", "seed", "dtype", "output_dir",
        ),
        default_config={
            "scheduler": "euler-discrete",
            "height": 1024, "width": 1024, "dtype": "fp16",
        },
        supported_schedulers=("euler-discrete", "ddim", "dpm-solver", "euler-ancestral"),
        adapter_kinds=("lora",),
        artifact_kind="image",
        maturity="experimental",   # experimental | beta | stable
        summary="One-sentence description that shows up in `omnirt models`.",
        example='omnirt generate --task text2image --model my-model --prompt "..." --backend cuda',
    ),
)
class MyPipeline(BasePipeline):
    def prepare_conditions(self, req): ...
    def prepare_latents(self, req, conditions): ...
    def denoise_loop(self, latents, conditions, config): ...
    def decode(self, latents): ...
    def export(self, raw, req): ...

`ModelCapabilities` field reference¶

capabilities exposes the model's configuration surface to omnirt models, omnirt validate, and the Python API:

Field	Purpose
`required_inputs`	Keys that must appear in `inputs`; `validate()` errors if missing
`optional_inputs`	Keys allowed but not required in `inputs`
`supported_config`	Allowed keys in `config`; anything outside the list is flagged as a warning
`default_config`	Fallback values when the user does not provide them (height/width/dtype…)
`supported_schedulers`	Scheduler ids actually tested for this pipeline; `--scheduler` outside the list emits a warning
`adapter_kinds`	Supported adapter kinds (currently only `"lora"`)
`artifact_kind`	`"image"` or `"video"`; selects the exporter
`maturity`	`experimental` / `beta` / `stable`; shown in `omnirt models`
`summary` / `example`	Short description and sample command for docs and CLI

Full definition: src/omnirt/core/registry.py.

2. Reuse the backend contract¶

Wrap backend-sensitive submodules through self.runtime.wrap_module(submodule, tag="unet"). This gives the new model compile / override / eager fallback behavior without any model-specific backend code, and every attempt lands in RunReport.backend_timeline.

3. Register aliases (optional)¶

Stacking @register_model(...) decorators lets one pipeline class expose several ids. Mark the alias variant with alias_of so it shows up in a dedicated "Aliases" block in omnirt models --format markdown instead of cluttering the main list:

@register_model(
    id="flux2.dev",
    task="text2image",
    capabilities=ModelCapabilities(summary="Flux 2 dev text-to-image pipeline."),
)
@register_model(
    id="flux2-dev",
    task="text2image",
    capabilities=ModelCapabilities(
        summary="Flux 2 dev text-to-image pipeline.",
        alias_of="flux2.dev",
    ),
)
class Flux2Pipeline(BasePipeline):
    ...

Each decorator invocation writes its own registry entry; get_model("flux2-dev") and get_model("flux2.dev") resolve to the same class.

4. Formats and conventions¶

Weights: safetensors only.
Adapters: validate the path in __init__, then apply once inside prepare_conditions after the real pipeline is materialized; single-file adapters may come from local paths or Hugging Face.
Artifacts: export returns Artifact records with complete file paths; the CLI echoes Artifact.path.

5. Plug the model into registration¶

Add your id to _BUILTIN_MODEL_IDS and import your pipeline module inside ensure_registered() in src/omnirt/models/__init__.py. That way omnirt models loads your pipeline on startup.

6. Add tests¶

Unit: copy tests/unit/test_sd15_pipeline.py as the template. Fake runtime + fake Diffusers object cover BasePipeline.run() end-to-end.
Integration: tests/integration/test_<model>_{cuda,ascend}.py. Hardware-dependent cases are auto-skipped via the shared conftest.py.
Parity (optional): if the model participates in cross-backend verification, wire into the latent-statistics / PSNR helpers in tests/parity/test_parity.py.

7. Documentation¶

You do not manually edit docs/_generated/models.en.md. Run python scripts/generate_models_doc.py and the ModelCapabilities.summary line flows into the table automatically.
If the model has deployment quirks or known limitations, document them in the "Partial support" or "Awaiting hardware smoke" section of Support Status.

Reference pipelines¶

src/omnirt/models/sd15/, src/omnirt/models/sdxl/, and src/omnirt/models/svd/ are the most complete references today: they cover SD1.5, SDXL, and SVD end-to-end with the full five-stage implementation, and also demonstrate shared concerns like local fp16 variant detection, LoRA wiring, and video frame export.