Skip to content

Results and Baselines

This page records only currently citable public baselines. New results should keep full context instead of reporting FPS alone.

Current Reference Data

Path Hardware / State Data Notes
Wav2Lip quickstart NVIDIA 3090 path singer example: about 28 frames / 0.83-0.85s, about 33 FPS README quickstart configuration record; useful as a lightweight-model reference.
FlashTalk via OmniRT Ascend 910B2 x8, hot full-audio 937 frames / 37.377s, about 25 FPS External OmniRT/model-service inference baseline, not direct OpenTalking inference.
FlashTalk steady chunk Ascend 910B2 x8, hot chunk 29-frame chunks around 30 FPS equivalent External steady-state inference data; keep separate from end-to-end first response.

Result Template

### <model> / <backend> / <hardware> / <date>

- OpenTalking commit:
- backend commit or service version:
- hardware:
- model and weights:
- avatar:
- input audio:
- cold start or hot state:
- `session_create_ms`:
- `llm_first_token_ms`:
- `tts_first_pcm_ms`:
- `avatar_first_frame_ms`:
- `webrtc_first_frame_ms`:
- `render_fps`:
- `av_drift_ms`:
- notes:

Interpretation Rules

  • For end-to-end experience, prioritize first response and A/V sync over model FPS alone.
  • For model-service throughput, prioritize steady chunks and queue depth over one cold run.
  • Public references to external benchmarks must label the source as an external backend.