FlashTalk¶

Support Status¶

Item	Value
Model ID	`flashtalk`
Backend	`omnirt`, with legacy `direct_ws` fallback
Evidence level	OmniRT path documented; Ascend path has validation records
Best for	High-quality private deployment, heavy models, multi-card GPU/NPU

Recommended Hardware¶

CUDA evaluation should use a 4090/A100-class GPU. Ascend 910B deployment should run in the host CANN environment.

Weights¶

Primary Hugging Face sources:

Terminal

hf download Soul-AILab/SoulX-FlashTalk-14B --local-dir "$OMNIRT_MODEL_ROOT/SoulX-FlashTalk-14B"
hf download TencentGameMate/chinese-wav2vec2-base --local-dir "$OMNIRT_MODEL_ROOT/chinese-wav2vec2-base"

For domestic mirrors, search ModelScope or Modelers for SoulX-FlashTalk-14B and chinese-wav2vec2-base.

Directory Layout¶

$OMNIRT_MODEL_ROOT/
├── SoulX-FlashTalk-14B/
├── chinese-wav2vec2-base/
└── SoulX-FlashTalk/        # Optional for custom CUDA/manual paths

Configuration¶

configs/default.yaml

models:
  flashtalk:
    backend: omnirt

Legacy WebSocket fallback:

.env

OPENTALKING_FLASHTALK_WS_URL=ws://127.0.0.1:8765

Start¶

CUDA:

Terminal

bash scripts/quickstart/start_omnirt_flashtalk.sh --device cuda --nproc 1
bash scripts/quickstart/start_all.sh --omnirt http://127.0.0.1:9000

Ascend:

Terminal

source /usr/local/Ascend/ascend-toolkit/set_env.sh
bash scripts/quickstart/start_omnirt_flashtalk.sh --device npu --nproc 8

`/models` Verification¶

Terminal

curl -s http://127.0.0.1:8000/models | python3 -m json.tool

Expected:

{"id":"flashtalk","backend":"omnirt","connected":true,"reason":"omnirt"}

Common Errors¶

Symptom	Action
Cold startup is slow	Inspect OmniRT/FlashTalk logs and separate dependency setup, weight load, and worker init.
CUDA OOM	Lower `OPENTALKING_FLASHTALK_FRAME_NUM`, `OPENTALKING_FLASHTALK_SAMPLE_STEPS`, or resolution.
NPU import failure	Confirm CANN is sourced and `torch_npu`, driver, and CANN versions match.
`reason=not_configured`	Configure `OMNIRT_ENDPOINT` or run `start_all.sh --omnirt ...`.

Frontend Entry¶

After the model or backend service is running, use the OpenTalking WebUI:

Terminal

cd "$OPENTALKING_HOME"
bash scripts/quickstart/start_frontend.sh --api-port 8000 --web-port 5173 --host 0.0.0.0

For a remote server, forward your local browser port to the server 5173, then open http://127.0.0.1:5173.

FlashTalk¶

Support Status¶

Recommended Hardware¶

Weights¶

Directory Layout¶

Configuration¶

Start¶

/models Verification¶

Common Errors¶

Frontend Entry¶

`/models` Verification¶