Skip to content

WebUI Basic Usage

WebUI is OpenTalking's interactive workspace for avatar selection, model selection, voice configuration, and conversation validation. It is useful for quick checks and for letting non-engineering teammates preview the digital human experience.

What WebUI Is For

Use WebUI to:

  • Select built-in or custom avatars.
  • Choose the model for the current session.
  • Select TTS provider and voice.
  • Converse through text or voice.
  • Inspect connection status, session state, and errors.

WebUI is not a production admin system or a full asset management platform. It is a visual validation and debugging entry point.

Open WebUI

Start services with:

bash scripts/start_unified.sh --mock

The script prints the WebUI URL. The default is:

http://127.0.0.1:5173

If you changed ports, use the URL printed by the terminal.

Screenshot placeholder: WebUI first screen To be added: avatar list, settings area, and conversation panel after successful startup.

Page Layout

Avatar Selection

The avatar area lists available digital humans. Each item usually has a preview image, name, and type label. Custom avatars are marked as custom and can be deleted.

If an expected avatar is missing, confirm that it is under OPENTALKING_AVATARS_DIR and contains a valid manifest.json and preview image.

Model Selection

Model selection controls which digital human driver model is used for the session. In Mock mode, choose a driverless/mock option. In local or OmniRT mode, make sure model weights, backend services, and startup arguments are ready.

Model capabilities and backend choices are covered in Model Support.

Voice Selection

Voice selection controls the TTS provider and voice used for generated speech. Different providers have different voice identifiers, credentials, and latency profiles.

For first validation, use the default voice. For a business-specific voice, see Voice and TTS.

Conversation Panel

The conversation panel is used for text input, replies, and digital human playback. When voice input is enabled, the browser asks for microphone permission.

Start with short text to verify first frame, audio, and captions before testing long prompts or continuous voice.

Status and Errors

WebUI displays connection, session, model, and TTS errors. When something fails, read the page message first, then inspect API and WebUI logs.

First-use Flow

1. Select Avatar

Select an avatar from the library. For the first run, use a built-in avatar to avoid custom asset issues.

2. Select Model

Choose a model that matches the startup mode:

  • Mock mode: choose the mock / driverless option.
  • Local QuickTalk: choose quicktalk.
  • OmniRT backend: choose the model specified in startup arguments.

3. Select Voice

Use the default voice first. If multiple providers are configured, preview voices before selecting one.

4. Create Session

Create a session after avatar, model, and voice are selected. When creation succeeds, the page enters the conversation state.

5. Allow Microphone Permission

If you use voice input, allow microphone access in the browser. Text-only usage does not require it.

6. Start Conversation

Try a short message:

Hello, please briefly introduce OpenTalking.

After reply, audio, and video are working, test more complex input.

Screenshot placeholder: complete session To be added: selected avatar, model, voice, and a generated digital human response.

Common Operations

Switch Avatar

After switching avatars, recreate the session. Different avatars may have different assets and model compatibility.

Switch Model

Before switching models, make sure the backend supports the selected model. Otherwise session creation may fail.

Switch Voice

Voice changes affect future replies. Already generated audio is not re-synthesized.

View Captions and Events

The page shows conversation text, generated replies, and some status events. For detailed backend events, inspect API logs or later reference materials.

Stop or Recreate Session

If inference stalls, audio breaks, or configuration changes behave unexpectedly, stop the current session and create a new one. If needed, restart services:

bash scripts/quickstart/stop_all.sh

Common Issues

Blank Page or Failed Assets

Check that the WebUI dev server is still running and that the frontend has no compilation errors.

Session Creation Fails

Check API status:

bash scripts/quickstart/status.sh

Then confirm the selected model matches the backend that was started.

No Audio

Check browser mute state, TTS provider configuration, voice availability, credentials, and network access.

Microphone Unavailable

Check browser permissions, system microphone permissions, and whether the page is opened from localhost or 127.0.0.1.