agent-zero/docs/guides
Alessandro 675afa8dee
Some checks are pending
Build And Publish Docker Images / plan (push) Waiting to run
Build And Publish Docker Images / build (push) Blocked by required conditions
Refactor speech stack into built-in Kokoro TTS and Whisper STT plugins
Split the legacy core speech stack into two built-in, independently toggleable plugins: `_kokoro_tts` for TTS and `_whisper_stt` for STT.

This refactor keeps dependency installation and bootstrap concerns in Docker/bootstrap/preload, while moving speech-specific tooling, APIs, prompts, UI, and runtime behavior into the plugins. Core now exposes engine-agnostic `tts-service` and `stt-service` brokers, with browser-native TTS preserved as the fallback when Kokoro is disabled.

Included in this change:
- add built-in `_kokoro_tts` plugin with plugin-owned synth API, config, status UI, and provider registration
- add built-in `_whisper_stt` plugin with plugin-owned transcribe API, mic runtime, device UI, prompt injection, and provider registration
- remove legacy core speech APIs/helpers/settings/UI and delete unused `webui/js/speech_browser.js`
- replace the old hardcoded speech settings section with a generic voice surface backed by plugin extensions
- update preload/docs/tests to match the new plugin-owned speech architecture

Behavioral intent:
- both plugins are built-in but not `always_enabled`
- users can now hot-switch TTS and STT independently
- browser TTS remains available when `_kokoro_tts` is off
- Whisper mic UI only appears when `_whisper_stt` is enabled
2026-05-21 05:41:59 +02:00
..
a0-cli-connector.md refactor: align skills and tool guidance 2026-05-10 07:13:14 +02:00
a2a-setup.md optimize media; README.md; change structure 2026-02-10 17:38:02 +01:00
agent-profiles.md docs: make Agent Zero guides more human 2026-05-09 00:02:11 +02:00
api-integration.md docs: make Agent Zero guides more human 2026-05-09 00:02:11 +02:00
browser.md docs: make Agent Zero guides more human 2026-05-09 00:02:11 +02:00
contribution.md docs: make Agent Zero guides more human 2026-05-09 00:02:11 +02:00
create-plugin.md docs: make Agent Zero guides more human 2026-05-09 00:02:11 +02:00
desktop.md docs: make Agent Zero guides more human 2026-05-09 00:02:11 +02:00
mcp-setup.md docs: make Agent Zero guides more human 2026-05-09 00:02:11 +02:00
memory.md docs: make Agent Zero guides more human 2026-05-09 00:02:11 +02:00
model-presets.md Preserve model preset inherited settings 2026-05-18 02:45:08 +02:00
onboarding.md Document first-run onboarding flow 2026-05-09 08:05:09 +02:00
projects.md docs: make Agent Zero guides more human 2026-05-09 00:02:11 +02:00
self-update.md docs: make Agent Zero guides more human 2026-05-09 00:02:11 +02:00
skills.md docs: make Agent Zero guides more human 2026-05-09 00:02:11 +02:00
troubleshooting.md docs: make Agent Zero guides more human 2026-05-09 00:02:11 +02:00
usage.md Refactor speech stack into built-in Kokoro TTS and Whisper STT plugins 2026-05-21 05:41:59 +02:00