agent-zero/docs
Alessandro 675afa8dee
Some checks are pending
Build And Publish Docker Images / plan (push) Waiting to run
Build And Publish Docker Images / build (push) Blocked by required conditions
Refactor speech stack into built-in Kokoro TTS and Whisper STT plugins
Split the legacy core speech stack into two built-in, independently toggleable plugins: `_kokoro_tts` for TTS and `_whisper_stt` for STT.

This refactor keeps dependency installation and bootstrap concerns in Docker/bootstrap/preload, while moving speech-specific tooling, APIs, prompts, UI, and runtime behavior into the plugins. Core now exposes engine-agnostic `tts-service` and `stt-service` brokers, with browser-native TTS preserved as the fallback when Kokoro is disabled.

Included in this change:
- add built-in `_kokoro_tts` plugin with plugin-owned synth API, config, status UI, and provider registration
- add built-in `_whisper_stt` plugin with plugin-owned transcribe API, mic runtime, device UI, prompt injection, and provider registration
- remove legacy core speech APIs/helpers/settings/UI and delete unused `webui/js/speech_browser.js`
- replace the old hardcoded speech settings section with a generic voice surface backed by plugin extensions
- update preload/docs/tests to match the new plugin-owned speech architecture

Behavioral intent:
- both plugins are built-in but not `always_enabled`
- users can now hot-switch TTS and STT independently
- browser TTS remains available when `_kokoro_tts` is off
- Whisper mic UI only appears when `_whisper_stt` is enabled
2026-05-21 05:41:59 +02:00
..
agents Simplify plugin activation toggle UI 2026-05-21 04:31:19 +02:00
developer docs: make Agent Zero guides more human 2026-05-09 00:02:11 +02:00
guides Refactor speech stack into built-in Kokoro TTS and Whisper STT plugins 2026-05-21 05:41:59 +02:00
res Document first-run onboarding flow 2026-05-09 08:05:09 +02:00
setup Refactor speech stack into built-in Kokoro TTS and Whisper STT plugins 2026-05-21 05:41:59 +02:00
quickstart.md Document first-run onboarding flow 2026-05-09 08:05:09 +02:00
README.md Document first-run onboarding flow 2026-05-09 08:05:09 +02:00

Agent Zero Logo

Agent Zero Documentation

Welcome to the Agent Zero documentation hub. Start with the practical guides below: install it, open the Web UI, connect your host machine when needed, and learn the main workflows by sight.

For architecture and source-linked internals, use DeepWiki for Agent Zero. The local docs focus on practical setup, screenshots, and user workflows.

Quick Start

User Guides

  • Usage Guide: Practical tour of Agent Zero's main workflows.
  • First-Run Onboarding: Set up OpenRouter, our proxy API or another provider with the guided wizard.
  • Browser Guide: Use the built-in Browser, live Canvas surface, annotations, screenshots, host browser mode, and extensions.
  • Desktop Guide: Use the built-in Linux desktop, GUI apps, and LibreOffice Writer/Calc/Impress Cowork.
  • A0 CLI Connector: Terminal-first host connector for Agent Zero, with screenshots of the host picker, connected shell, command palette, and Browser modes.
  • Create a Small Plugin: Build and review a tiny Web UI plugin that adds an unread dot to the chat list.
  • Skills Guide: Open the Skills selector, add active skills, and remove prompt extras you no longer need.
  • Agent Profiles: Switch the current chat profile or create a new guided profile from the chat input.
  • Model Presets: Create simple named shortcuts for model setups.
  • Memory Guide: Search, edit, delete, and curate memories so useful context does not become stale noise.
  • Projects Tutorial: Learn to create isolated workspaces with dedicated context and memory.
  • API Integration: Add external APIs without writing code.
  • MCP Setup: Configure Model Context Protocol servers.
  • A2A Setup: Enable agent-to-agent communication.
  • Troubleshooting: Solutions to common issues and FAQs.

Technical Reference

Community & Support

  • Join the Community: Connect with other users on Discord to discuss ideas, ask questions, and collaborate.
  • Share Your Work: Show off your Agent Zero creations and workflows in the Show and Tell area.
  • Report Issues: Use the GitHub issue tracker to report bugs or suggest features.
  • Follow Updates: Subscribe to the YouTube channel for tutorials and release videos.

Table of Contents


Documentation Ownership

Content type Home
Setup, screenshots, and everyday workflows These docs
Architecture and source-linked internals DeepWiki
Exact behavior The current source code
Community help and examples Discord, Skool, GitHub discussions

Your journey with Agent Zero starts now!

Ready to dive in? Start with the Quickstart Guide for the fastest path to your first chat, or follow the Installation Guide for a detailed setup walkthrough.