agent-zero

mirror of https://github.com/agent0ai/agent-zero.git synced 2026-05-22 19:47:15 +00:00

Author	SHA1	Message	Date
Alessandro	5e2c2a86ef	Add skill visibility controls Some checks are pending Build And Publish Docker Images / plan (push) Waiting to run Details Build And Publish Docker Images / build (push) Blocked by required conditions Details Let users hide skills from the model-facing available catalog through the chat Skills selector while keeping pinned skill injection as a separate mode. Hidden skills are filtered from skill listing, search, loading, relevant recall, and loaded-skill prompt injection, with chat-level show/hide overrides and persistent default hidden-skill config support.	2026-05-22 17:44:22 +02:00
Alessandro	bb48fad754	Improve Codex OAuth model setup UI Add Main and Utility Codex model selectors to the OAuth plugin config and persist them through the existing model config API. Clean up the OAuth config layout by removing the redundant Check Models action, moving the available model list above Advanced, softening borders, and removing repeated account labels. Show account quota usage bars on the welcome dashboard Codex card and add static coverage for the selector, model list, and quota UI.	2026-05-22 17:22:47 +02:00
Alessandro	4f2d996ac8	Allow file browser to open root Markdown in Editor Let File Browser-sourced Editor opens register existing Markdown files under the Agent Zero runtime root while preserving the stricter document artifact sandbox for ordinary document operations. Pass the file-browser source through the Editor store and cover the /a0/AGENTS.md-style path with regression tests.	2026-05-22 15:00:03 +02:00
Alessandro	1c9b5c8b21	Add contextual file browser surface actions Route Markdown files to Editor, txt and Office documents to Desktop, and browser-renderable files to Browser from the file browser action menu. Extend the Desktop/document allowlists for txt files, keep unsupported small files on the legacy editor path, and harden tooltip cleanup for dropdown-triggered modal closes.	2026-05-22 14:45:53 +02:00
Alessandro	8601f0d10c	Split text editor and Office artifact ownership - rename document_artifact to office_artifact and remove retired shims/facades - make text_editor own Markdown saves, canvas-open intent, refresh, and stale-save protection - keep Office artifacts Desktop-only with Office formats and update skills/tests	2026-05-22 11:21:04 +02:00
Alessandro	c1bdde057c	Make Desktop screenshots ephemeral by default Route in-process Xpra/Desktop screenshot observations through context-scoped ephemeral image refs with vision_load payloads, matching the privacy posture of computer-use and browser screenshots. Keep desktopctl shell observations path-based with aggressive pruning so image payloads are not printed into shell logs, and preserve explicit screenshot paths as durable user-owned artifacts.	2026-05-22 10:21:28 +02:00
Alessandro	430c48d1a5	Make browser screenshots ephemeral and context scoped Route no-path browser screenshots through an in-process ephemeral image registry that vision_load consumes into the existing data-url model boundary. Stop materializing host-browser artifacts into tmp/browser/host-screenshots, keep explicit path screenshots durable, and make browser log metadata point at the active chat/task context while preserving browser-context detail.	2026-05-22 09:50:47 +02:00
Alessandro	26b3ae00d5	Open saved browser screenshots in image viewer Route persisted browser screenshot thumbnails to the shared image viewer modal while keeping live Browser previews as Canvas entry points.\n\nAdd regression coverage so static screenshot artifacts do not regress back to opening the Browser surface.	2026-05-22 05:03:11 +02:00
Alessandro	d1827e6c66	Refactor: use user locale for time displays Some checks are pending Build And Publish Docker Images / plan (push) Waiting to run Details Build And Publish Docker Images / build (push) Blocked by required conditions Details Add user-configurable timezone and 12/24-hour preferences, then wire them through settings, runtime snapshots, scheduler payloads, wait handling, notifications, backups, memory, plugin metadata, and frontend formatters. Keep UTC as the boundary for absolute instants while serializing user-facing dates in the configured or browser-resolved timezone. Preserve scheduler wall-clock inputs in the selected timezone, propagate TZ into desktop/runtime process environments, and restart active desktop sessions when the runtime timezone changes. Cover the risky paths with timezone regression tests for settings normalization, auto and fixed timezone resolution, scheduler round-trips, memory timestamp conversion, and desktop timezone sync.	2026-05-21 15:26:00 +02:00
Alessandro	675afa8dee	Refactor speech stack into built-in Kokoro TTS and Whisper STT plugins Some checks are pending Build And Publish Docker Images / plan (push) Waiting to run Details Build And Publish Docker Images / build (push) Blocked by required conditions Details Split the legacy core speech stack into two built-in, independently toggleable plugins: `_kokoro_tts` for TTS and `_whisper_stt` for STT. This refactor keeps dependency installation and bootstrap concerns in Docker/bootstrap/preload, while moving speech-specific tooling, APIs, prompts, UI, and runtime behavior into the plugins. Core now exposes engine-agnostic `tts-service` and `stt-service` brokers, with browser-native TTS preserved as the fallback when Kokoro is disabled. Included in this change: - add built-in `_kokoro_tts` plugin with plugin-owned synth API, config, status UI, and provider registration - add built-in `_whisper_stt` plugin with plugin-owned transcribe API, mic runtime, device UI, prompt injection, and provider registration - remove legacy core speech APIs/helpers/settings/UI and delete unused `webui/js/speech_browser.js` - replace the old hardcoded speech settings section with a generic voice surface backed by plugin extensions - update preload/docs/tests to match the new plugin-owned speech architecture Behavioral intent: - both plugins are built-in but not `always_enabled` - users can now hot-switch TTS and STT independently - browser TTS remains available when `_kokoro_tts` is off - Whisper mic UI only appears when `_whisper_stt` is enabled	2026-05-21 05:41:59 +02:00
Alessandro	d4a9cd82d5	Simplify plugin activation toggle UI Replace the plugin list activation dropdown and advanced shortcut with a one-click ON/OFF switch. Keep project/profile-specific activation inside the plugin config flow, remove the old advanced-only modal, update plugin docs, and add regression coverage for the binary list toggle contract.	2026-05-21 04:31:19 +02:00
Alessandro	30315f5227	Reduce plugin scanner false positives Calibrate scanner prompts around demonstrated risk instead of the mere presence of common plugin capabilities. Treat scoped credentials, network calls, filesystem access, subprocesses, prompts, and generated assets as expected behavior when they match the declared plugin purpose, while keeping warnings and failures for ambiguity, unsafe handling, concealment, exploitability, or purpose mismatch. Add regression coverage for the rendered scanner prompt so this calibration is preserved.	2026-05-21 04:02:43 +02:00
Alessandro	cf51c792f5	Make error retry count configurable Some checks are pending Build And Publish Docker Images / plan (push) Has been skipped Details Build And Publish Docker Images / build (push) Waiting to run Details Read the _error_retry retry limit from plugin settings instead of using the hardcoded single retry. Add config sanitization, preserve the default retry count in the settings UI, update plugin docs, and cover configured and zero-retry behavior with focused tests.	2026-05-18 03:23:56 +02:00
Alessandro	e0337410e7	Preserve model preset inherited settings Deep-merge model preset slots with the active configuration so custom context windows, rate limits, and nested kwargs survive preset switches. Treat legacy utility preset defaults as implicit values, allow omitted utility and embedding slots to inherit configured models, and document the partial-preset behavior.	2026-05-18 02:45:08 +02:00
Alessandro	27aa2d8550	Improve Browser Docker runtime recovery Clarify Browser settings around internal Docker vs A0 CLI host-browser runtimes. Add recovery guidance to host-browser failures so users can switch back to the internal Docker browser from settings or /browser container. Cover the recovery messaging in host-browser connector tests.	2026-05-18 02:00:31 +02:00
Alessandro	70adbe91a0	Polish Editor and Browser surface cleanup Remove obsolete Office markdown editor UI and handoff code now that Markdown lives in the dedicated Editor surface. Harden the Editor modal so it opens directly into a Markdown draft and rebinds Ace to the visible root when switching surfaces. Make Browser address Enter navigation explicit and update the canvas setup expectations for the slimmer Office shell.	2026-05-15 12:38:29 +02:00
Alessandro	89901b64f0	Polish native Markdown editor experience Some checks are pending Build And Publish Docker Images / plan (push) Waiting to run Details Build And Publish Docker Images / build (push) Blocked by required conditions Details Expand the dedicated Editor surface with safe rendered preview mode, ACE-backed source editing, browser-style tabs, toolbar/file actions, preview search, and richer Markdown rendering for code blocks, task lists, images, tables, math, local links, and footnotes. Keep open Markdown files synchronized with the active context and saved tool edits, including live refresh for document_artifact and text_editor results without routing Markdown through Desktop/Office. Add inline preview-page editing, clickable preview task-list checkboxes, source editor rehydration after preview-mode refreshes, and regression coverage for the new editor wiring and sync behavior.	2026-05-15 04:47:24 +02:00
Alessandro	330a0c5790	Split Markdown editor into dedicated surface Add a builtin _editor plugin that owns Markdown API/WebSocket sessions, canvas and modal UI, live refresh, tabs, prompt Extras for active-context open files, inline close confirmation, and Close All handling. Route Markdown document artifacts to Editor while keeping Office/Desktop focused on LibreOffice formats, and update Desktop/Office prompts, menus, compatibility shims, and regression coverage.	2026-05-15 02:41:41 +02:00
Alessandro	7ba1d61e34	Make self-update backups skip runtime sockets Some checks failed Build And Publish Docker Images / plan (push) Has been cancelled Details Build And Publish Docker Images / build (push) Has been cancelled Details Treat live usr runtime artifacts as non-blocking during self-update backups. Skip sockets, device nodes, vanished files, and unreadable entries with log messages so update rollback checks are not tripped by active Desktop profile state.	2026-05-12 16:46:07 +02:00
Alessandro	68c3b8b022	Move office and desktop state under plugin storage Migrate retired /usr/_office and /usr/_desktop trees from plugin startup into /usr/plugins/<plugin>. Update office document storage, desktop session/runtime paths, and context-scoped screenshots to use the plugin-owned state layout. Add focused tests for retired-state migration and the new path behavior.	2026-05-12 16:21:43 +02:00
Alessandro	7b61ceb241	Reflect connector model overrides in Web UI Render custom per-chat model overrides in the model switcher instead of hiding them behind a generic Custom label. Mark model override updates dirty so an already-open Web UI refreshes after CLI or Web UI changes, without exposing API key values in labels. Add focused regression coverage for switcher rendering hooks and state-sync notifications.	2026-05-12 16:04:02 +02:00
Alessandro	03cc91287e	Add Nebius Token Factory provider Register Nebius Token Factory as an OpenAI-compatible chat provider using the Token Factory API base URL and NEBIUS_API_KEY-derived provider id. Expose Nebius in onboarding metadata and add static coverage for the provider endpoint and UI listing.	2026-05-12 15:50:24 +02:00
Alessandro	4bab8da3f5	Keep host browser requests on Browser runtime Route host/local browser requests through the Browser tool instead of desktop or shell fallbacks. Add remote-debugging setup guidance to Browser runtime errors and document the exact Chrome inspect setting in prompts, skills, and Web UI copy.	2026-05-12 15:45:29 +02:00
Alessandro	7b1c84aeca	Improve browser tool ergonomics for agent UI control Teach the Browser content helper to ignore global/delegated framework event bindings so snapshots surface the actual actionable controls instead of broad wrapper elements. Add an accessible name to the Browser address bar for more reliable capture output. Allow agents to use selector-based reference actions, coordinate click fallbacks, focused-field typing, and string key chords such as CTRL+A across the browser tool, container runtime, and host connector runtime. Cover the behavior with browser regression and host connector tests.	2026-05-12 09:41:13 +02:00
Alessandro	55474443c9	Stabilize document artifact affordances Make file creation opt-in through document_artifact, move document file cards to final responses, and keep the tool payload as a quiet execution record. Deduplicate response cards by file identity, refresh open Desktop canvas sessions after saved edits, and harden document_artifact edit input normalization for common append/update shapes. Update prompts, skills, styles, and regression coverage for response-only file actions and explicit-only canvas opening.	2026-05-12 06:59:22 +02:00
Alessandro	6de7073bf9	Fix blocking history compression edge cases Detect stalled automatic history compression so the prompt-prep wait loop cannot spin forever when no further reduction is possible. Split large manual chat compaction input by verified token budget instead of line midpoint, covering single-line 85k+ character histories. Add regression tests for stalled compression, max-pass bailout, and large single-line compaction chunking.	2026-05-12 04:47:28 +02:00
Alessandro	1f2d512226	fix(api): resolve image_get containment bypass (#1609 ) Fixes agent0ai/agent-zero#1609. Issue: "Unauthenticated Path-Containment Bypass in Agent Zero `/api/image_get`" https://github.com/agent0ai/agent-zero/issues/1609 Resolve the path-containment bypass in /api/image_get by resolving requested images against the Agent Zero base directory before serving them, including symlink-aware validation and the development RFC fallback path. Harden SVG and SVGZ responses with nosniff and a sandboxed CSP so uploaded SVGs cannot execute scripts in the Agent Zero origin. Add focused regressions for outside paths, symlink escapes, SVG headers, and development-mode remote validation.	2026-05-12 04:15:10 +02:00
Alessandro	ba0d90c380	Improve model config provider controls Reset the custom API base URL whenever the provider dropdown changes so stale endpoints do not carry across provider tests. Move the chat Supports Vision toggle out of Advanced Settings while keeping dependent vision settings, such as Max embeds, inside Advanced.	2026-05-12 03:52:18 +02:00
Alessandro	f17198e126	fix: tighten tool guidance and editor workflows	2026-05-11 11:51:58 +02:00
Alessandro	6ba1f30dca	fix: make memory cleanup update stale fragments	2026-05-11 11:51:58 +02:00
frdel	904a0f4a25	Persist API chat lifetime and add cleanup job Validate and persist API chat lifetime: lifetime_hours is validated as a positive number and stored in the AgentContext data, and context.last_message is set using UTC. Removed the in-class threading-based cleanup state and old _cleanup_expired_chats method. Introduced a new job-loop extension (extensions/python/job_loop/_20_cleanup_expired_api_chats.py) that periodically scans AgentContext instances and removes expired API chats (using persist_chat.remove_chat) in a UTC-aware manner. Added tests (tests/test_api_chat_lifetime.py) to verify lifetime persistence and that the job loop removes expired chats.	2026-05-11 08:49:06 +02:00
Alessandro	6d29268cbd	refactor: align skills and tool guidance Some checks are pending Build And Publish Docker Images / plan (push) Waiting to run Details Build And Publish Docker Images / build (push) Blocked by required conditions Details Rename high-impact skills to task-oriented names and move plugin-owned skills into their owning plugin folders.\n\nAlign renamed skill frontmatter with the official SKILL.md standard by keeping trigger language in name/description metadata, replacing the old create-skill wizard with build-skill, and updating browser, A0 connector, computer-use, CLI setup, and scheduler skill references.\n\nTighten the recurring cross-provider guidance gaps surfaced by the evidence sweeps: memory requests now avoid promptinclude-file routing, scheduler prompts distinguish cron schedules from planned ISO dates, document questions prefer document_query, skills_tool search/read_file usage is clearer, normal notifications set info/priority 10, and local/host text editors preserve patch intent.\n\nUpdate regression tests for the renamed skills, plugin ownership, prompt budget reality, and standard frontmatter shape.	2026-05-10 07:13:14 +02:00
Alessandro	854a96e880	Treat bare desktop canvas as ready Allow the Linux Desktop state collector to report a healthy canvas when XFCE has no active application window, as long as the display, visible windows, and screenshots are available. Document the readiness rule in the linux-desktop skill and add regression coverage for the bare-desktop active_window=null case.	2026-05-10 00:07:04 +02:00
Alessandro	daf95ec3ab	Normalize tool contracts and slim prompt surface Standardize multi-action tools around tool_args.action while keeping parser compatibility for older tool/args, tool_name:action, and method-shaped requests. This keeps new prompts clean without breaking agents that learned the previous dialect. Move A0 connector remote execution/file tools into stable standard prompts, make remote targeting independent of the active chat context, and skill-gate beta computer-use remote so it no longer weighs down the always-on tool list. Align text editor, scheduler, skills, office artifact, memory, notify, and browser prompts/tools around the canonical action contract. Add scheduler update/timezone handling, skills_tool read_file, text editor patch coverage, and fixes for memory_forget, behaviour_adjustment, and code execution progress warnings. Reduce default prompt pressure by compacting browser and scheduler prompts into skill-backed manifests, shortening skill catalog descriptions, and pruning noisy framework knowledge. Remove obsolete connector prompt stubs and root tool-call knowledge examples. Tests: conda run -n a0 pytest tests/test_a0_connector_prompt_gating.py tests/test_tool_action_contracts.py tests/test_task_scheduler_timezone.py tests/test_text_editor_context_patch.py tests/test_tool_request_normalization.py tests/test_office_document_store.py::test_odf_is_advertised_and_docx_remains_explicit_compatibility tests/test_office_document_store.py::test_document_artifact_accepts_method_alias_for_ods_create tests/test_skills_runtime.py tests/test_default_prompt_budget.py::test_a0_small_profile_removed_and_prompt_text_generic -q	2026-05-09 21:54:43 +02:00
Alessandro	09d9ed2e80	Bound browser tab usage during research	2026-05-09 17:36:15 +02:00
Alessandro	0a8aaee9ac	Add host browser profile mode setting Default Bring Your Own Browser mode to the existing browser profile while exposing a clean Agent profile option in Browser settings with a clear warning for existing-profile access. Forward the selected profile mode through the connector browser runtime, tolerate legacy config modules and old saved configs, and update regression coverage for the new payload shape.	2026-05-09 16:25:27 +02:00
Alessandro	f6bc52201d	Redesign first-run onboarding Introduce a guided Cloud versus Local first-run modal with provider selection, account connection, model picking, and a ready state.\n\nAdd the reusable discovery auto-modal trigger, chat-created startup checks, onboarding-owned provider presentation metadata and assets, OAuth affordances, local provider guidance, and model-search hardening.\n\nKeep runtime provider data centralized while preserving onboarding-specific copy, logos, and docs links in the onboarding plugin. Update onboarding.html Update onboarding.html	2026-05-09 07:46:36 +02:00
Alessandro	0395778fd7	Add Ollama Cloud provider Configure Ollama Cloud as an OpenAI-compatible chat provider with the hosted API base and model listing endpoint.\n\nAdd a focused provider config test so the integration stays key-required and keeps the expected /models endpoint.	2026-05-09 06:46:22 +02:00
Alessandro	44d5e1ccf7	Persist browser history screenshots Save a static JPEG for each Browser tool call in the chat history folder and render that immutable image in transcript screenshot previews. Keep live Browser surface attachment available through stored browser/context metadata, and ignore generated Playwright CLI artifacts.	2026-05-08 19:24:44 +02:00
Alessandro	bb2432693e	Fix canvas attachment for browser and documents Attach the Browser canvas to active Docker sessions by returning an initial snapshot on subscribe and preserving valid frames through state-only updates. Route Markdown document opens through the right-canvas Desktop editor instead of the legacy office modal. Skip automatic office document response affordances for subordinate agents so delegated reviews keep their actual content.	2026-05-08 19:08:53 +02:00
Alessandro	a3d41e2ca1	Split A0 remote workflow skills by affordance Replace the combined A0 CLI remote workflow skill with separate text-editor and code-execution remote skills, update tool stubs to load the matching per-tool guide, and keep computer-use remote scoped to desktop control. Add prompt-gating coverage for the per-affordance skill split.	2026-05-08 18:53:37 +02:00
Alessandro	229de5166b	Expose Browser runtime selection to CLI Some checks are pending Build And Publish Docker Images / plan (push) Waiting to run Details Build And Publish Docker Images / build (push) Blocked by required conditions Details Add a protected connector endpoint for reading and updating the Browser plugin runtime backend so the A0 CLI can switch between Docker browser and Bring Your Own Browser mode. Keep legacy host_when_available values normalized to host_required, move the host/container setting to the top of Browser settings, and cover the config normalization path.	2026-05-08 18:37:46 +02:00
Alessandro	001c7e2ccb	Simplify Host Browser config Remove the ambiguous Use host when ready option from the Browser plugin settings and present the host-required path as Bring Your Own Browser. Add concise Chrome/Chromium remote-debugging guidance, normalize legacy host_when_available values to the BYOB setting, and make missing host-browser connector setup a repairable error with regression coverage.	2026-05-08 18:18:03 +02:00
Alessandro	aa7944b95a	Centralize Browser helper contracts Move URL normalization into Agent Zero-owned Browser helper code and expose the content helper's required API contract from the shared asset. Normalize host-browser open/navigate payloads before they cross into the connector, including nested multi actions, and add regression coverage for helper payload delivery and URL edge cases.	2026-05-08 16:39:04 +02:00
Alessandro	c020f1af28	Send browser helper source to host connector Make the Browser plugin the source of truth for browser-page-content.js by attaching its source and sha256 to host-browser operations when the CLI has no matching helper hash. Store the helper hash in connector metadata and cover the routing/ensure path in tests.	2026-05-08 15:23:44 +02:00
Alessandro	d47207dfd7	Refine host browser routing and settings copy Some checks are pending Build And Publish Docker Images / plan (push) Waiting to run Details Build And Publish Docker Images / build (push) Blocked by required conditions Details Store and surface host-browser preparation and CDP endpoint metadata from A0 CLI. Let Browser runtime prepare candidate CLIs before the first action, and keep host-required errors more actionable. Simplify Host Browser settings language and document the Chrome remote-debugging consent flow.	2026-05-08 06:37:32 +02:00
Alessandro	4b3e2eb327	Route Browser through A0 host connector Integrate host-browser routing into the existing Browser tool. Store connector host-browser metadata, add pending browser op resolution, select connector runtimes from Browser settings, enforce host-content privacy policy, support automatic host preparation, and document the A0 CLI host-browser flow.	2026-05-08 04:22:18 +02:00
Alessandro	1af5344f3c	Clean uv cache during self-update Run a best-effort uv cache clean when the durable self-update manager consumes an update request. Cover uv-present, uv-missing, and cleanup-failure paths with focused tests, and document the new self-update step.	2026-05-08 00:56:24 +02:00
Alessandro	06a83030c0	Remove browser chat action button Delete the Browser plugin's injected bottom-action button so it no longer appears under the chat input while preserving the rest of the Browser surface entry points. Update the browser regression coverage to assert the chat action stays absent.	2026-05-08 00:36:36 +02:00
Alessandro	fa0d2beaf2	Clean up legacy runtime artifacts Some checks are pending Build And Publish Docker Images / plan (push) Waiting to run Details Build And Publish Docker Images / build (push) Blocked by required conditions Details Remove stale runtime directories and expand retired package coverage for startup/self-update cleanup. Discover installed collaboraoffice* split packages dynamically so future package-name changes are still purged, and extend Office cleanup tests for the legacy /opt paths, known leftover packages, idempotency, and marker rerun behavior.	2026-05-07 20:20:52 +02:00

1 2 3 4

182 commits