mirror of
https://github.com/agent0ai/agent-zero.git
synced 2026-05-17 04:01:13 +00:00
docs: update Browser README notes
This commit is contained in:
parent
44d5e1ccf7
commit
c06e4f497d
2 changed files with 6 additions and 22 deletions
|
|
@ -94,7 +94,11 @@ Markdown, Writer, Spreadsheet, and Presentation files share a compact active-fil
|
|||
<img alt="Native Browser" width="1406" height="720" src="https://github.com/user-attachments/assets/8d425af6-fb27-4d46-ade9-8ffa34c7382c" />
|
||||
<br>
|
||||
|
||||
Agent Zero includes a direct Playwright-powered Browser tool with a visible WebUI viewer. The agent can navigate pages, inspect readable page content, and act through typed page references such as `[link 3]`, `[button 6]`, and `[input text 8]` and use vision.
|
||||
Agent Zero includes a direct Playwright-powered `browser` tool with a visible WebUI viewer, replacing the older browser-agent command flow. The main agent can call actions such as `open`, `navigate`, `content`, `detail`, `screenshot`, `click`, and `type`, then act through typed page references like `[link 3]`, `[button 6]`, and `[input text 8]`.
|
||||
|
||||
The Browser can run in the Docker/container browser or, when configured through the A0 CLI, attach to a host Chrome-family browser. Browser tool calls do not force-open the canvas by themselves: open the Browser surface when you want to watch the live page, or ask the agent to open/show the Browser in canvas.
|
||||
|
||||
Browser history keeps per-step screenshots, so older chats show the page state the agent actually saw at that moment instead of linking every step to the latest live frame.
|
||||
|
||||
For web and mobile development, Annotate mode lets you click page elements or regions and leave actionable comments for the agent targeted at the page itself. You can review a UI visually, mark what needs to change, and send those notes straight back into the conversation.
|
||||
|
||||
|
|
@ -180,7 +184,7 @@ Agent Zero supports plugins, MCP, A2A, custom tools, custom prompts, project-sco
|
|||
|
||||
## Try These First
|
||||
|
||||
- **Research with a browser:** "Open the browser, compare three project management tools for a small AI team, and summarize the tradeoffs with source links."
|
||||
- **Research with the Browser tool:** "Use the Browser tool to compare three project management tools for a small AI team, and summarize the tradeoffs with source links."
|
||||
- **Cowork on a spreadsheet:** "Create an editable ODS budget model with assumptions and monthly projections."
|
||||
- **Review a web UI:** "Open my local app in the Browser. I will annotate the page with comments; then implement the requested UI fixes."
|
||||
- **Work inside a Git project:** "Clone this repository into a new project, inspect the architecture, and propose the safest first improvement."
|
||||
|
|
|
|||
|
|
@ -1,20 +0,0 @@
|
|||
# Office Tunnel Memory Footprint Note
|
||||
|
||||
During the ARM Desktop audit on 2026-05-03, `run_tunnel.py` was the clearest near-future memory optimization candidate. The process held roughly 573 MiB PSS while the main UI process held roughly 706 MiB PSS, even though the tunnel should mostly be a lightweight network edge.
|
||||
|
||||
## Observed Shape
|
||||
|
||||
- `run_tunnel.py` imports enough of the framework stack to pull in heavy provider and API dependencies.
|
||||
- The tunnel stays resident for the life of the container, so every eagerly imported module becomes steady-state memory.
|
||||
- The Desktop/Xpra service itself was not the largest outlier; the always-on tunnel process was.
|
||||
|
||||
## Future Work
|
||||
|
||||
- Split tunnel startup into a small import surface that only loads routing, auth, and socket plumbing required for health and proxy operation.
|
||||
- Lazy-import provider/framework modules only when a tunnel request truly needs them.
|
||||
- Review any `ApiHandler` or helper imports used by the tunnel path and replace broad framework imports with narrower functions.
|
||||
- Measure with `smem -P run_tunnel.py`, `/proc/<pid>/smaps_rollup`, and before/after cold-start RSS/PSS on ARM64.
|
||||
|
||||
## Success Signal
|
||||
|
||||
The tunnel process should remain useful as an always-on edge while dropping its idle PSS substantially below the main UI process. A good first target is under 250 MiB PSS on ARM64 without changing tunnel behavior.
|
||||
Loading…
Add table
Add a link
Reference in a new issue