vrr/zed - VRR Forge

mirror of https://github.com/zed-industries/zed.git synced 2026-05-25 14:44:28 +00:00

Author	SHA1	Message	Date
Ben Kunkle	8033fbfccf	ep: Send trigger in header (#56433 ) Self-Review Checklist: - [x] I've reviewed my own diff for quality, security, and reliability - [x] Unsafe blocks (if any) have justifying comments - [x] The content is consistent with the [UI/UX checklist](https://github.com/zed-industries/zed/blob/main/CONTRIBUTING.md#uiux-checklist) - [x] Tests cover the new/changed behavior - [x] Performance impact has been considered and is acceptable Closes #ISSUE Release Notes: - N/A or Added/Fixed/Improved ...	2026-05-11 15:22:43 +00:00
Oleksiy Syvokon	740b4241c0	ep: Generate request IDs on the client (#56386 ) This change lays the groundwork for canceling in-flight requests. Without it, we always wait for a request to complete, even when we already know its result won't be used. Release Notes: - N/A	2026-05-11 09:10:05 +00:00
Oleksiy Syvokon	4bea755dee	ep: Fix moving cursor to a predicted position (#55079 ) Starting ~3 weeks ago, `output` no longer contains the cursor marker, cloud strips it on parsing. Instead, it should return a cursor offset. Release Notes: - Fixed moving the cursor to a predicted position in Zeta 2	2026-04-28 15:44:17 +00:00
Ben Brandt	2eafa6e6aa	language_models: Remove unused language model token counting (#54177 ) Drop the `count_tokens` API and related implementations across providers, and remove the unused `tiktoken-rs` dependency. I was going to update the dependency because they finally released a fix we needed. But then I realized we only used this API in one place, the Rules library. And for most models it would have been wildly incorrect because we use tiktoken, i.e. OpenAI tokenizers, for almost every model, which is going to give incorrect results. Given that, I just removed these because the difference in how we get these has caused plenty of confusion in the past. Self-Review Checklist: - [x] I've reviewed my own diff for quality, security, and reliability - [x] Unsafe blocks (if any) have justifying comments - [x] The content is consistent with the [UI/UX checklist](https://github.com/zed-industries/zed/blob/main/CONTRIBUTING.md#uiux-checklist) - [x] Tests cover the new/changed behavior - [x] Performance impact has been considered and is acceptable Release Notes: - N/A	2026-04-22 13:39:48 +00:00
Ben Kunkle	e3718e51ac	ep: Send preferred experiment in a header (#54154 ) Self-Review Checklist: - [x] I've reviewed my own diff for quality, security, and reliability - [x] Unsafe blocks (if any) have justifying comments - [x] The content is consistent with the [UI/UX checklist](https://github.com/zed-industries/zed/blob/main/CONTRIBUTING.md#uiux-checklist) - [x] Tests cover the new/changed behavior - [x] Performance impact has been considered and is acceptable Closes #ISSUE Release Notes: - N/A or Added/Fixed/Improved ...	2026-04-17 07:25:01 -04:00
Ben Kunkle	d3d8f1500d	ep: Send edit prediction mode in prediction request (#53812 ) Self-Review Checklist: - [x] I've reviewed my own diff for quality, security, and reliability - [x] Unsafe blocks (if any) have justifying comments - [x] The content is consistent with the [UI/UX checklist](https://github.com/zed-industries/zed/blob/main/CONTRIBUTING.md#uiux-checklist) - [x] Tests cover the new/changed behavior - [x] Performance impact has been considered and is acceptable Closes #ISSUE Release Notes: - N/A or Added/Fixed/Improved ...	2026-04-15 02:22:14 -04:00
Agus Zubiaga	98c17ca160	language_models: Refactor deps and extract cloud (#53270 ) - `language_model` no longer depends on provider-specific crates such as `anthropic` and `open_ai` (inverted dependency) - `language_model_core` was extracted from `language_model` which contains the types for the provider-specific crates to convert to/from. - `gpui::SharedString` has been extracted into its own crate (still exposed by `gpui`), so `language_model_core` and provider API crates don't have to depend on `gpui`. - Removes some unnecessary `&'static str` | `SharedString` -> `String` -> `SharedString` conversions across the codebase. - Extracts the core logic of the cloud `LanguageModelProvider` into its own crate with simpler dependencies. Release Notes: - N/A --------- Co-authored-by: John Tur <john-tur@outlook.com>	2026-04-07 12:28:19 -03:00
Marshall Bowers	72bc4dc534	cloud_llm_client: Move `CompletionIntent` to `language_model` (#52359 ) This PR moves the `CompletionIntent` enum from the `cloud_llm_client` crate to the `language_model` crate, as it is no longer part of the Cloud interface. Release Notes: - N/A	2026-03-25 08:39:17 +01:00
Ben Brandt	d3ab0d9daf	agent: Mark subagent completions with Subagent intent (#52350 ) ## Context Ensure subagent threads build requests with the Subagent intent instead of UserPrompt. This allows us to properly attribute this as a tool call for certain providers instead of a user request. ## Self-Review Checklist <!-- Check before requesting review: --> - [x] I've reviewed my own diff for quality, security, and reliability - [x] Unsafe blocks (if any) have justifying comments - [x] The content is consistent with the [UI/UX checklist](https://github.com/zed-industries/zed/blob/main/CONTRIBUTING.md#uiux-checklist) - [x] Tests cover the new/changed behavior - [x] Performance impact has been considered and is acceptable Release Notes: - copilot_chat: Fix subagent requests being marked as user requests.	2026-03-24 23:12:20 +01:00
Neel	0f1f0f9272	cloud_llm_client: Add derives for edit prediction fields (#51968 ) ## Context This PR adds some derives which make tracing easier on cloud side. ## Self-Review Checklist <!-- Check before requesting review: --> - [x] I've reviewed my own diff for quality, security, and reliability - [x] Unsafe blocks (if any) have justifying comments - [x] The content is consistent with the [UI/UX checklist](https://github.com/zed-industries/zed/blob/main/CONTRIBUTING.md#uiux-checklist) - [x] Tests cover the new/changed behavior - [x] Performance impact has been considered and is acceptable Release Notes: - N/A	2026-03-19 19:04:43 +00:00
Ben Kunkle	52fb089258	ep: Track e2e latency (#51678 ) Closes #ISSUE Before you mark this PR as ready for review, make sure that you have: - [ ] Added a solid test coverage and/or screenshots from doing manual testing - [ ] Done a self-review taking into account security and performance aspects - [ ] Aligned any UI changes with the [UI checklist](https://github.com/zed-industries/zed/blob/main/CONTRIBUTING.md#uiux-checklist) Release Notes: - N/A or Added/Fixed/Improved ... Co-authored-by: Oleksiy <oleksiy@zed.dev>	2026-03-16 11:35:51 -04:00
Piotr Osiewicz	97421c670e	Remove unreferenced dev dependencies (#51093 ) This will help with test times (in some cases), as nextest cannot figure out whether a given rdep is actually an alive edge of the build graph Closes #ISSUE Before you mark this PR as ready for review, make sure that you have: - [ ] Added a solid test coverage and/or screenshots from doing manual testing - [ ] Done a self-review taking into account security and performance aspects - [ ] Aligned any UI changes with the [UI checklist](https://github.com/zed-industries/zed/blob/main/CONTRIBUTING.md#uiux-checklist) Release Notes: - N/A	2026-03-09 13:22:12 +01:00
Max Brunsfeld	9ff0b0206f	Include optional model version with EP acceptance and rejection messages (#50262 ) Release Notes: - N/A	2026-02-27 01:07:37 +00:00
Tom Houlé	6a749380aa	Add fast mode toggle in agent panel (#49714 ) This is a staff only toggle for now, since the consequences of activating it are not obvious and quite dire (tokens cost 6 times more). Also, persist thinking, thinking effort and fast mode in DbThread so the thinking mode toggle and thinking effort are persisted. Release Notes: - Agent: The thinking mode toggle and thinking effort are now persisted when selecting a thread from history.	2026-02-26 21:19:41 +01:00
Ben Kunkle	04db6c389c	zeta2: Use editable range returned by cloud for prediction diffs (#50029 ) Closes #ISSUE Before you mark this PR as ready for review, make sure that you have: - [ ] Added a solid test coverage and/or screenshots from doing manual testing - [ ] Done a self-review taking into account security and performance aspects - [ ] Aligned any UI changes with the [UI checklist](https://github.com/zed-industries/zed/blob/main/CONTRIBUTING.md#uiux-checklist) Release Notes: - N/A or Added/Fixed/Improved ... Co-authored-by: Max <max@zed.dev>	2026-02-24 18:04:57 -05:00
Tom Houlé	1702a05920	cloud_llm_client: Delete unused variants of CompletionRequestStatus (#49516 ) Small clean up commit. Co-authored-by: Marshall <marshall@zed.dev> Release Notes: - N/A	2026-02-18 14:30:40 -05:00
Tom Houlé	93ead966c2	cloud_llm_client: Add StreamEnded and Unknown variants to CompletionRequestStatus (#49121 ) Add StreamEnded variant so the client can distinguish between a stream that the cloud ran to completion versus one that was interrupted (see CLO-258). That logic is to be added in a follow up PR. Add an Unknown fallback with #[serde(other)] for forward-compatible deserialization of future variants. The client advertises support via a new x-zed-client-supports-stream-ended-request-completion-status header. The server will only send the new variant if that header is passed. Both StreamEnded and Unknown are silently ignored at the event mapping layer (from_completion_request_status returns Ok(None)). Part of CLO-264 and CLO-266; cloud-side changes to follow. Release Notes: - N/A --------- Co-authored-by: Marshall Bowers <git@maxdeviant.com>	2026-02-16 15:39:47 +01:00
Marshall Bowers	afafb66f76	agent: Highlight latest models available through the Zed provider (#48614 ) This PR updates the model selector to highlight the latest models that are available through the Zed provider: <img width="388" height="477" alt="Screenshot 2026-02-06 at 1 46 41 PM" src="https://github.com/user-attachments/assets/70760399-ecf6-46e3-80a7-cb998216c192" /> Closes CLO-205. Release Notes: - Added a "Latest" indicator to highlight the latest models available through the Zed provider.	2026-02-06 14:03:03 -05:00
Max Brunsfeld	c430681211	Do not pass zeta prompt format in production endpoint (#48541 ) This allows us to switch the prompt format without client-side changes. If we want to experiment with prompt formats or models other than the currently-deployed one, we can use the raw endpoint, and do prompt construction and output processing on the client. This also adds an optional environment parameter to the raw endpoint, so that we can use that endpoint in the new scheme where we're deploying to separate environments for different zeta prompt versions. Release Notes: - N/A	2026-02-06 00:31:24 -08:00
Marshall Bowers	9860106b8e	agent: Add support for setting thinking effort for Zed provider (#48545 ) This PR adds the ability to set the thinking effort of a model. Right now this only applies to Opus 4.6 through the Zed provider. This is gated behind the `cloud-thinking-toggle` feature flag. UI is still rough; needs a design pass: <img width="639" height="163" alt="Screenshot 2026-02-05 at 7 45 54 PM" src="https://github.com/user-attachments/assets/2b5a9ef8-74cd-498e-9c81-b92666572409" /> <img width="263" height="148" alt="Screenshot 2026-02-05 at 7 45 58 PM" src="https://github.com/user-attachments/assets/40232cb0-1743-443b-b04c-5cd33065513d" /> Release Notes: - N/A	2026-02-06 01:04:53 +00:00
Marshall Bowers	a2ca07514c	language_model: Add `supported_effort_levels` method to `LanguageModel` (#48523 ) This PR adds a new `supported_effort_levels` method to the `LanguageModel` trait. This can be used to retrieve the list of effort levels that the model supports, which will eventually be used to drive the UI for selecting the thinking effort. Right now this list will only be populated for Cloud models. Release Notes: - N/A	2026-02-05 22:20:08 +00:00
Ben Kunkle	4cb85917c5	Differentiate between explicit rejection and ignored in ep acceptance tracking (#48409 ) Closes #ISSUE Release Notes: - N/A or Added/Fixed/Improved ...	2026-02-04 17:54:11 -05:00
Marshall Bowers	4723dbe696	cloud_llm_client: Move `Plan` type into `cloud_api_types` (#47778 ) This PR moves the `Plan` type out of `cloud_llm_client` and into `cloud_api_types`. Release Notes: - N/A	2026-01-27 15:58:05 +00:00
Marshall Bowers	39b34f8f33	cloud_llm_client: Remove unused code (#47774 ) This PR removes some unused code around the plan types from the `cloud_llm_client`. Release Notes: - N/A	2026-01-27 15:24:58 +00:00
Max Brunsfeld	bdb84818ac	Send some traffic to zeta2 for testing (#47710 ) Release Notes: - N/A	2026-01-26 16:54:59 -08:00
Max Brunsfeld	2301c5f9f0	Send EP trigger as part of zeta2 prediction request (#47523 ) Release Notes: - N/A	2026-01-23 23:40:49 +00:00
Marshall Bowers	25904f691e	Add support for refreshing outdated LLM tokens (#47512 ) This PR adds support for refreshing LLM tokens that are "outdated"—that is, that are missing some required claims. Release Notes: - Fixed some instances of authentication errors with the Zed API that could be resolved automatically by refreshing the token.	2026-01-23 21:03:28 +00:00
Max Brunsfeld	780a87dd98	Introduce new predict_edits/v3 endpoint (#46960 ) Release Notes: - N/A	2026-01-16 02:16:34 +00:00
Marshall Bowers	a92df1eee4	Remove Burn Mode code (#46950 ) This PR removes the code for Burn Mode, as we won't need it anymore after the 17th. Closes CLO-79. Release Notes: - N/A	2026-01-15 21:28:33 +00:00
Marshall Bowers	6fcc5e9461	Remove legacy billing code (#46927 ) This PR removes the code for the legacy plans. No more users will be on this plan as of January 17th, so it's fine to land these changes now (as they won't be released until the 21st). Closes CLO-76. Release Notes: - N/A	2026-01-15 13:06:45 -05:00
Agus Zubiaga	79c69dc622	ep: Fix raw request shape (#46711 ) Release Notes: - N/A	2026-01-13 15:27:31 +00:00
Agus Zubiaga	90ec58836e	ep: Use FIM-like prompt for zeta2 (#46657 ) Release Notes: - N/A --------- Co-authored-by: Max Brunsfeld <maxbrunsfeld@gmail.com>	2026-01-13 00:16:24 +00:00
Agus Zubiaga	7853589033	ep: Use non-chat completions for /predict/raw (#46633 ) Release Notes: - N/A --------- Co-authored-by: Max Brunsfeld <maxbrunsfeld@gmail.com> Co-authored-by: Oleksiy Syvokon <oleksiy.syvokon@gmail.com>	2026-01-12 17:42:50 +00:00
Michael Benfield	56daba28d4	supports_streaming_tools member (#44753 ) Release Notes: - N/A	2025-12-13 00:56:06 +00:00
Max Brunsfeld	42583c1141	Reorganize edit prediction code and remove old experiments (#44187 ) Release Notes: - N/A --------- Co-authored-by: Agus Zubiaga <agus@zed.dev> Co-authored-by: Ben Kunkle <ben@zed.dev>	2025-12-04 15:56:57 -08:00
Agus Zubiaga	2db237aa52	Limit edit prediction reject batches to max (#43965 ) We currently attempt to flush all rejected predictions at once even if we have accumulated more than `MAX_EDIT_PREDICTION_REJECTIONS_PER_REQUEST`. Instead, we will now flush as many as possible, and then keep the rest for the next batch. Release Notes: - N/A	2025-12-02 13:22:16 -03:00
Agus Zubiaga	36a3b41f53	edit prediction: Request trigger (#43588 ) Adds a `trigger` field to the zeta1/zeta2 prediction requests so that we can distinguish between editor, diagnostic, and zeta-cli requests. Release Notes: - N/A	2025-11-26 20:34:29 +00:00
Agus Zubiaga	f89e5308e3	edit prediction: Report early-rejected predictions and fix cancel bug (#43585 ) Many prediction requests end up being rejected early without ever being set as the current prediction. Before this change, those cases weren’t reported as rejections because the `request_prediction_with_*` functions simply returned `Ok(None)`. With this update, whenever we get a successful response from the provider, we will return at least the `id`, allowing it to be properly reported. The request now also includes a “reject reason,” since the different variants carry distinct implications for prediction quality. All of these scenarios are now covered by tests. While adding them, I also found and fixed a bug where some cancelled predictions were incorrectly being set as the current one. Release Notes: - N/A --------- Co-authored-by: MrSubidubi <dev@bahn.sh>	2025-11-26 20:15:05 +00:00
Max Brunsfeld	9122dd2d70	Combine zeta and zeta2 edit prediction providers (#43284 ) We've realized that a lot of the logic within an `EditPredictionProvider` is not specific to a particular edit prediction model / service. Rather, it is just the generic state management required to perform edit predictions at all in Zed. We want to move to a setup where there's one "built-in" edit prediction provider in Zed, which can be pointed at different edit prediction models. The only logic that is different for different models is how we construct the prompt, send the request, and parse the output. This PR also changes the behavior of the staff-only `zeta2` feature flag so that it only gates your ability to use Zeta2, but you can still use your local settings file to choose between different edit prediction models/services: zeta1, zeta2, and sweep. This PR also makes zeta1's outcome reporting and prediction-rating features work with all prediction models, not just zeta1. To do: * [x] remove duplicated logic around sending cloud requests between zeta1 and zeta2 * [x] port the outcome reporting logic from zeta to zeta2. * [x] get the "rate completions" modal working with all EP models * [x] display edit prediction diff * [x] show edit history events * [x] remove the original `zeta` crate. Release Notes: - N/A --------- Co-authored-by: Agus Zubiaga <agus@zed.dev> Co-authored-by: Ben Kunkle <ben@zed.dev>	2025-11-24 22:17:48 -08:00
Oleksiy Syvokon	ea7568ceb3	zeta2: Support experimental 1120-seedcoder model (#43411 ) 1. Introduce a common `PromptFormatter` trait 2. Let models define their generation params. 3. Add support for the experimental 1120-seedcoder prompt format Release Notes: - N/A	2025-11-24 16:27:11 +00:00
Oleksiy Syvokon	b2f561165f	zeta2: Support qwen3-minimal prompt format (#42902 ) This prompt is for a fine-tuned model. It has the following changes, compared to `minimal`: - No instructions at all, except for one sentence at the beginning of the prompt. - Output is a simplified unified diff -- hunk headers have no line counts (e.g., `@@ -20 +20 @@`) - Qwen's FIM tokens are used where possible (`<|file_sep|>`, `<|fim_prefix|>`, `<|fim_suffix|>`, etc.) To evaluate this model: ``` ZED_ZETA2_MODEL=zeta2-exp [usual zeta-cli eval params ...] --prompt-format minimal-qwen ``` This will point to the most recent Baseten deployment of zeta2-exp (which may change in the future, so the prompt-format may get out of sync). Release Notes: - N/A	2025-11-17 20:36:05 +02:00
Oleksiy Syvokon	723f9b1371	zeta2: Add minimal prompt for fine-tuned models (#42691 ) 1. Add `--prompt-format=minimal` that matches single-sentence instructions used in fine-tuned models (specifically, in `1028-` and `1029-` models) 2. Use separate configs for agentic context search model and edit prediction model. This is useful when running a fine-tuned EP model, but we still want to run vanilla model for context retrieval. 3. `zeta2-exp` is a symlink to the same-named Baseten deployment. This model can be redeployed and updated without having to update the deployment id. 4. Print scores as a compact table Release Notes: - N/A --------- Co-authored-by: Piotr Osiewicz <piotr@zed.dev>	2025-11-14 13:08:54 +00:00
Max Brunsfeld	c9e231043a	Report discarded zeta predictions and indicate whether they were shown (#42403 ) Release Notes: - N/A --------- Co-authored-by: Michael Sloan <mgsloan@gmail.com> Co-authored-by: Ben Kunkle <ben@zed.dev> Co-authored-by: Agus Zubiaga <agus@zed.dev>	2025-11-12 16:41:04 -08:00
Max Brunsfeld	b607077c08	Add old_text/new_text as a zeta2 prompt format (#42171 ) Release Notes: - N/A --------- Co-authored-by: Agus Zubiaga <agus@zed.dev> Co-authored-by: Oleksiy Syvokon <oleksiy.syvokon@gmail.com> Co-authored-by: Ben Kunkle <ben@zed.dev> Co-authored-by: Michael Sloan <mgsloan@gmail.com>	2025-11-10 15:44:54 -07:00
Max Brunsfeld	b8081ad7a6	Make it easy to point zeta2 at ollama (#42329 ) I wanted to be able to work offline, so I made it a little bit more convenient to point zeta2 at ollama. * For zeta2, don't require that request ids be UUIDs * Add an env var `ZED_ZETA2_OLLAMA` that sets the edit prediction URL and model id to work w/ ollama. Release Notes: - N/A	2025-11-09 21:10:36 -08:00
Max Brunsfeld	784fdcaee3	zeta2: Build edit prediction prompt and process model output in client (#41870 ) Release Notes: - N/A --------- Co-authored-by: Agus Zubiaga <agus@zed.dev> Co-authored-by: Ben Kunkle <ben@zed.dev> Co-authored-by: Piotr Osiewicz <24362066+osiewicz@users.noreply.github.com>	2025-11-06 18:36:58 -05:00
Oleksiy Syvokon	91d631c229	Evaluate zeta2 context retrieval and edit predictions (#41921 ) This PR implements the `zeta-cli eval` command. It will: - Run the edit prediction model if there are no cached results - Compute precision/recall/F1 for context retrieval at the line level: every retrieved line of context is counted as a true positive (correct retrieval), false positive (retrieved something that was not expected), or false negative (didn't retrieve an expected line) - Compute similar metrics for edit predictions - Pretty-print results, highlighting the difference between actual and expected when printing to tty Other changes: - `zeta-cli predict` accepts a `--format` argument with options `md`, `json`, `diff` - Code restructure Release Notes: - N/A --------- Co-authored-by: Piotr Osiewicz <24362066+osiewicz@users.noreply.github.com> Co-authored-by: Agus Zubiaga <agus@zed.dev>	2025-11-04 17:36:50 +00:00
Max Brunsfeld	1631cec15a	Add zeta-cli subcommand for running zeta2 predictions (#41722 ) This PR adds a `zeta zeta2 predict` subcommand that takes an edit prediction example markdown file as an argument, and performs zeta2's prediction, showing the retrieved context and the predicted edit. * [x] Apply uncommitted diff to get repo into the right state. * [x] Apply edits in edit history * [x] Display predicted edits as unified diff, regardless of model output format Release Notes: - N/A --------- Co-authored-by: Agus Zubiaga <agus@zed.dev> Co-authored-by: Piotr Osiewicz <24362066+osiewicz@users.noreply.github.com> Co-authored-by: Ben Kunkle <ben.kunkle@gmail.com>	2025-11-03 15:12:08 -08:00
Agus Zubiaga	ee80ba6693	zeta2: LLM-based context gathering (#41326 ) Release Notes: - N/A --------- Co-authored-by: Max Brunsfeld <maxbrunsfeld@gmail.com> Co-authored-by: Max Brunsfeld <max@zed.dev>	2025-10-27 22:54:42 +00:00
Julia Ryan	ef5b8c6fed	Remove workspace-hack (#40216 ) We've been considering removing workspace-hack for a couple reasons: - Lukas ran into a situation where its build script seemed to be causing spurious rebuilds. This seems more likely to be a cargo bug than an issue with workspace-hack itself (given that it has an empty build script), but we don't necessarily want to take the time to hunt that down right now. - Marshall mentioned hakari interacts poorly with automated crate updates (in our case provided by renovate) because you'd need to have `cargo hakari generate && cargo hakari manage-deps` after their changes and we prefer to not have actions that make commits. Currently removing workspace-hack causes our workspace to grow from ~1700 to ~2000 crates being built (depending on platform), which is mainly a problem when you're building the whole workspace or running tests across the normal and remote binaries (which is where feature-unification nets us the most sharing). It doesn't impact incremental times noticeably when you're just iterating on `-p zed`, and we'll hopefully get these savings back in the future when rust-lang/cargo#14774 (which re-implements the functionality of hakari) is finished. Release Notes: - N/A	2025-10-17 18:58:14 +00:00
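
Note on 2db237aa52 (#43965): the batching behavior described in that commit message (flush as many rejections as the cap allows, keep the rest for the next batch) could be sketched roughly as below. This is an illustration only, not the code from the PR. The function name `take_next_batch` and the cap value of 3 are hypothetical; the commit message does not state the real value of `MAX_EDIT_PREDICTION_REJECTIONS_PER_REQUEST`.

```rust
/// Hypothetical cap, chosen only for illustration. The real value of
/// `MAX_EDIT_PREDICTION_REJECTIONS_PER_REQUEST` is not given in the commit message.
const MAX_REJECTIONS_PER_REQUEST: usize = 3;

/// Removes up to `MAX_REJECTIONS_PER_REQUEST` items from the front of the
/// pending queue and returns them as the next batch to send.
/// Anything beyond the cap stays in `pending` for a later flush.
fn take_next_batch<T>(pending: &mut Vec<T>) -> Vec<T> {
    let count = pending.len().min(MAX_REJECTIONS_PER_REQUEST);
    pending.drain(..count).collect()
}
```

Calling `take_next_batch` repeatedly on a queue of five items would yield a batch of three, then a batch of two, then an empty batch, so no single request exceeds the cap and no rejection is dropped.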
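
Note on 91d631c229 (#41921): the line-level precision/recall/F1 computation described for `zeta-cli eval` could look something like the sketch below. It assumes retrieved and expected context are each represented as a set of line numbers; that representation, the `Scores` type, and `score_lines` are hypothetical names for illustration and are not taken from the PR.

```rust
use std::collections::HashSet;

/// Line-level counts: a retrieved line that was expected is a true positive,
/// a retrieved line that was not expected is a false positive, and an
/// expected line that was not retrieved is a false negative.
#[derive(Debug, PartialEq)]
struct Scores {
    true_positives: usize,
    false_positives: usize,
    false_negatives: usize,
}

impl Scores {
    /// Fraction of retrieved lines that were expected (0.0 if nothing was retrieved).
    fn precision(&self) -> f64 {
        let retrieved = self.true_positives + self.false_positives;
        if retrieved == 0 { 0.0 } else { self.true_positives as f64 / retrieved as f64 }
    }

    /// Fraction of expected lines that were retrieved (0.0 if nothing was expected).
    fn recall(&self) -> f64 {
        let expected = self.true_positives + self.false_negatives;
        if expected == 0 { 0.0 } else { self.true_positives as f64 / expected as f64 }
    }

    /// Harmonic mean of precision and recall (0.0 if both are zero).
    fn f1(&self) -> f64 {
        let (p, r) = (self.precision(), self.recall());
        if p + r == 0.0 { 0.0 } else { 2.0 * p * r / (p + r) }
    }
}

/// Compares the expected line numbers against the actually retrieved ones.
fn score_lines(expected: &HashSet<u32>, actual: &HashSet<u32>) -> Scores {
    let true_positives = expected.intersection(actual).count();
    Scores {
        true_positives,
        false_positives: actual.len() - true_positives,
        false_negatives: expected.len() - true_positives,
    }
}
```

For example, with expected lines {1, 2, 3, 4} and retrieved lines {3, 4, 5}, this gives 2 true positives, 1 false positive, and 2 false negatives, so precision is 2/3 and recall is 1/2.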

78 commits