vrr/zed - VRR Forge

vrr/zed

mirror of https://github.com/zed-industries/zed.git synced 2026-05-25 06:24:56 +00:00

Author	SHA1	Message	Date
Ben Brandt	b13849d04f	google: Add Google thinking level support (#57358 ) Some checks are pending Congratsbot / check-author (push) Waiting to run Details Congratsbot / congrats (push) Blocked by required conditions Details deploy_nightly_docs / deploy_docs (push) Waiting to run Details run_tests / run_tests_linux (push) Blocked by required conditions Details run_tests / orchestrate (push) Waiting to run Details run_tests / check_style (push) Waiting to run Details run_tests / clippy_windows (push) Blocked by required conditions Details run_tests / clippy_linux (push) Blocked by required conditions Details run_tests / clippy_mac (push) Blocked by required conditions Details run_tests / clippy_mac_x86_64 (push) Blocked by required conditions Details run_tests / run_tests_windows (push) Blocked by required conditions Details run_tests / run_tests_mac (push) Blocked by required conditions Details run_tests / miri_scheduler (push) Blocked by required conditions Details run_tests / build_visual_tests_binary (push) Blocked by required conditions Details run_tests / check_wasm (push) Blocked by required conditions Details run_tests / check_dependencies (push) Blocked by required conditions Details run_tests / check_docs (push) Blocked by required conditions Details run_tests / doctests (push) Blocked by required conditions Details run_tests / check_workspace_binaries (push) Blocked by required conditions Details run_tests / check_licenses (push) Blocked by required conditions Details run_tests / check_scripts (push) Blocked by required conditions Details run_tests / check_postgres_and_protobuf_migrations (push) Blocked by required conditions Details run_tests / extension_tests (push) Blocked by required conditions Details run_tests / tests_pass (push) Blocked by required conditions Details Also makes sure we are properly catching and processing thinking events. Self-Review Checklist: - [x] I've reviewed my own diff for quality, security, and reliability - [x] Unsafe blocks (if any) have justifying comments - [x] The content is consistent with the [UI/UX checklist](https://github.com/zed-industries/zed/blob/main/CONTRIBUTING.md#uiux-checklist) - [x] Tests cover the new/changed behavior - [x] Performance impact has been considered and is acceptable Release Notes: - google: Support thinking levels for Google models.	2026-05-21 10:00:29 +00:00
Tom Houlé	4558d14cf8	google_ai: Support Gemini 3.5 Flash (#57299 ) Implements the [official upgrade instructions](https://ai.google.dev/gemini-api/docs/whats-new-gemini-3.5#migrate-from-3-flash-preview) for Gemini 3.5 Flash, and adds BYOK support. The changes about thinking_level and temperature apply to our situation, but they are only recommendations, and we have to support older models, so I preferred not trying to force the preferred / remove the discouraged parameters for now. `temperature` becomes optional - we don't fill in a default anymore, since passing it is now discouraged. This commit also adds support for `thinking_level`, since it is now preferred to `thinking_budget`. `FunctionCall` and `FunctionResponse` now support passing an `id` to properly maintain chain-of-thought preservation and match execution IDs across turns. When resolving incoming tool uses, the mapper prefers the execution ID returned by Gemini, falling back to sequential naming in other scenarios. Release Notes: - Added support for Gemini 3.5 Flash in the Google AI model provider.	2026-05-21 07:37:59 +00:00
Collin	b930851b64	Add Gemini 3.1 Flash Lite (#56248 ) Self-Review Checklist: - [X] I've reviewed my own diff for quality, security, and reliability - [X] Unsafe blocks (if any) have justifying comments - [X] The content is consistent with the [UI/UX checklist](https://github.com/zed-industries/zed/blob/main/CONTRIBUTING.md#uiux-checklist) - [X] Tests cover the new/changed behavior - [X] Performance impact has been considered and is acceptable Closes (none) Release Notes: - Google: Added Gemini 3.1 Flash Lite --------- Co-authored-by: Ben Brandt <benjamin.j.brandt@gmail.com> Co-authored-by: Smit Barmase <heysmitbarmase@gmail.com>	2026-05-12 08:55:22 +00:00
Bennet Bo Fenner	bf3fc2336d	agent: Allow tools to output multiple content parts (#54518 ) Self-Review Checklist: - [x] I've reviewed my own diff for quality, security, and reliability - [x] Unsafe blocks (if any) have justifying comments - [x] The content is consistent with the [UI/UX checklist](https://github.com/zed-industries/zed/blob/main/CONTRIBUTING.md#uiux-checklist) - [x] Tests cover the new/changed behavior - [x] Performance impact has been considered and is acceptable Closes #ISSUE Release Notes: - N/A	2026-04-27 12:36:11 +00:00
Ben Brandt	2eafa6e6aa	language_models: Remove unused language model token counting (#54177 ) Drop the `count_tokens` API and related implementations across providers, and remove the unused `tiktoken-rs` dependency. I was going to update the dependency becuase they finally released a fix we needed. But then I realized we only used this api in one place, the Rules library. And for most models it would have been wildly incorrect becuase we use tiktoken, i.e. OpenAI tokenizers, for almost every model, which is going to give incorrect results. Given that, I just removed these because the difference in how we get these has caused plenty of confusion in the past. Self-Review Checklist: - [x] I've reviewed my own diff for quality, security, and reliability - [x] Unsafe blocks (if any) have justifying comments - [x] The content is consistent with the [UI/UX checklist](https://github.com/zed-industries/zed/blob/main/CONTRIBUTING.md#uiux-checklist) - [x] Tests cover the new/changed behavior - [x] Performance impact has been considered and is acceptable Release Notes: - N/A	2026-04-22 13:39:48 +00:00
Danilo Leal	399d3d267e	docs: Update mentions to "assistant panel" (#53514 ) We don't use this terminology anymore; now it's "agent panel". Release Notes: - N/A	2026-04-09 10:42:21 -03:00
Agus Zubiaga	98c17ca160	language_models: Refactor deps and extract cloud (#53270 ) - `language_model` no longer depends on provider-specific crates such as `anthropic` and `open_ai` (inverted dependency) - `language_model_core` was extracted from `language_model` which contains the types for the provider-specific crates to convert to/from. - `gpui::SharedString` has been extracted into its own crate (still exposed by `gpui`), so `language_model_core` and provider API crates don't have to depend on `gpui`. - Removes some unnecessary `&'static str` \| `SharedString` -> `String` -> `SharedString` conversions across the codebase. - Extracts the core logic of the cloud `LanguageModelProvider` into its own crate with simpler dependencies. Release Notes: - N/A --------- Co-authored-by: John Tur <john-tur@outlook.com>	2026-04-07 12:28:19 -03:00
Richard Feldman	866ec42371	Remove deprecated Gemini 3 Pro Preview (#50503 ) Gemini 3 Pro Preview has been deprecated in favor of Gemini 3.1 Pro. This removes the `Gemini3Pro` variant from the `Model` enum and all associated match arms, updates eval model lists, docs, and test fixtures. A serde alias (`"gemini-3-pro-preview"`) is kept on `Gemini31Pro` so existing user settings gracefully migrate to the replacement model. Closes AI-66 Release Notes: - Removed deprecated Gemini 3 Pro Preview model; existing configurations automatically migrate to Gemini 3.1 Pro.	2026-03-04 15:53:26 +00:00
Richard Feldman	17abde72b0	Add gemini-3.1-pro-preview model (#49622 ) Closes AI-48 Release Notes: - Added support for Gemini 3.1 Pro	2026-02-19 13:19:14 -05:00
Xiaobo Liu	225a2a8a20	google_ai: Refactor token count methods in Google AI (#45184 ) The change simplifies the `max_token_count` and `max_output_tokens` methods by grouping Gemini models with identical token limits. Release Notes: - N/A	2025-12-17 20:12:40 -06:00
Richard Feldman	27c5d39d28	Add Gemini 3 Flash (#45139 ) Add support for the new Gemini 3 Flash model Release Notes: - Added support for Gemini 3 Flash model	2025-12-17 18:56:15 +00:00
Junseong Park	6404939427	google_ai: Update Gemini models (#43117 ) Closes #43040 Release Notes: - Remove the end-of-support Gemini 1.5 model from the options. - Remove the older Gemini 2.0 model from the options. - Please let me know if you think it's better to keep it, as it is still a usable model. - Update the incorrect amounts for some input/output tokens. - Update the default model to Gemini 2.5 Flash-Lite. - Rename variant `Gemini3ProPreview` to `Gemini3Pro` When this PR is merged, users will be able to select the following Gemini models. - 2.5 Flash - 2.5 Flash-Lite - 2.5 Pro - 3 Pro	2025-11-28 15:48:33 -05:00
Mikayla Maki	19d2532cf8	Update google_ai.rs (#43034 ) Release Notes: - N/A	2025-11-19 05:41:24 +00:00
Martin Bergo	7c0663b825	google_ai: Add gemini-3-pro-preview model (#43015 ) Release Notes: - Added the newly released Gemini 3 Pro Preview Model https://docs.cloud.google.com/vertex-ai/generative-ai/docs/models/gemini/3-pro	2025-11-18 23:51:32 +00:00
Richard Feldman	c0fadae881	Thought signatures (#42915 ) Implement Gemini API's [thought signatures](https://ai.google.dev/gemini-api/docs/thinking#signatures) Release Notes: - Added thought signatures for Gemini tool calls	2025-11-18 10:41:19 -05:00
Julia Ryan	ef5b8c6fed	Remove workspace-hack (#40216 ) We've been considering removing workspace-hack for a couple reasons: - Lukas ran into a situation where its build script seemed to be causing spurious rebuilds. This seems more likely to be a cargo bug than an issue with workspace-hack itself (given that it has an empty build script), but we don't necessarily want to take the time to hunt that down right now. - Marshall mentioned hakari interacts poorly with automated crate updates (in our case provided by rennovate) because you'd need to have `cargo hakari generate && cargo hakari manage-deps` after their changes and we prefer to not have actions that make commits. Currently removing workspace-hack causes our workspace to grow from ~1700 to ~2000 crates being built (depending on platform), which is mainly a problem when you're building the whole workspace or running tests across the the normal and remote binaries (which is where feature-unification nets us the most sharing). It doesn't impact incremental times noticeably when you're just iterating on `-p zed`, and we'll hopefully get these savings back in the future when rust-lang/cargo#14774 (which re-implements the functionality of hakari) is finished. Release Notes: - N/A	2025-10-17 18:58:14 +00:00
Conrad Irwin	fcdab160f9	Settings refactor (#38367 ) Co-Authored-By: Ben K <ben@zed.dev> Co-Authored-By: Anthony <anthony@zed.dev> Co-Authored-By: Mikayla <mikayla@zed.dev> Release Notes: - settings: Major internal changes to settings. The primary user-facing effect is that some settings which did not make sense in project settings files are no-longer read from there. (For example the inline blame settings) --------- Co-authored-by: Ben Kunkle <ben@zed.dev> Co-authored-by: Mikayla Maki <mikayla.c.maki@gmail.com> Co-authored-by: Anthony <anthony@zed.dev>	2025-09-18 16:47:23 +00:00
Antonio Scandurra	39d86eeb7f	Trim API key when submitting requests to LLM providers (#37082 ) This prevents the common footgun of copy/pasting an API key starting/ending with extra newlines, which would lead to a "bad request" error. Closes #37038 Release Notes: - agent: Support pasting language model API keys that contain newlines.	2025-08-28 12:00:44 +00:00
Piotr Osiewicz	05fc0c432c	Fix a bunch of other low-hanging style lints (#36498 ) - Fix a bunch of low hanging style lints like unnecessary-return - Fix single worktree violation - And the rest Release Notes: - N/A	2025-08-19 21:26:17 +02:00
Bennet Bo Fenner	6b6eb11643	agent2: Fix tool schemas for Gemini (#36507 ) Release Notes: - N/A --------- Co-authored-by: Agus Zubiaga <agus@zed.dev>	2025-08-19 18:06:09 +00:00
Piotr Osiewicz	8f567383e4	Auto-fix clippy::collapsible_if violations (#36428 ) Release Notes: - N/A	2025-08-19 13:27:24 +00:00
Ben Brandt	0191f16ebc	Update Gemini Models (#32902 ) Updates google_ai to use latest model information from the respective model cards: https://ai.google.dev/gemini-api/docs/models Release Notes: - google: Update to latest Gemini 2.5 models	2025-06-17 20:26:27 +00:00
Richard Feldman	5405c2c2d3	Standardize on u64 for token counts (#32869 ) Previously we were using a mix of `u32` and `usize`, e.g. `max_tokens: usize, max_output_tokens: Option<u32>` in the same `struct`. Although [tiktoken](https://github.com/openai/tiktoken) uses `usize`, token counts should be consistent across targets (e.g. the same model doesn't suddenly get a smaller context window if you're compiling for wasm32), and these token counts could end up getting serialized using a binary protocol, so `usize` is not the right choice for token counts. I chose to standardize on `u64` over `u32` because we don't store many of them (so the extra size should be insignificant) and future models may exceed `u32::MAX` tokens. Release Notes: - N/A	2025-06-17 10:43:07 -04:00
Oleksiy Syvokon	04cd3fcd23	google: Add latest versions of Gemini 2.5 Pro and Flash Preview (#32183 ) Release Notes: - Added the latest versions of Gemini 2.5 Pro and Flash Preview	2025-06-05 19:30:34 +00:00
90aca	cf931247d0	Add thinking budget for Gemini custom models (#31251 ) Closes #31243 As described in my issue, the [thinking budget](https://ai.google.dev/gemini-api/docs/thinking) gets automatically chosen by Gemini unless it is specifically set to something. In order to have fast responses (inline assistant) I prefer to set it to 0. Release Notes: - ai: Added `thinking` mode for custom Google models with configurable token budget --------- Co-authored-by: Ben Brandt <benjamin.j.brandt@gmail.com>	2025-06-03 13:40:20 +02:00
Fernando Freire	3077abf9cf	google_ai: Parse thought parts in Gemini responses (#31925 ) Fixes thinking Gemini models. Closes #31902 Release Notes: - Updated Google Gemini client to match the latest API	2025-06-03 10:37:06 +00:00
Ben Brandt	119beb210a	Update default models to newer versions (#31415 ) Follow up to: https://github.com/zed-industries/zed/pull/31209 Changes default models across multiple providers: - Zed.dev Default Models in settings: claude-3-7-sonnet-latest → claude-4-sonnet-latest - Bedrock Default Model: Claude 3.5 Sonnet v2 → Claude Sonnet 4 - Google AI Default Fast Model: Gemini 1.5 Flash → Gemini 2.0 Flash Release Notes: - N/A	2025-05-27 10:54:42 +02:00
Kirill Bulatov	16366cf9f2	Use `anyhow` more idiomatically (#31052 ) https://github.com/zed-industries/zed/issues/30972 brought up another case where our context is not enough to track the actual source of the issue: we get a general top-level error without inner error. The reason for this was `.ok_or_else(\|\| anyhow!("failed to read HEAD SHA"))?; ` on the top level. The PR finally reworks the way we use anyhow to reduce such issues (or at least make it simpler to bubble them up later in a fix). On top of that, uses a few more anyhow methods for better readability. * `.ok_or_else(\|\| anyhow!("..."))`, `map_err` and other similar error conversion/option reporting cases are replaced with `context` and `with_context` calls * in addition to that, various `anyhow!("failed to do ...")` are stripped with `.context("Doing ...")` messages instead to remove the parasitic `failed to` text * `anyhow::ensure!` is used instead of `if ... { return Err(...); }` calls * `anyhow::bail!` is used instead of `return Err(anyhow!(...));` Release Notes: - N/A	2025-05-20 23:06:07 +00:00
Michael Sloan	76ad1a29a5	Add support for getting the token count for all parts of Gemini generation requests (#29630 ) * `CountTokensRequest` now takes a full `GenerateContentRequest` instead of just content. * Fixes use of `models/` prefix in `model` field of `GenerateContentRequest`, since that's required for use in `CountTokensRequest`. This didn't cause issues before because it was always cleared and used in the path. Release Notes: - N/A	2025-05-04 21:32:45 +00:00
Michael Sloan	edf78e770d	Fix token counting requests in Gemini (#29643 ) Release Notes: - N/A	2025-04-30 04:55:07 +00:00
Michael Sloan	b4732235e3	Skip serializing `None` fields in Gemini API (#29632 ) Release Notes: - N/A	2025-04-29 19:03:01 -06:00
Michael Sloan	2beefc8158	Fix gemini model token limits (#29584 ) Release Notes: - N/A	2025-04-29 03:12:59 +00:00
Antonio Scandurra	3fdbc3090d	Fix error when deserializing Gemini streams (#29470 ) Sometimes Gemini would report `Content` without a `parts` field. Release Notes: - Fixed a bug that would sometimes cause Gemini models to fail streaming their response.	2025-04-26 11:51:04 +00:00
Bennet Bo Fenner	cd365b0cf5	gemini: Fix issue when deserializing tool call (#29363 ) Fixes a regression introduced in #29322 Release Notes: - N/A Co-authored-by: Agus Zubiaga <hi@aguz.me>	2025-04-24 18:19:05 +00:00
Marshall Bowers	f527df6fa1	google_ai: Remove list of supported countries (#29348 ) This PR removes the list of supported countries from the `google_ai` crate, as it is no longer referenced in this repo. Release Notes: - N/A	2025-04-24 15:04:45 +00:00
Nathan Sobo	8836c6fb42	Introduce LanguageModelToolUse::raw_input (#29322 ) This is to enable alternative streaming solutions at the application layer. I'm not sure we really should have performed parsing of the input at this layer. Either way I want to experiment with streaming approaches in a separate crate on a branch, and this will help. /cc @maxdeviant @bennetbo @rtfeldman Closes #ISSUE Release Notes: - N/A	2025-04-24 02:30:48 +00:00
Stephan Seidt	10ded0ab75	agent: Add support for google gemini 2.5 flash preview (#29205 ) Adds support for the new gemini-2.5-flash-preview-04-17 Release Notes: - agent: Added support for gemini-2.5-flash-preview	2025-04-22 09:37:12 +00:00
Michael Sloan	fbf7caf93e	Default to fast model for thread summaries and titles + don't include system prompt / context / thinking segments (#29102 ) * Adds a fast / cheaper model to providers and defaults thread summarization to this model. Initial motivation for this was that https://github.com/zed-industries/zed/pull/29099 would cause these requests to fail when used with a thinking model. It doesn't seem correct to use a thinking model for summarization. * Skips system prompt, context, and thinking segments. * If tool use is happening, allows 2 tool uses + one more agent response before summarizing. Downside of this is that there was potential for some prefix cache reuse before, especially for title summarization (thread summarization omitted tool results and so would not share a prefix for those). This seems fine as these requests should typically be fairly small. Even for full thread summarization, skipping all tool use / context should greatly reduce the token use. Release Notes: - N/A	2025-04-19 23:26:29 +00:00
Bennet Bo Fenner	ae47829fa8	agent: Fix system instructions typo (#28949 ) See #28793, the name of the field is actually `systemInstruction` not `systemInstructions`. Release Notes: - Fixed an issue where Gemini requests would fail	2025-04-17 08:51:05 +00:00
Bennet Bo Fenner	c7e80c80c6	gemini: Pass system prompt as system instructions (#28793 ) https://ai.google.dev/gemini-api/docs/text-generation#system-instructions Release Notes: - agent: Improve performance of Gemini models	2025-04-15 19:45:47 +02:00
Marshall Bowers	a8b1ef3531	google_ai: Remove unused `extract_text_from_events` function (#28723 ) This PR removes the `extract_text_from_events` function from `google_ai`, as it was not used anywhere. Release Notes: - N/A	2025-04-14 22:01:21 +00:00
Bennet Bo Fenner	97abf21a28	agent: Add support for Google Gemini 2.5 preview (#28326 ) Adds support for the new `gemini-2.5-pro-preview-03-25` Release Notes: - Added support for `gemini-2.5-pro-preview-03-25` in the assistant	2025-04-08 15:00:23 +00:00
Julia Ryan	01ec6e0f77	Add workspace-hack (#27277 ) This adds a "workspace-hack" crate, see [mozilla's](https://hg.mozilla.org/mozilla-central/file/3a265fdc9f33e5946f0ca0a04af73acd7e6d1a39/build/workspace-hack/Cargo.toml#l7) for a concise explanation of why this is useful. For us in practice this means that if I were to run all the tests (`cargo nextest r --workspace`) and then `cargo r`, all the deps from the previous cargo command will be reused. Before this PR it would rebuild many deps due to resolving different sets of features for them. For me this frequently caused long rebuilds when things "should" already be cached. To avoid manually maintaining our workspace-hack crate, we will use [cargo hakari](https://docs.rs/cargo-hakari) to update the build files when there's a necessary change. I've added a step to CI that checks whether the workspace-hack crate is up to date, and instructs you to re-run `script/update-workspace-hack` when it fails. Finally, to make sure that people can still depend on crates in our workspace without pulling in all the workspace deps, we use a `[patch]` section following [hakari's instructions](https://docs.rs/cargo-hakari/0.9.36/cargo_hakari/patch_directive/index.html) One possible followup task would be making guppy use our `rust-toolchain.toml` instead of having to duplicate that list in its config, I opened an issue for that upstream: guppy-rs/guppy#481. TODO: - [x] Fix the extension test failure - [x] Ensure the dev dependencies aren't being unified by Hakari into the main dependencies - [x] Ensure that the remote-server binary continues to not depend on LibSSL Release Notes: - N/A --------- Co-authored-by: Mikayla <mikayla@zed.dev> Co-authored-by: Mikayla Maki <mikayla.c.maki@gmail.com>	2025-04-02 13:26:34 -07:00
Piotr Osiewicz	dc64ec9cc8	chore: Bump Rust edition to 2024 (#27800 ) Follow-up to https://github.com/zed-industries/zed/pull/27791 Release Notes: - N/A	2025-03-31 20:55:27 +02:00
Bennet Bo Fenner	c8a9a74e6a	Add tool calling support for Gemini models (#27772 ) Release Notes: - N/A	2025-03-31 17:46:42 +02:00
Michael Sloan	7376c6f377	Add support for Gemini 2.5 Pro Experimental model (#27468 ) Release Notes: - Added support for Gemini 2.5 Pro Experimental model to Zed AI. Co-authored-by: Wilhelm Klopp <wil.klopp@gmail.com>	2025-03-26 00:12:10 +00:00
Antonio Scandurra	f517050548	Partially fix assistant onboarding (#25313 ) While investigating #24896, I noticed two issues: 1. The default configuration for the `zed.dev` provider was using the wrong string for Claude 3.5 Sonnet. This meant the provider would always result as not configured until the user selected it from the model picker, because we couldn't deserialize that string to a valid `anthropic::Model` enum variant. 2. When clicking on `Open New Chat`/`Start New Thread` in the provider configuration, we would select `Claude 3.5 Haiku` by default instead of Claude 3.5 Sonnet. Release Notes: - Fixed some issues that caused AI providers to sometimes be misconfigured.	2025-02-24 07:29:55 +00:00
IaVashik	8114d17cba	google_ai: Add support for Gemini 2.0 models (#24448 ) Add support for the newly released Gemini 2.0 models from Google announced this new family of models earlier this week (2025-02-05). Release Notes: - Added support for Google's new Gemini 2.0 models.	2025-02-07 11:18:18 -05:00
João Marcos	5bd7eaa173	Solve 50+ `cargo doc` warnings (#24071 ) Release Notes: - N/A	2025-02-01 06:19:29 +00:00
Piotr Osiewicz	c9534e8025	chore: Use workspace fields for edition and publish (#23291 ) This prepares us for an upcoming bump to Rust 2024 edition. Release Notes: - N/A	2025-01-17 17:39:22 +01:00

1 2

66 commits