vrr/zed - VRR Forge

vrr/zed

mirror of https://github.com/zed-industries/zed.git synced 2026-05-30 03:34:30 +00:00

Author	SHA1	Message	Date
Neel	175707f95c	open_ai: Support reasoning summaries in OpenAI Responses API (#50959 ) Related to AI-79. Release Notes: - N/A	2026-03-09 13:51:22 +00:00
Richard Feldman	3b3ffc022e	Add GPT-5.4 and GPT-5.4-pro BYOK models (#50858 ) Add GPT-5.4 and GPT-5.4-pro as Bring Your Own Key model options for the OpenAI provider. GPT-5.4 (`gpt-5.4`): - 1,050,000 token context window, 128K max output - Supports chat completions, images, parallel tool calls - Default reasoning effort: none GPT-5.4-pro (`gpt-5.4-pro`): - 1,050,000 token context window, 128K max output - Responses API only (no chat completions) - Default reasoning effort: medium (supports medium/high/xhigh) Also fixes context window sizes for GPT-5 mini and GPT-5 nano (272K → 400K) to match current OpenAI docs. Closes AI-78 Release Notes: - Added GPT-5.4 and GPT-5.4-pro as available models when using your own OpenAI API key.	2026-03-05 23:40:03 -05:00
Richard Feldman	a18b7727ee	Add GPT-5.3-Codex BYOK model under the OpenAI provider (#50122 ) Adds `gpt-5.3-codex` as a built-in model under the OpenAI provider for BYOK usage. Model specs: - 400,000 context window - 128,000 max output tokens - Reasoning token support (default medium effort) - Uses the Responses API (like other codex models) - Token counting falls back to the gpt-5 tokenizer Closes AI-59 Release Notes: - Added support for GPT-5.3-Codex as a bring-your-own-key model in the OpenAI provider.	2026-02-25 16:29:01 -05:00
Richard Feldman	0b8424a14c	Remove deprecated GPT-4o, GPT-4.1, GPT-4.1-mini, and o4-mini (#49082 ) Remove GPT-4o, GPT-4.1, GPT-4.1-mini, and o4-mini from BYOK model options in Zed before OpenAI retires these models. These models are being retired by OpenAI (ChatGPT workspace support ends April 3, 2026), so they have been removed from the available models list in Zed's BYOK provider. Closes AI-4 Release Notes: - Removed deprecated GPT-4o, GPT-4.1, GPT-4.1-mini, and o4-mini models from OpenAI BYOK provider	2026-02-13 04:54:22 +00:00
Oleksiy Syvokon	757ee0571e	ep: Use rejected_output for DPO training + OpenAI support (#47697 ) Release Notes: - N/A --------- Co-authored-by: Zed Zippy <234243425+zed-zippy[bot]@users.noreply.github.com>	2026-01-27 13:02:40 +00:00
Aero	7bd3075d53	open_ai: Support reasoning content (#43662 ) Support for Kimi K2 Thinking Release Notes: - Added support for thinking traces when using OpenAI-API-compatible AI providers --------- Co-authored-by: Bennet Bo Fenner <bennet@zed.dev>	2026-01-21 10:08:59 +00:00
Richard Feldman	e5706f2349	Add BYOK GPT-5.2-codex support (#47025 ) <img width="449" height="559" alt="Screenshot 2026-01-16 at 4 52 12 PM" src="https://github.com/user-attachments/assets/1b5583d7-9b90-46b1-a32f-9821543ea542" /> Release Notes: - Add support for GPT-5.2-Codex via OpenAI API Key	2026-01-16 17:09:08 -05:00
Marshall Bowers	c6a38f2cfb	open_ai: Use proper type for Responses API `input` (#46526 ) This PR makes it so we use a proper type for the Responses API `input` rather than a `serde_json::Value`. It should have never used `serde_json::Value` to begin with. Release Notes: - N/A	2026-01-10 17:40:20 +00:00
Marshall Bowers	30f776e47f	open_ai: Move `responses` module to its own file (#46450 ) This PR moves the `responses` module to its own module in the `open_ai` crate. Release Notes: - N/A	2026-01-09 14:29:08 +00:00
Matt Stallone	84017bca89	Add OpenAI Responses API support with chat_completions capability flag (#39989 ) Add support for OpenAI's /responses endpoint for models that don't support /chat/completions API. This enables compatibility with newer model variants (`gpt-5-codex`, `gpt-5-pro`, `o3-pro`, etc) while maintaining compatibility with existing configs Changes: - Add `supports_chat_completions` flag to model capabilities that defaults to true for existing behavior - Implement responses API client with streaming support as per [OpenAI documentation](https://app.stainless.com/api/spec/documented/openai/openapi.documented.yml). - Add `ResponseEventMapper` to convert responses events to completion events for maintainer simplicity - Update UI to allow toggling `chat_completions` capability - Add `gpt-5-codex` model Closes #38858 Release Notes: - Added support for `gpt-5-codex` model --------- Co-authored-by: Bennet Bo Fenner <bennet@zed.dev>	2026-01-05 18:15:54 +01:00
Richard Feldman	b5a0a3322d	Add GPT-5.2 support (#44656 ) <img width="429" height="188" alt="Screenshot 2025-12-11 at 3 45 26 PM" src="https://github.com/user-attachments/assets/fe9f1b86-7268-4c63-a8c2-75ac671012c9" /> Release Notes: - Added GPT-5.2 support when using your own OpenAI key	2025-12-11 15:49:10 -05:00
Agus Zubiaga	f08fd732a7	Add experimental mercury edit prediction provider (#44256 ) Release Notes: - N/A --------- Co-authored-by: Ben Kunkle <ben@zed.dev> Co-authored-by: Max Brunsfeld <maxbrunsfeld@gmail.com>	2025-12-06 10:08:44 +00:00
Mikayla Maki	53eb35f5b2	Add GPT 5.1 to Zed BYOK (#43492 ) Release Notes: - Added support for OpenAI's GPT 5.1 model to BYOK	2025-11-25 14:17:27 -08:00
Tim McLean	fb90b12073	Add retry support for OpenAI-compatible LLM providers (#37891 ) Automatically retry the agent's LLM completion requests when the provider returns 429 Too Many Requests. Uses the Retry-After header to determine the retry delay if it is available. Many providers are frequently overloaded or have low rate limits. These providers are essentially unusable without automatic retries. Tested with Cerebras configured via openai_compatible. Related: #31531 Release Notes: - Added automatic retries for OpenAI-compatible LLM providers --------- Co-authored-by: Bennet Bo Fenner <bennetbo@gmx.de>	2025-11-13 14:15:46 +00:00
Max Brunsfeld	784fdcaee3	zeta2: Build edit prediction prompt and process model output in client (#41870 ) Release Notes: - N/A --------- Co-authored-by: Agus Zubiaga <agus@zed.dev> Co-authored-by: Ben Kunkle <ben@zed.dev> Co-authored-by: Piotr Osiewicz <24362066+osiewicz@users.noreply.github.com>	2025-11-06 18:36:58 -05:00
Techy	27a18843d4	open_ai: Make the deltas optional (#39142 ) I am using an Azure OpenAI instance since that is what is provided at work and with how they have it setup not all responses contain a delta, which lead to errors and truncated responses. This is related to how they are filtering potentially offensive requests and responses. I don't believe this filter was made in-house, instead I believe it is provided by Microsoft/Azure, so I suspect this fix may help other users. Release Notes: - N/A Co-authored-by: Bennet Bo Fenner <bennetbo@gmx.de>	2025-11-05 13:47:14 +01:00
Conrad Irwin	fcdab160f9	Settings refactor (#38367 ) Co-Authored-By: Ben K <ben@zed.dev> Co-Authored-By: Anthony <anthony@zed.dev> Co-Authored-By: Mikayla <mikayla@zed.dev> Release Notes: - settings: Major internal changes to settings. The primary user-facing effect is that some settings which did not make sense in project settings files are no-longer read from there. (For example the inline blame settings) --------- Co-authored-by: Ben Kunkle <ben@zed.dev> Co-authored-by: Mikayla Maki <mikayla.c.maki@gmail.com> Co-authored-by: Anthony <anthony@zed.dev>	2025-09-18 16:47:23 +00:00
ZhangJun	7091c70a1e	open_ai: Trim newline before "data:" prefix and account for the possibility of no space after ":" (#37644 ) I'am using an openai compatible model, but got nothing in agent thread panel, and Zed log has "Model generated an empty summary" line. I add one log to open_ai.rs: <img width="2454" height="626" alt="图片" src="https://github.com/user-attachments/assets/85354c7d-a0cc-4bba-86fd-2a640038a13e" /> and got: <img width="3456" height="278" alt="图片" src="https://github.com/user-attachments/assets/7746aedd-5d76-44b5-90f2-e129a1507178" /> It appear that `let line = line.strip_prefix("data: ")?;` can not handle correctly. Release Notes: - N/A --------- Co-authored-by: Ben Brandt <benjamin.j.brandt@gmail.com>	2025-09-08 22:01:55 +02:00
Umesh Yadav	9f749881b3	language_models: Fix tool_choice null issue for other providers (#34554 ) Follow up: #34532 Closes #35434 Mostly fixes a issue were when the tool_choice is none it was getting serialised as null. This was fixed for openrouter just wanted to follow up and cleanup for other providers which might have this issue as this is against the spec. Release Notes: - N/A	2025-09-03 01:22:57 +02:00
Antonio Scandurra	39d86eeb7f	Trim API key when submitting requests to LLM providers (#37082 ) This prevents the common footgun of copy/pasting an API key starting/ending with extra newlines, which would lead to a "bad request" error. Closes #37038 Release Notes: - agent: Support pasting language model API keys that contain newlines.	2025-08-28 12:00:44 +00:00
Michael Sloan	0470baca50	open_ai: Remove `model` field from ResponseStreamEvent (#36902 ) Closes #36901 Release Notes: - Fixed use of Open WebUI as an LLM provider.	2025-08-25 19:50:08 +00:00
Piotr Osiewicz	05fc0c432c	Fix a bunch of other low-hanging style lints (#36498 ) - Fix a bunch of low hanging style lints like unnecessary-return - Fix single worktree violation - And the rest Release Notes: - N/A	2025-08-19 21:26:17 +02:00
Oleksiy Syvokon	42ffa8900a	open_ai: Fix error response parsing (#36390 ) Closes #35925 Release Notes: - Fixed OpenAI error response parsing in some cases	2025-08-18 08:54:31 +00:00
Oleksiy Syvokon	2a57b160b0	openai: Don't send prompt_cache_key for OpenAI-compatible models (#36231 ) Some APIs fail when they get this parameter Closes #36215 Release Notes: - Fixed OpenAI-compatible providers that don't support prompt caching and/or reasoning	2025-08-15 13:54:24 +03:00
Oleksiy Syvokon	a3dcc76687	openai: Don't send reasoning_effort if it's not set (#36228 ) Release Notes: - N/A	2025-08-15 09:12:18 +00:00
Cretezy	8ff2e3e195	language_models: Add reasoning_effort for custom models (#35929 ) Release Notes: - Added `reasoning_effort` support to custom models Tested using the following config: ```json5 "language_models": { "openai": { "available_models": [ { "name": "gpt-5-mini", "display_name": "GPT 5 Mini (custom reasoning)", "max_output_tokens": 128000, "max_tokens": 272000, "reasoning_effort": "high" // Can be minimal, low, medium (default), and high } ], "version": "1" } } ``` Docs: https://platform.openai.com/docs/api-reference/chat/create#chat_create-reasoning_effort This work could be used to split the GPT 5/5-mini/5-nano into each of it's reasoning effort variant. E.g. `gpt-5`, `gpt-5 low`, `gpt-5 minimal`, `gpt-5 high`, and same for mini/nano. Release Notes: * Added a setting to control `reasoning_effort` in OpenAI models	2025-08-13 06:09:16 +00:00
Oleksiy Syvokon	7167f193c0	open_ai: Send `prompt_cache_key` to improve caching (#36065 ) Release Notes: - N/A Co-authored-by: Michael Sloan <mgsloan@gmail.com>	2025-08-12 21:51:23 +03:00
Oleksiy Syvokon	7ff0f1525e	open_ai: Log inputs that caused parsing errors (#36063 ) Release Notes: - N/A Co-authored-by: Michael Sloan <mgsloan@gmail.com>	2025-08-12 21:49:19 +03:00
Richard Feldman	7d4d8b8398	Add GPT-5 support through OpenAI API (#35822 ) (This PR does not add GPT-5 to Zed Pro, but rather adds access if you're using your own OpenAI API key.) <img width="772" height="333" alt="Screenshot 2025-08-07 at 2 23 18 PM" src="https://github.com/user-attachments/assets/42e75082-118a-4737-89b6-a740ae33b169" /> --- NOTE: If your API key is not through a verified organization, you may see this error: <img width="549" height="253" alt="Screenshot 2025-08-07 at 2 04 54 PM" src="https://github.com/user-attachments/assets/d0b6d739-9c39-4af3-88d7-0c9609b0e6ba" /> Even if your org is verified, you still may not have access to GPT-5, in which case you could see this error: <img width="543" height="98" alt="Screenshot 2025-08-07 at 2 09 18 PM" src="https://github.com/user-attachments/assets/e3ed31e3-2a11-4f07-8f3c-5b410fbe4540" /> One way to test if you're in this situation is to visit https://platform.openai.com/chat/edit?models=gpt-5 and see if you get the same "you don't have access to GPT-5" error on OpenAI's official playground. It looks like this: <img width="581" height="196" alt="Screenshot 2025-08-07 at 2 15 25 PM" src="https://github.com/user-attachments/assets/ea1454ca-3c10-4703-8126-c02cb92a34f2" /> Release Notes: - Added GPT-5, as well as its mini and nano variants. To use this, you need to have an OpenAI API key configured via the `OPENAI_API_KEY` environment variable.	2025-08-07 23:35:41 +00:00
Umesh Yadav	3f4098e87b	open_ai: Make OpenAI error message generic (#33383 ) Context: In this PR: https://github.com/zed-industries/zed/pull/33362, we started to use underlying open_ai crate for making api calls for vercel as well. Now whenever we get the error we get something like the below. Where on part of the error mentions OpenAI but the rest of the error returns the actual error from provider. This PR tries to make the error generic for now so that people don't get confused seeing OpenAI in their v0 integration. ``` Error interacting with language model Failed to connect to OpenAI API: 403 Forbidden {"success":false,"error":"Premium or Team plan required to access the v0 API: https://v0.dev/chat/settings/billing"} ``` Release Notes: - N/A	2025-06-28 14:38:27 +02:00
Umesh Yadav	108162423d	language_models: Emit UsageUpdate events for token usage in DeepSeek and OpenAI (#33242 ) Closes #ISSUE Release Notes: - N/A	2025-06-25 09:42:30 +02:00
Bennet Bo Fenner	c34b24b5fb	open_ai: Fix issues with OpenAI compatible APIs (#32982 ) Ran into this while adding support for Vercel v0s models: - The timestamp seems to be returned in Milliseconds instead of seconds so it breaks the bounds of `created: u32`. We did not use this field anywhere so just decided to remove it - Sometimes the `choices` field can be empty when the last chunk comes in because it only contains `usage` Release Notes: - N/A	2025-06-18 21:51:51 +00:00
Richard Feldman	5405c2c2d3	Standardize on u64 for token counts (#32869 ) Previously we were using a mix of `u32` and `usize`, e.g. `max_tokens: usize, max_output_tokens: Option<u32>` in the same `struct`. Although [tiktoken](https://github.com/openai/tiktoken) uses `usize`, token counts should be consistent across targets (e.g. the same model doesn't suddenly get a smaller context window if you're compiling for wasm32), and these token counts could end up getting serialized using a binary protocol, so `usize` is not the right choice for token counts. I chose to standardize on `u64` over `u32` because we don't store many of them (so the extra size should be insignificant) and future models may exceed `u32::MAX` tokens. Release Notes: - N/A	2025-06-17 10:43:07 -04:00
Ben Brandt	2d4e427b45	OpenAI cleanups (#32597 ) Release Notes: - openai: Remove support for deprecated o1-preview and o1-mini models - openai: Support streaming for o1 model	2025-06-12 08:55:48 +00:00
Ben Brandt	8cc5b04045	open_ai: Remove redundant serde aliases and add model limits (#32572 ) Remove unnecessary alias attributes from Model enum variants and add max_output_tokens limits for all OpenAI models. Also fix supports_system_messages to explicitly handle all model variants. Release Notes: - N/A	2025-06-11 22:51:41 +02:00
Adrian Furo	e6f51966a1	open_ai: Fix parallel tools issue (#30467 ) There is no ISSUE opened on this topic Release Notes: - N/A --------- Co-authored-by: Peter Tripp <peter@zed.dev> Co-authored-by: Ben Brandt <benjamin.j.brandt@gmail.com>	2025-05-26 11:46:35 +00:00
Ben Brandt	ef0e1cb2ba	open_ai: Make Assistant message content optional (#31418 ) Fixes regression caused by: https://github.com/zed-industries/zed/pull/30639 Assistant messages can come back with no content, and we no longer allowed that in the deserialization. Release Notes: - open_ai: fixed deserialization issue if assistant content was empty	2025-05-26 09:59:39 +00:00
Kirill Bulatov	16366cf9f2	Use `anyhow` more idiomatically (#31052 ) https://github.com/zed-industries/zed/issues/30972 brought up another case where our context is not enough to track the actual source of the issue: we get a general top-level error without inner error. The reason for this was `.ok_or_else(\|\| anyhow!("failed to read HEAD SHA"))?; ` on the top level. The PR finally reworks the way we use anyhow to reduce such issues (or at least make it simpler to bubble them up later in a fix). On top of that, uses a few more anyhow methods for better readability. * `.ok_or_else(\|\| anyhow!("..."))`, `map_err` and other similar error conversion/option reporting cases are replaced with `context` and `with_context` calls * in addition to that, various `anyhow!("failed to do ...")` are stripped with `.context("Doing ...")` messages instead to remove the parasitic `failed to` text * `anyhow::ensure!` is used instead of `if ... { return Err(...); }` calls * `anyhow::bail!` is used instead of `return Err(anyhow!(...));` Release Notes: - N/A	2025-05-20 23:06:07 +00:00
Agus Zubiaga	dd6594621f	Add image input support for OpenAI models (#30639 ) Release Notes: - Added input image support for OpenAI models	2025-05-13 17:32:42 +02:00
Marshall Bowers	b54bbebc03	open_ai: Remove list of supported countries (#29347 ) This PR removes the list of supported countries from the `open_ai` crate, as it is no longer referenced in this repo. Release Notes: - N/A	2025-04-24 14:55:37 +00:00
Michael Sloan	fbf7caf93e	Default to fast model for thread summaries and titles + don't include system prompt / context / thinking segments (#29102 ) * Adds a fast / cheaper model to providers and defaults thread summarization to this model. Initial motivation for this was that https://github.com/zed-industries/zed/pull/29099 would cause these requests to fail when used with a thinking model. It doesn't seem correct to use a thinking model for summarization. * Skips system prompt, context, and thinking segments. * If tool use is happening, allows 2 tool uses + one more agent response before summarizing. Downside of this is that there was potential for some prefix cache reuse before, especially for title summarization (thread summarization omitted tool results and so would not share a prefix for those). This seems fine as these requests should typically be fairly small. Even for full thread summarization, skipping all tool use / context should greatly reduce the token use. Release Notes: - N/A	2025-04-19 23:26:29 +00:00
Umesh Yadav	8117940aca	Add support for OpenAI o3 and o4-mini models (#28881 ) Release Notes: - Add support for OpenAI o3 and o4-mini models via OpenAI API and Copilot Chat providers. --------- Co-authored-by: Peter Tripp <peter@zed.dev>	2025-04-17 10:58:41 -04:00
Umesh Yadav	84aa480344	Add support for OpenAI GPT-4.1 models (#28708 ) Release Notes: - Add support for OpenAI GPT-4.1 via Copilot Chat and OpenAI API --------- Co-authored-by: Danilo Leal <daniloleal09@gmail.com> Co-authored-by: Bennet Bo Fenner <bennetbo@gmx.de>	2025-04-14 16:15:59 -03:00
Marshall Bowers	819bb8fffb	open_ai: Disable `parallel_tool_calls` (#28056 ) This PR disables `parallel_tool_calls` for the models that support it, as the Agent currently expects at most one tool use per turn. It was a bit of trial and error to figure this out. OpenAI's API annoyingly will return an error if passing `parallel_tool_calls` to a model that doesn't support it. Release Notes: - N/A	2025-04-03 22:07:37 +00:00
Marshall Bowers	7492ec3f67	Add tool use support for OpenAI models (#28051 ) This PR adds support for using tools to the OpenAI models. Release Notes: - agent: Added support for tool use with OpenAI models (Preview only).	2025-04-03 20:55:11 +00:00
Marshall Bowers	e5b347b03a	Remove unused `extract_tool_args_from_events` functions (#28038 ) This PR removes the unused `extract_tool_args_from_events` functions that were defined in some of the LLM provider crates. Release Notes: - N/A	2025-04-03 18:38:35 +00:00
Piotr Osiewicz	dc64ec9cc8	chore: Bump Rust edition to 2024 (#27800 ) Follow-up to https://github.com/zed-industries/zed/pull/27791 Release Notes: - N/A	2025-03-31 20:55:27 +02:00
Peter Tripp	d1af7b1322	Update Assistant context limits (#25087 ) - Update GitHub Copilot Chat context limits - Add decimal separators for consistency	2025-02-19 11:06:20 -05:00
Patrick Detlefsen	c0dd7e8367	open_ai: Include o3-mini in `Model::from_id` (#24261 )	2025-02-05 16:45:38 -05:00
João Marcos	5bd7eaa173	Solve 50+ `cargo doc` warnings (#24071 ) Release Notes: - N/A	2025-02-01 06:19:29 +00:00

1 2

81 commits