vrr/free-claude-code

mirror of https://github.com/Alishahryar1/free-claude-code.git synced 2026-04-28 11:30:03 +00:00

Author	SHA1	Message	Date
Alishahryar1	6f3d762a4f	Revert "Add per-model thinking toggles" This reverts commit `1f12a33dd7`.	2026-04-24 00:26:15 -07:00
Alishahryar1	1f12a33dd7	Add per-model thinking toggles	2026-04-24 00:14:49 -07:00
arssing	2fe15bd2cd	feat: add proxy support for httpx clients (#125 ) Add proxy support for providers based on [doc](https://www.python-httpx.org/advanced/proxies/): - Add per-provider proxy support (HTTP and SOCKS5) for all 4 providers: nvidia_nim, open_router, lmstudio, llamacpp - Each provider gets its own env var (NVIDIA_NIM_PROXY, OPENROUTER_PROXY, LMSTUDIO_PROXY, LLAMACPP_PROXY) for independent proxy configuration --------- Co-authored-by: Alishahryar1 <alishahryar2@gmail.com>	2026-04-22 17:06:16 -07:00
Pavel Yurchenko	e719e4aed2	feat: deepseek api support (#118 ) ## Summary * add native DeepSeek provider support via the shared OpenAI-compatible provider base * allow `deepseek/...` model prefixes in config validation * add `DEEPSEEK_API_KEY` and `DEEPSEEK_BASE_URL` settings * add DeepSeek entries to `.env.example` and `config/env.example` * implement `DeepSeekProvider` and register it in provider dependencies * add a DeepSeek request builder with DeepSeek-specific thinking payload handling * preserve Anthropic thinking blocks as `reasoning_content` for DeepSeek-compatible continuation flows * update `claude-pick` to discover DeepSeek models from the DeepSeek API * document DeepSeek usage in `README.md` * add tests for config validation, provider dependency wiring, request building, and streaming behavior ## Motivation DeepSeek exposes an OpenAI-compatible API and can be used directly without routing through OpenRouter. This lets users spend their existing DeepSeek balance through the proxy while keeping the same Claude Code workflow and per-model provider mapping. ## Example ```dotenv DEEPSEEK_API_KEY="sk-..." DEEPSEEK_BASE_URL="https://api.deepseek.com" MODEL_OPUS="deepseek/deepseek-reasoner" MODEL_SONNET="deepseek/deepseek-chat" MODEL_HAIKU="deepseek/deepseek-chat" MODEL="deepseek/deepseek-chat" --------- Co-authored-by: Alishahryar1 <alishahryar2@gmail.com>	2026-04-22 17:06:01 -07:00
Alishahryar1	835d0454e8	Fixes for issue 113 and 116	2026-04-18 16:32:31 -07:00
th-ch	f703a0e403	Implement optional authentication (Anthropic style) (#80 ) Some checks are pending CI / checks (push) Waiting to run Details	2026-03-27 11:11:47 -07:00
Alishahryar1	5a36a32836	feat: add llama.cpp provider for local anthropic messages API	2026-03-08 10:38:25 -07:00
Ali Khokhar	0b324e0421	Per claude model mapping (#66 )	2026-03-01 21:32:23 -08:00
Alishahryar1	34757511a0	Improve deterministic error surfacing across stream and API	2026-03-01 01:32:52 -08:00
Ali Khokhar	aee9f0ad93	Add code review fix plan covering 11 issues across modularity, encapsulation, performance, and dead code (#62 )	2026-03-01 00:45:33 -08:00
Alishahryar1	a74ec74271	Major refactor done with minimax m2.5	2026-02-28 04:36:29 -08:00
Alishahryar1	79a1ae0c54	minor refactor using minimax m2.5	2026-02-27 20:44:39 -08:00
Ali Khokhar	c4d8681000	Backup/before cleanup 20260222 230402 (#58 )	2026-02-27 19:50:21 -08:00
Claude	afaf50a972	Add queue-level concurrency limit to provider streaming Adds max_concurrency cap to GlobalRateLimiter using asyncio.Semaphore. A request now waits for a concurrency slot before the sliding window rate limit check, so at most N streams are open to the provider simultaneously, even when the rate window would allow more. Changes: - providers/rate_limit.py: max_concurrency param, _concurrency_sem, concurrency_slot() asynccontextmanager - providers/openai_compat.py: pass max_concurrency to limiter; wrap execute_with_retry + stream iteration in concurrency_slot() - providers/base.py: max_concurrency field on ProviderConfig - config/settings.py: provider_max_concurrency setting (PROVIDER_MAX_CONCURRENCY env var, default None = unlimited) - api/dependencies.py: pass provider_max_concurrency into all three provider ProviderConfig instantiations - .env.example: document PROVIDER_MAX_CONCURRENCY (commented out) - tests/providers/test_provider_rate_limit.py: 5 new tests covering concurrency limit enforcement, slot release on exception, noop when unconfigured - tests/api/test_dependencies.py: add provider_max_concurrency=None to mock settings helper https://claude.ai/code/session_014mrF1WMNgmNjtPBuoQHsbg	2026-02-19 14:23:21 +00:00
Alishahryar1	b05d0d2703	new linter rules and fixes	2026-02-18 04:13:41 -08:00
Cursor Agent	e9beb28897	fix: validate API keys at provider init to prevent 403 'authorization missing' When NVIDIA_NIM_API_KEY or OPENROUTER_API_KEY is empty or not set, the proxy forwarded requests without a valid Authorization header, causing providers to return 403 with 'Header of type authorization was missing'. Now fail fast with HTTP 503 and a clear message telling users to add the key to .env, with links to obtain keys. Fixes #29 Co-authored-by: Ali Khokhar <alishahryar2@gmail.com>	2026-02-17 07:33:56 +00:00
Cursor Agent	72b7e34999	Phase 3: Fix encapsulation violations - Add MessageTree.set_current_task() method - Update tree_processor to use set_current_task instead of _current_task - Move nim_settings out of ProviderConfig, pass only to NvidiaNimProvider - Update api/dependencies and all tests Co-authored-by: Ali Khokhar <alishahryar2@gmail.com>	2026-02-17 01:58:51 +00:00
Alishahryar1	01852e1638	Add configurable HTTP timeouts for provider API requests Updated the README to include new timeout settings. Implemented these timeouts in the provider classes and added corresponding tests to ensure they are correctly passed to the client. Also included environment variable support for the new settings.	2026-02-16 01:40:15 -08:00
Alishahryar1	539854fe7b	Refactor done using GLM-5	2026-02-15 21:58:03 -08:00
Alishahryar1	b83be84313	Add LM Studio provider support - Introduced `LMStudioProvider` to the provider system. - Added a new fixture `lmstudio_provider` in `conftest.py` for testing. - Updated `get_provider` function to handle `lmstudio` as a valid provider type. - Enhanced README and `.env.example` to include LM Studio configuration details. - Updated settings to accommodate LM Studio's base URL and provider type. - Added tests to verify the functionality of the LM Studio provider.	2026-02-15 19:41:03 -08:00
Alishahryar1	054f9869b7	Refactor rate limiting configuration to use unified provider settings - Replaced NVIDIA NIM and OpenRouter specific rate limit settings with a generic provider rate limit in settings, tests, and environment files. - Updated README.md to reflect the new provider rate limit configuration. - Adjusted tests to validate the new provider rate limit attributes.	2026-02-15 11:03:59 -08:00
Alishahryar1	e5a096049d	feat: add OpenRouter support and configuration options - Introduced OpenRouter as a new provider option in settings and environment configuration. - Updated README.md to include instructions for using OpenRouter. - Enhanced the message converter to support reasoning content for OpenRouter. - Added tests for OpenRouter provider functionality and message conversion. - Updated dependencies to include OpenRouterProvider.	2026-02-15 10:50:53 -08:00
Alishahryar1	ab74791e4b	Improved logging coverage	2026-02-14 23:14:58 -08:00
Alishahryar1	6102583026	Major Refactor Part 2 with kimi-k2.5 in claude code	2026-02-05 16:09:16 -08:00
Alishahryar1	fcbe204f44	Major refactor done with kimi-k2.5 in claude code	2026-02-05 10:51:33 -08:00
Alishahryar1	8ce86f4267	fixed type errors	2026-01-31 14:13:09 -08:00
Alishahryar1	b0f77b67cc	changed close to aclose	2026-01-31 14:05:41 -08:00
Alishahryar1	0ea3ec8741	added tree persistence	2026-01-31 13:56:06 -08:00
Alishahryar1	3b037932d7	refactor to reduce coupling	2026-01-30 13:39:40 -08:00
Alishahryar1	f1efa22a82	fixed issues after refactor	2026-01-29 14:50:05 -08:00
Alishahryar1	8678a62915	Major refactor done by itself	2026-01-29 14:40:08 -08:00

31 commits