vrr/free-claude-code

mirror of https://github.com/Alishahryar1/free-claude-code.git synced 2026-05-22 03:01:49 +00:00

Author	SHA1	Message	Date
Alishahryar1	29e7714337	feat(logging): structured TRACE events and end-to-end request correlation Add core/trace.py with trace_event, traced_async_stream, and payload snapshots. Merge TRACE fields into JSON logs; promote claude_session_id, http path/method. Instrument API, messaging/CLI, and OpenAI-compat/native provider paths. Harden log sink with enqueue and stdlib intercept re-entrancy guard. Document behavior in .env.example and README; extend tests.	2026-05-10 18:24:48 -07:00
Alishahryar1	d3a3b37e9e	Filter OpenRouter model variants by thinking support	2026-04-30 22:01:36 -07:00
Alishahryar1	eb5516e53b	Validate configured models at startup	2026-04-30 00:33:45 -07:00
Alishahryar1	f3a7528d49	Major refactor: API, providers, messaging, and Anthropic protocol Consolidates the incremental refactor work into a single change set: modular web tools (api/web_tools), native Anthropic request building and SSE block policy, OpenAI conversion and error handling, provider transports and rate limiting, messaging handler and tree queue, safe logging, smoke tests, and broad test coverage.	2026-04-26 03:01:14 -07:00
Alishahryar1	f29e693dc5	Add per-model thinking toggles	2026-04-25 20:51:07 -07:00
Alishahryar1	66ef23072c	Refactor provider routing and smoke coverage	2026-04-24 19:34:34 -07:00
Alishahryar1	6f3d762a4f	Revert "Add per-model thinking toggles" This reverts commit `1f12a33dd7`.	2026-04-24 00:26:15 -07:00
Alishahryar1	1f12a33dd7	Add per-model thinking toggles	2026-04-24 00:14:49 -07:00
arssing	2fe15bd2cd	feat: add proxy support for httpx clients (#125) Add proxy support for providers based on [doc](https://www.python-httpx.org/advanced/proxies/): - Add per-provider proxy support (HTTP and SOCKS5) for all 4 providers: nvidia_nim, open_router, lmstudio, llamacpp - Each provider gets its own env var (NVIDIA_NIM_PROXY, OPENROUTER_PROXY, LMSTUDIO_PROXY, LLAMACPP_PROXY) for independent proxy configuration --------- Co-authored-by: Alishahryar1 <alishahryar2@gmail.com>	2026-04-22 17:06:16 -07:00
Alishahryar1	835d0454e8	Fixes for issue 113 and 116	2026-04-18 16:32:31 -07:00
Alishahryar1	79a1ae0c54	minor refactor using minimax m2.5	2026-02-27 20:44:39 -08:00
Claude	45b7e4cafd	Make PROVIDER_MAX_CONCURRENCY required with default of 5 - `max_concurrency` is now always an `int` (default 5); `None`/unlimited is no longer a valid state, and omitting the env var uses the default - `GlobalRateLimiter`: semaphore is always created; `concurrency_slot()` no longer has None guards; log message always includes concurrency - `ProviderConfig.max_concurrency`: `int = 5` (was `int | None = None`) - `Settings.provider_max_concurrency`: `int = Field(default=5, ...)`; setting the env var to an invalid value (e.g. empty string) raises - `.env.example`: uncommented `PROVIDER_MAX_CONCURRENCY=5` - README: updated config table default from `—` to `5` - Tests: removed `test_concurrency_slot_noop_when_not_configured`; updated mock settings to use `5` instead of `None` https://claude.ai/code/session_014mrF1WMNgmNjtPBuoQHsbg	2026-02-19 14:39:42 +00:00
Claude	afaf50a972	Add queue-level concurrency limit to provider streaming Adds max_concurrency cap to GlobalRateLimiter using asyncio.Semaphore. A request now waits for a concurrency slot before the sliding window rate limit check, so at most N streams are open to the provider simultaneously, even when the rate window would allow more. Changes: - providers/rate_limit.py: max_concurrency param, _concurrency_sem, concurrency_slot() asynccontextmanager - providers/openai_compat.py: pass max_concurrency to limiter; wrap execute_with_retry + stream iteration in concurrency_slot() - providers/base.py: max_concurrency field on ProviderConfig - config/settings.py: provider_max_concurrency setting (PROVIDER_MAX_CONCURRENCY env var, default None = unlimited) - api/dependencies.py: pass provider_max_concurrency into all three provider ProviderConfig instantiations - .env.example: document PROVIDER_MAX_CONCURRENCY (commented out) - tests/providers/test_provider_rate_limit.py: 5 new tests covering concurrency limit enforcement, slot release on exception, noop when unconfigured - tests/api/test_dependencies.py: add provider_max_concurrency=None to mock settings helper https://claude.ai/code/session_014mrF1WMNgmNjtPBuoQHsbg	2026-02-19 14:23:21 +00:00
Alishahryar1	b05d0d2703	new linter rules and fixes	2026-02-18 04:13:41 -08:00
Cursor Agent	bfc781e0ed	Phase 4-6: Dead code removal, performance, minor fixes Phase 4: - Remove legacy SessionRecord, _sessions, _msg_to_session from SessionStore - Fix hardcoded provider in root endpoint (use settings.provider_type) - Update session store tests Phase 5: - Use list-based string accumulation in ThinkingSegment, TextSegment, ToolCallSegment - Cache MAX_MESSAGE_LOG_ENTRIES_PER_CHAT at SessionStore init - Use iterative DFS in MessageTree.get_descendants Phase 6: - Add comment for abstract async generator workaround in BaseProvider - Rename TELEGRAM_EDIT log tags to PLATFORM_EDIT in handler Co-authored-by: Ali Khokhar <alishahryar2@gmail.com>	2026-02-17 02:01:01 +00:00
Cursor Agent	72b7e34999	Phase 3: Fix encapsulation violations - Add MessageTree.set_current_task() method - Update tree_processor to use set_current_task instead of _current_task - Move nim_settings out of ProviderConfig, pass only to NvidiaNimProvider - Update api/dependencies and all tests Co-authored-by: Ali Khokhar <alishahryar2@gmail.com>	2026-02-17 01:58:51 +00:00
Alishahryar1	01852e1638	Add configurable HTTP timeouts for provider API requests Updated the README to include new timeout settings. Implemented these timeouts in the provider classes and added corresponding tests to ensure they are correctly passed to the client. Also included environment variable support for the new settings.	2026-02-16 01:40:15 -08:00
Alishahryar1	7dfcad2a4c	Enhance logging and error handling across multiple modules - Added request ID context to logging in FastAPI routes and NVIDIA NIM provider. - Improved logging format to include context variables for better traceability. - Updated message handling in Telegram and Claude handlers to log message previews. - Enhanced error logging in NVIDIA NIM provider with request ID for easier debugging. - Added logging for tree repository actions to track tree and node registrations.	2026-02-15 02:01:57 -08:00
Alishahryar1	96747f2216	Updated token counting and removed non streaming support	2026-02-14 19:10:09 -08:00
Alishahryar1	6102583026	Major Refactor Part 2 with kimi-k2.5 in claude code	2026-02-05 16:09:16 -08:00
Alishahryar1	fcbe204f44	Major refactor done with kimi-k2.5 in claude code	2026-02-05 10:51:33 -08:00
Alishahryar1	8ce86f4267	fixed type errors	2026-01-31 14:13:09 -08:00
Alishahryar1	3acabcb8e0	add new template kwargs and fixed linter errors	2026-01-31 00:03:21 -08:00
Alishahryar1	6c9f0c8a5a	initial commit	2026-01-28 11:05:01 -08:00

24 commits
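
Commits afaf50a972 and 45b7e4cafd describe a concurrency cap built on `asyncio.Semaphore`, exposed through a `concurrency_slot()` async context manager and configured by `PROVIDER_MAX_CONCURRENCY` (default 5, invalid values raise). The following is a minimal sketch of that idea, not the repository's code: `ConcurrencyLimiter`, `load_max_concurrency`, `demo`, and the `active`/`peak` counters are hypothetical names introduced for illustration, and the sliding-window rate-limit check mentioned in the log is only marked by a comment.

```python
import asyncio
from contextlib import asynccontextmanager
from typing import Mapping


def load_max_concurrency(env: Mapping[str, str], default: int = 5) -> int:
    """Read PROVIDER_MAX_CONCURRENCY; fall back to the default when unset.

    Hypothetical helper: the log says the real project does this through a
    settings field, which is not reproduced here.
    """
    raw = env.get("PROVIDER_MAX_CONCURRENCY")
    if raw is None:
        return default
    value = int(raw)  # raises ValueError for "" or non-numeric input
    if value < 1:
        raise ValueError("PROVIDER_MAX_CONCURRENCY must be >= 1")
    return value


class ConcurrencyLimiter:
    """Illustrative stand-in for the limiter described in the commit log."""

    def __init__(self, max_concurrency: int = 5) -> None:
        self._sem = asyncio.Semaphore(max_concurrency)
        self.active = 0  # streams currently holding a slot
        self.peak = 0    # highest value `active` has reached

    @asynccontextmanager
    async def concurrency_slot(self):
        # Wait for a free slot; the slot is released even if the body raises.
        async with self._sem:
            self.active += 1
            self.peak = max(self.peak, self.active)
            try:
                yield
            finally:
                self.active -= 1


async def demo(max_concurrency: int, n: int):
    """Run n fake streams and report their results and the peak concurrency."""
    limiter = ConcurrencyLimiter(max_concurrency)

    async def fake_stream(i: int) -> int:
        async with limiter.concurrency_slot():
            # In the design the log describes, the sliding-window rate-limit
            # check and the provider stream would run here, inside the slot.
            await asyncio.sleep(0.01)
            return i

    results = await asyncio.gather(*(fake_stream(i) for i in range(n)))
    return results, limiter.peak
```

Calling `asyncio.run(demo(load_max_concurrency({}), 10))` runs ten fake streams with the default cap of 5, so the reported peak never exceeds 5. Acquiring the semaphore before any rate-window check means a burst of requests queues on the slot first, which matches the ordering stated in afaf50a972.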