vrr/free-claude-code

447 commits 2 branches 0 tags 6.2 MiB

Author	SHA1	Message	Date
Alishahryar1	d6a0e1a401	Provider inferred from model name using prefix	2026-02-19 20:53:02 -08:00
Claude	45b7e4cafd	Make PROVIDER_MAX_CONCURRENCY required with default of 5 - `max_concurrency` is now always an `int` (default 5) — `None`/unlimited is no longer a valid state; omitting the env var uses the default - `GlobalRateLimiter`: semaphore is always created; `concurrency_slot()` no longer has None guards; log message always includes concurrency - `ProviderConfig.max_concurrency`: `int = 5` (was `int | None = None`) - `Settings.provider_max_concurrency`: `int = Field(default=5, ...)` — setting env var to an invalid value (e.g. empty string) raises - `.env.example`: uncommented `PROVIDER_MAX_CONCURRENCY=5` - README: updated config table default from `—` to `5` - Tests: removed `test_concurrency_slot_noop_when_not_configured`; updated mock settings to use `5` instead of `None` https://claude.ai/code/session_014mrF1WMNgmNjtPBuoQHsbg	2026-02-19 14:39:42 +00:00
Claude	afaf50a972	Add queue-level concurrency limit to provider streaming Adds max_concurrency cap to GlobalRateLimiter using asyncio.Semaphore. A request now waits for a concurrency slot before the sliding window rate limit check, so at most N streams are open to the provider simultaneously, even when the rate window would allow more. Changes: - providers/rate_limit.py: max_concurrency param, _concurrency_sem, concurrency_slot() asynccontextmanager - providers/openai_compat.py: pass max_concurrency to limiter; wrap execute_with_retry + stream iteration in concurrency_slot() - providers/base.py: max_concurrency field on ProviderConfig - config/settings.py: provider_max_concurrency setting (PROVIDER_MAX_CONCURRENCY env var, default None = unlimited) - api/dependencies.py: pass provider_max_concurrency into all three provider ProviderConfig instantiations - .env.example: document PROVIDER_MAX_CONCURRENCY (commented out) - tests/providers/test_provider_rate_limit.py: 5 new tests covering concurrency limit enforcement, slot release on exception, noop when unconfigured - tests/api/test_dependencies.py: add provider_max_concurrency=None to mock settings helper https://claude.ai/code/session_014mrF1WMNgmNjtPBuoQHsbg	2026-02-19 14:23:21 +00:00
Alishahryar1	b05d0d2703	new linter rules and fixes	2026-02-18 04:13:41 -08:00
Cursor Agent	e9beb28897	fix: validate API keys at provider init to prevent 403 'authorization missing' When NVIDIA_NIM_API_KEY or OPENROUTER_API_KEY is empty or not set, the proxy forwarded requests without a valid Authorization header, causing providers to return 403 with 'Header of type authorization was missing'. Now fail fast with HTTP 503 and a clear message telling users to add the key to .env, with links to obtain keys. Fixes #29 Co-authored-by: Ali Khokhar <alishahryar2@gmail.com>	2026-02-17 07:33:56 +00:00
Cursor Agent	4b4f87515d	Phase 7: Directory restructuring (messaging/ and tests/) - Create messaging/platforms/ (base, discord, telegram, factory) - Create messaging/rendering/ (discord_markdown, telegram_markdown) - Create messaging/trees/ (data, repository, processor, queue_manager) - Organize tests/ into api/, providers/, messaging/, cli/, config/ - Add backward-compatible re-exports at old locations - Update handler.py and test_messaging_factory.py imports - Fix Telegram type hints for TELEGRAM_AVAILABLE=False case - Fix Python 3 except syntax in discord_markdown Co-authored-by: Ali Khokhar <alishahryar2@gmail.com>	2026-02-17 02:25:42 +00:00
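
The concurrency cap described in commits afaf50a972 and 45b7e4cafd can be illustrated with a minimal sketch. This is not the repository's `GlobalRateLimiter`: the class name `SketchRateLimiter`, the `rate_limit` and `rate_window` parameters, the `_wait_for_window` helper, and the `demo` function are all hypothetical, and running the sliding-window check inside `concurrency_slot()` is an assumption made to keep the example short. It only shows the ordering stated in the commit message: a request first waits for a semaphore slot, then passes the sliding-window check, and the slot is released when the stream finishes or raises.

```python
import asyncio
import time
from collections import deque
from contextlib import asynccontextmanager


class SketchRateLimiter:
    """Hypothetical stand-in for GlobalRateLimiter; not the repository's code."""

    def __init__(self, rate_limit: int, rate_window: float, max_concurrency: int = 5) -> None:
        self._rate_limit = rate_limit        # max requests per window (assumed name)
        self._rate_window = rate_window      # window length in seconds (assumed name)
        self._timestamps = deque()           # start times of recent requests
        self._lock = asyncio.Lock()
        # Always created: max_concurrency is an int, never None.
        self._concurrency_sem = asyncio.Semaphore(max_concurrency)

    async def _wait_for_window(self) -> None:
        """Block until the sliding window has room for one more request."""
        while True:
            async with self._lock:
                now = time.monotonic()
                # Drop timestamps that have aged out of the window.
                while self._timestamps and now - self._timestamps[0] >= self._rate_window:
                    self._timestamps.popleft()
                if len(self._timestamps) < self._rate_limit:
                    self._timestamps.append(now)
                    return
                delay = self._rate_window - (now - self._timestamps[0])
            await asyncio.sleep(delay)

    @asynccontextmanager
    async def concurrency_slot(self):
        # Take a concurrency slot first, then run the rate-window check.
        # The semaphore is released on normal exit and on exceptions.
        async with self._concurrency_sem:
            await self._wait_for_window()
            yield


async def demo(max_concurrency: int = 2, jobs: int = 6) -> int:
    """Run fake streams and return the peak number open at the same time."""
    limiter = SketchRateLimiter(rate_limit=100, rate_window=1.0,
                                max_concurrency=max_concurrency)
    active = 0
    peak = 0

    async def fake_stream() -> None:
        nonlocal active, peak
        async with limiter.concurrency_slot():
            active += 1
            peak = max(peak, active)
            await asyncio.sleep(0.01)  # stands in for iterating a provider stream
            active -= 1

    await asyncio.gather(*(fake_stream() for _ in range(jobs)))
    return peak
```

Calling `asyncio.run(demo())` returns the peak number of simultaneously open fake streams, which should never exceed `max_concurrency` even though the rate window here would admit all of them at once.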

Renamed from tests/test_dependencies.py

6 commits