open-notebook

mirror of https://github.com/lfnovo/open-notebook.git synced 2026-04-28 03:19:59 +00:00

Author	SHA1	Message	Date
Luis Novo	3f352cfcce	feat: credential-based API key management (#477 ) (#540 ) * feat: replace provider config with credential-based system (#477) Introduce a new credential management system replacing the old ProviderConfig singleton and standalone Models page. Each credential stores encrypted API keys and provider-specific configuration with full CRUD support via a unified settings UI. Backend: - Add Credential domain model with encrypted API key storage - Add credentials API router (CRUD, discovery, registration, testing) - Add encryption utilities for secure key storage - Add key_provider for DB-first env-var fallback provisioning - Add connection tester and model discovery services - Integrate ModelManager with credential-based config - Add provider name normalization for Esperanto compatibility - Add database migrations 11-12 for credential schema Frontend: - Rewrite settings/api-keys page with credential management UI - Add model discovery dialog with search and custom model support - Add compact default model assignments (primary/advanced layout) - Add inline model testing and credential connection testing - Add env-var migration banner - Update navigation to unified settings page - Remove standalone models page and old settings components i18n: - Update all 7 locale files with credential and model management keys Closes #477 Co-Authored-By: JFMD <git@jfmd.us> Co-Authored-By: OraCatQAQ <570768706@qq.com> * fix: address PR #540 review comments - Fix docs referencing removed Models page - Fix error-handler returning raw messages instead of i18n keys - Fix auth.py misleading docstring and missing no-password guard - Fix connection_tester using wrong env var for openai_compatible - Add provision_provider_keys before model discovery/sync - Update CLAUDE.md to reflect credential-based system - Fix missing closing brace in api-keys page useEffect * fix: add logging to credential migration and surface errors in UI - Add comprehensive logging to migrate-from-env and migrate-from-provider-config endpoints (start, per-provider progress, success/failure with stack traces, final summary) - Fix frontend migration hooks ignoring errors array from response - Show error toast when migration fails instead of "nothing to migrate" - Invalidate status/envStatus queries after migration so banner updates * docs: update CLAUDE.md files for credential system Replace stale ProviderConfig and /api-keys/ references across 8 CLAUDE.md files to reflect the new Credential-based system from PR #540. * docs: update user documentation for credential-based system Replace env var API key instructions with Settings UI credential workflow across all user-facing documentation. The new flow is: set OPEN_NOTEBOOK_ENCRYPTION_KEY → start services → add credential in Settings UI → test → discover models → register. - Rewrite ai-providers.md, api-configuration.md, environment-reference.md - Update all quick-start guides and installation docs - Update ollama.md, openai-compatible.md, local-tts/stt networking sections - Update reverse-proxy.md, development-setup.md, security.md - Fix broken links to non-existent docs/deployment/ paths - Add credentials endpoints to api-reference.md - Move all API key env vars to deprecated/legacy sections * chore: bump version to 1.7.0-rc1 Release candidate for credential-based provider management system. * fix: initialize provider before try block in test_credential Prevents UnboundLocalError when Credential.get() throws (e.g., invalid credential_id) before provider is assigned. * fix: reorder down migration to drop index before table Removes duplicate REMOVE FIELD statement and reorders so the index is dropped before the table, preventing rollback failures. * refactor: simplify encryption key to always derive via SHA-256 Remove the dual code path in _ensure_fernet_key() that detected native Fernet keys. Since the credential system is new, always deriving via SHA-256 removes unnecessary complexity. Also removes the generate_key() function and Fernet.generate_key() references from docs. * fix: correct mock patch targets in embedding tests and URL validation Fix embedding tests patching wrong module path for model_manager (was targeting open_notebook.utils.embedding.model_manager but it's imported locally from open_notebook.ai.models). Also fix URL validation to allow unresolvable hostnames since they may be valid in the deployment environment (e.g., Azure endpoints, internal DNS). * feat: add global setup banner for encryption and migration status Show a persistent banner in AppShell when encryption key is missing (red) or env var API keys can be migrated (amber), so users see these prompts on every page instead of only on Settings > API Keys. Includes a docs link for the encryption banner and i18n support across all 7 locales. * docs: several improvements to docker-compose e env examples * Update README.md Co-authored-by: cubic-dev-ai[bot] <191113872+cubic-dev-ai[bot]@users.noreply.github.com> * docs: fix env var format in README and update model setup instructions Align the encryption key snippet in README Step 2 with the list format used in the compose file. Replace deprecated "Settings → Models" instructions with credential-based Discover Models flow. * fix: address credential system review issues - Fix SSRF bypass via IPv4-mapped IPv6 addresses (::ffff:169.254.x.x) - Fix TTS connection test missing config parameter - Add Azure-specific model discovery using api-key auth header - Add Vertex static model list for credential-based discovery - Fix PROVIDER_DISCOVERY_FUNCTIONS incorrect azure/vertex mapping - Extract business logic to api/credentials_service.py (service layer) - Move credential Pydantic schemas to api/models.py - Update tests to use new service imports and ValueError assertions * fix: sanitize error responses and migrate key_provider to Credential - Replace raw exception messages in all credential router 500 responses with generic error strings (internal details logged server-side only) - Refactor key_provider.py to use Credential.get_by_provider() instead of deprecated ProviderConfig.get_instance() - Remove unused functions (get_provider_configs, get_default_api_key, get_provider_config) that were dead code --------- Co-authored-by: JFMD <git@jfmd.us> Co-authored-by: OraCatQAQ <570768706@qq.com>	2026-02-10 08:30:22 -03:00
Luis Novo	03f9edfec2	feat: use standard HTTP_PROXY/HTTPS_PROXY environment variables (#499 ) Update proxy configuration to use industry-standard environment variables (HTTP_PROXY, HTTPS_PROXY, NO_PROXY) instead of custom variables. The underlying libraries (esperanto, content-core, podcast-creator) now automatically detect proxy settings from these standard variables. - Bump content-core>=1.14.1 (fixes #494) - Bump esperanto>=2.18 - Bump podcast-creator>=0.9 - Update documentation with new proxy configuration	2026-01-29 23:31:02 -03:00
LUIS NOVO	ea7a41077b	docs: update all database examples for more clarity and better database names.	2026-01-04 09:23:15 -03:00
LUIS NOVO	e334291bf0	fix: improve SSL handling fixes #274	2025-11-27 11:34:04 -03:00
Luis Novo	f79a9040ae	Release 1.2 (#242 ) * chore: improve podcast transcripts * fix: remove date from insight - fixes #241 * fix: improve scrolling on source and insights - fixes #237 * chore: update esperanto to fix: #234 * chore: update esperanto to fix #226 * fix: process vectorization as subcommands to handle larger documents more gracefully - fix: #229 * feat: enable background job retry capabilities * feat: reenable content types that were disabled during alpha version * fix: remove unnecessary model caching causing many issues. * feat: support multiple azure endpoints and keys just like openai compatible. Fixes #215 * docs: update azure variables * chore: bump and update dependencies	2025-11-01 14:40:00 -03:00
Luis Novo	9bdfd99f1b	feat: simplify reverse proxy configuration with Next.js rewrites (#213 ) Some checks are pending Development Build / extract-version (push) Waiting to run Details Development Build / test-build-regular (push) Blocked by required conditions Details Development Build / test-build-single (push) Blocked by required conditions Details Development Build / summary (push) Blocked by required conditions Details * feat: simplify reverse proxy configuration with Next.js rewrites Add Next.js API rewrites to proxy /api/* requests internally from port 8502 to the FastAPI backend on port 5055. This eliminates the need for complex reverse proxy configurations with multiple upstreams and location blocks. Changes: - Add rewrites to next.config.ts proxying /api/* to INTERNAL_API_URL - Introduce INTERNAL_API_URL env var (defaults to http://localhost:5055) - Update supervisord configs to pass INTERNAL_API_URL to Next.js - Document INTERNAL_API_URL in .env.example with usage examples - Add simplified reverse proxy examples for nginx, Traefik, Caddy, Coolify - Update README architecture diagram to show internal proxying - Add explanatory comments to _config route handler Benefits: - Reduces reverse proxy config from 12 lines to 3 (75% reduction) - Single-port deployment (8502 only) for 95% of use cases - Zero breaking changes - backward compatible with existing setups - Zero performance overhead (validated through testing) - Preserves proxy headers (X-Forwarded-) for rate limiting/SSL Resolves: #179 Related: OSS-321 fix: rename _config to config to fix production routing CRITICAL BUG FIX: The /_config endpoint has never worked in production builds because Next.js treats folders starting with underscore as "private folders" and excludes them from routing entirely. This endpoint is critical for: - Providing API_URL to the browser at runtime - Enabling zero-config deployments with auto-detection - Supporting reverse proxy scenarios where API URL differs from frontend URL Changes: - Rename frontend/src/app/_config/ → frontend/src/app/config/ - Update client code references (/_config → /config) - Update documentation with correct endpoint path - Bump version to 1.1.0 (minor version for new rewrites feature + bug fix) Impact: - Runtime configuration now works in production builds - /config returns {"apiUrl":"http://localhost:5055"} correctly - Auto-detection for reverse proxy deployments now functional Related: #179, OSS-321 * fix: resolve React hook exhaustive-deps warning in AddExistingSourceDialog Wrap performSearch function in useCallback to properly memoize it and satisfy React Hook exhaustive-deps rule. This prevents unnecessary re-renders and ensures the useEffect dependency array is correctly specified. Changes: - Import useCallback from React - Wrap performSearch with useCallback([debouncedSearchQuery, allSources]) - Add performSearch to useEffect dependency array * final fixes	2025-10-24 11:24:14 -03:00
Luis Novo	b5666c4d68	Fix/increase fix: increase API client timeouts for transformation operations timeouts (#170 ) * fix: increase API client timeouts for transformation operations - Increase frontend timeout from 30s to 300s (5 minutes) - Increase Streamlit API client timeout from 30s to 300s - Add API_CLIENT_TIMEOUT environment variable for configurability - Add ESPERANTO_LLM_TIMEOUT environment variable documentation - Update .env.example with comprehensive timeout documentation Fixes #131 - API timeout errors during transformation generation Transformations now have sufficient time to complete on slower hardware (Ollama, LM Studio) without frontend timeout errors. Users can now configure timeouts for both the API client layer (API_CLIENT_TIMEOUT) and the LLM provider layer (ESPERANTO_LLM_TIMEOUT) to accommodate their specific hardware and network conditions. * docs: add timeout configuration documentation - Add comprehensive timeout troubleshooting section to common-issues.md - Add FAQ entry about timeout errors during transformations - Document API_CLIENT_TIMEOUT and ESPERANTO_LLM_TIMEOUT usage - Provide specific timeout recommendations for different hardware/network scenarios - Link to GitHub issue #131 for reference * chore: bump * refactor: improve timeout configuration with validation and consistency Based on PR review feedback, this commit addresses several improvements: Timeout Validation: - Add validation to ensure timeout values are between 30s and 3600s - Invalid values fall back to default 300s with warning logs - Handles edge cases (negative, zero, invalid strings) Fix Hard-coded Timeouts: - Replace all hard-coded timeout values in api/client.py - ask_simple: 300s → self.timeout - execute_transformation: 120s → self.timeout - embed_content: 120s → self.timeout - create_source: 300s → self.timeout - rebuild_embeddings: Uses smart logic (2x timeout, max 3600s) Improved Documentation: - Add clarifying comments about ms vs seconds (frontend vs backend) - Document that frontend uses 300000ms = backend 300s - Add inline documentation for rebuild_embeddings timeout logic Development Dependencies: - Add pytest>=8.0.0 to dev dependencies for future test coverage This makes timeout configuration more robust, consistent, and user-friendly while maintaining backward compatibility.	2025-10-19 11:37:24 -03:00
Luis Novo	04b5a9c96a	Implement a serverside fix for reverse proxy users (#169 )	2025-10-19 08:02:21 -03:00
Luis Novo	4c2b8257fc	OpenAI compatible multimodal (#167 ) * fix text * remove lint from docker publish workflow * gemini base url docs * feat: add multimodal support for openai-compatible providers - Add helper function to check OpenAI-compatible provider availability per mode - Update provider detection to support language, embedding, STT, and TTS modalities - Implement mode-specific environment variable detection (LLM, EMBEDDING, STT, TTS) - Maintain backward compatibility with generic OPENAI_COMPATIBLE_BASE_URL - Add comprehensive unit tests for all configuration scenarios - Update .env.example with mode-specific environment variables - Update provider support matrix in ai-models.md - Create comprehensive openai-compatible.md setup guide This enables users to configure different OpenAI-compatible endpoints for different AI capabilities (e.g., LM Studio for language models, dedicated server for embeddings) while maintaining full backward compatibility. * upgrade * chore: change docker release strategy	2025-10-19 07:44:05 -03:00
Luis Novo	b7e656a319	Version 1 (#160 ) New front-end Launch Chat API Manage Sources Enable re-embedding of all contents Sources can be added without a notebook now Improved settings Enable model selector on all chats Background processing for better experience Dark mode Improved Notes Improved Docs: - Remove all Streamlit references from documentation - Update deployment guides with React frontend setup - Fix Docker environment variables format (SURREAL_URL, SURREAL_PASSWORD) - Update docker image tag from :latest to :v1-latest - Change navigation references (Settings → Models to just Models) - Update development setup to include frontend npm commands - Add MIGRATION.md guide for users upgrading from Streamlit - Update quick-start guide with correct environment variables - Add port 5055 documentation for API access - Update project structure to reflect frontend/ directory - Remove outdated source-chat documentation files	2025-10-18 12:46:22 -03:00
LUIS NOVO	124d7d110c	docs: TTS_BATCH_SIZE Some checks failed Development Build / extract-version (push) Has been cancelled Details Development Build / lint-and-check (push) Has been cancelled Details Development Build / test-build-regular (push) Has been cancelled Details Development Build / test-build-single (push) Has been cancelled Details Development Build / summary (push) Has been cancelled Details	2025-09-14 11:05:34 -03:00
LUIS NOVO	adc8629ea9	docs: add openai compatible env var documentation	2025-07-27 22:39:28 -03:00
LUIS NOVO	893b2f408b	docs: fix env example	2025-07-27 22:31:51 -03:00
Luis Novo	3b2ced54e2	fix environment variable error and enable docker build automation (#94 ) * chore: fix database import error * remove unused file and improve env example * docker build automation	2025-07-17 09:54:28 -03:00
Luis Novo	d7b0fff954	Api podcast migration (#93 ) Creates the API layer for Open Notebook Creates a services API gateway for the Streamlit front-end Migrates the SurrealDB SDK to the official one Change all database calls to async New podcast framework supporting multiple speaker configurations Implement the surreal-commands library for async processing Improve docker image and docker-compose configurations	2025-07-17 08:36:11 -03:00
LUIS NOVO	62a2a39017	docs: remove old chunking configuration	2025-06-10 12:15:49 -03:00
LUIS NOVO	05b28a1f99	docs: remove unneded API BASE	2025-06-10 12:14:49 -03:00
LUIS NOVO	69c0840a25	docs: add voyage API requirements	2025-06-10 12:14:28 -03:00
LUIS NOVO	05a64d90a8	docs: new env variable examples	2025-06-10 11:55:07 -03:00
LUIS NOVO	f4d233925e	docs: add new models env variable examples	2025-06-10 11:54:51 -03:00
LUIS NOVO	74daa15ce7	docs: add examples for URL keys	2025-05-30 15:24:15 -03:00
LUIS NOVO	aa4912334b	fix: replace GEMINI_API_KEY with GOOGLE_API_KEY as per new SDK	2025-05-22 09:12:36 -03:00
熊鑫伟 Xinwei Xiong	83ce4689eb	feat: Update .env.example add OpenAI API Base	2025-04-08 17:07:03 +08:00
LUIS NOVO	666a4f85b9	update env sample	2024-11-13 15:21:01 -03:00
LUIS NOVO	0a868fceb5	remove model info from docs	2024-10-30 14:06:41 -03:00
LUIS NOVO	8e91574938	clean up env variables	2024-10-27 17:26:10 -03:00
LUIS NOVO	01f8eab10e	add podcast support	2024-10-26 05:17:58 -03:00
LUIS NOVO	c3f3c9cb93	update docs to new release info	2024-10-23 15:40:28 -03:00
LUIS NOVO	e70788910d	vertexai instructions	2024-10-22 22:52:51 -03:00
LUIS NOVO	9042b08ae3	add model router and improve prompts	2024-10-22 18:24:24 -03:00
LUIS NOVO	93f766f40d	add support for vertex, anthropic, litellm, ollama and open router	2024-10-22 16:41:20 -03:00
LUIS NOVO	bcd260a28b	Initial commit with all features	2024-10-21 14:56:10 -03:00

32 commits