Common Issues and Solutions
This document covers the most frequently encountered issues when installing, configuring, and using Open Notebook, along with their solutions.
Installation Problems
Port Already in Use
Problem: Error message "Port 8502 is already in use" or similar port conflicts.
Symptoms:
- Cannot start React frontend
- Error messages about address already in use
- Services failing to bind to ports
Solutions:
- Find and stop the conflicting process:
  # Check what's using port 8502
  lsof -i :8502
  # Kill the process (replace PID with actual process ID)
  kill -9 <PID>
- Use different ports:
  # For the React frontend (Next.js)
  cd frontend && npm run dev -- -p 8503
  # For Docker deployment, modify docker-compose.yml
  ports:
    - "8503:8502"  # host:container
- Common port conflicts:
  - Port 8502 (Next.js): Often used by other Next.js apps
  - Port 5055 (API): May conflict with other web services
  - Port 8000 (SurrealDB): May conflict with other databases
Permission Denied (Docker)
Problem: Docker commands fail with permission denied errors.
Symptoms:
- "permission denied while trying to connect to the Docker daemon socket"
- Docker commands require sudo
Solutions:
- Add user to docker group (Linux):
  sudo usermod -aG docker $USER
  # Log out and log back in, or run:
  newgrp docker
- Start Docker service (Linux):
  sudo systemctl start docker
  sudo systemctl enable docker
- Restart Docker Desktop (Windows/Mac):
  - Close Docker Desktop completely
  - Restart Docker Desktop
  - Wait for it to fully start
Python/uv Installation Issues
Problem: uv command not found or Python version conflicts.
Symptoms:
- "uv: command not found"
- Python version mismatch errors
- Virtual environment issues
Solutions:
- Install the uv package manager:
  # macOS
  brew install uv
  # Linux/WSL
  curl -LsSf https://astral.sh/uv/install.sh | sh
  source ~/.bashrc
  # Windows
  powershell -c "irm https://astral.sh/uv/install.ps1 | iex"
- Fix Python version issues:
  # Install specific Python version
  uv python install 3.11
  # Pin Python version for project
  uv python pin 3.11
  # Recreate virtual environment
  uv sync --reinstall
- Clear uv cache:
  uv cache clean
SurrealDB Connection Issues
Problem: Cannot connect to SurrealDB database.
Symptoms:
- "Connection refused" errors
- Database queries failing
- Timeout errors
Solutions:
- Check SurrealDB is running:
  # For Docker
  docker compose ps surrealdb
  # Check logs
  docker compose logs surrealdb
- Verify connection settings:
  # Check environment variables
  echo $SURREAL_URL
  echo $SURREAL_USER
  # Test connection
  curl http://localhost:8000/health
- Restart SurrealDB:
  docker compose restart surrealdb
  # Wait 10 seconds for startup
  sleep 10
- Check file permissions:
  # Ensure data directory is writable
  ls -la surreal_data/
  # Fix permissions if needed
  sudo chown -R $USER:$USER surreal_data/
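After a restart, SurrealDB may take a few seconds before it accepts connections. Instead of a fixed sleep, a small polling loop can wait until the /health endpoint responds. This is a sketch, assuming SurrealDB listens on localhost:8000 as in the default docker-compose.yml; adjust the URL and attempt count for your setup:

```bash
#!/usr/bin/env bash
# Poll the SurrealDB health endpoint until it responds, or give up.
wait_for_surrealdb() {
  local url="${1:-http://localhost:8000/health}"
  local attempts="${2:-30}"
  for ((i = 1; i <= attempts; i++)); do
    if curl -sf "$url" > /dev/null; then
      echo "SurrealDB is up (attempt $i)"
      return 0
    fi
    sleep 1
  done
  echo "SurrealDB did not become healthy after $attempts attempts" >&2
  return 1
}
```

Usage: run `docker compose restart surrealdb && wait_for_surrealdb` before starting the other services.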
Runtime Errors
AI Provider API Errors
Problem: Errors when using AI models (OpenAI, Anthropic, etc.).
Symptoms:
- "Invalid API key" errors
- "Rate limit exceeded" messages
- Model not found errors
Solutions:
- Verify API keys:
  # Check key format (don't expose full key)
  echo $OPENAI_API_KEY | cut -c1-10
  # Test OpenAI key
  curl -H "Authorization: Bearer $OPENAI_API_KEY" \
    https://api.openai.com/v1/models
- Check billing and usage:
  - OpenAI: Visit https://platform.openai.com/account/billing
  - Anthropic: Visit https://console.anthropic.com/account/billing
  - Ensure you have sufficient credits
- Verify model availability:
  # Check model names in settings
  # Use gpt-5-mini instead of gpt-4-mini
  # Use claude-3-haiku-20240307 instead of claude-3-haiku
- Handle rate limits:
  - Wait before retrying
  - Use lower-tier models for testing
  - Check provider rate limits
API Timeout Errors During Transformations
Problem: Timeout errors when running transformations or generating insights, even though the operation completes successfully.
Symptoms:
- "timeout of 30000ms exceeded" in React frontend
- "Failed to connect to API: timed out" in Streamlit UI
- Transformation completes after a few minutes, but error appears after 30-60 seconds
- Common with local models (Ollama), remote LM Studio, or slow hardware
Solutions:
- Increase the API client timeout (recommended):
  # Add to your .env file
  API_CLIENT_TIMEOUT=600  # 10 minutes (600 seconds)
  This controls how long the frontend/UI waits for API responses. The default is 300 seconds (5 minutes).
- Adjust the timeout based on your setup:
  # Fast cloud APIs (OpenAI, Anthropic, Groq)
  API_CLIENT_TIMEOUT=300   # 5 minutes (default)
  # Local Ollama on GPU
  API_CLIENT_TIMEOUT=600   # 10 minutes
  # Local Ollama on CPU or slow hardware
  API_CLIENT_TIMEOUT=1200  # 20 minutes
  # Remote LM Studio over slow network
  API_CLIENT_TIMEOUT=900   # 15 minutes
- Increase the LLM provider timeout if needed:
  # Add to your .env file if the model itself is timing out
  ESPERANTO_LLM_TIMEOUT=180  # 3 minutes (default is 60s)
  Only increase this if you see errors during actual model inference, not just HTTP timeouts.
- Use faster models for testing:
  - Test with cloud APIs first to verify setup
  - Try smaller local models (e.g., gemma2:2b instead of llama3:70b)
  - Preload models before running transformations: ollama run model-name
- Restart services after configuration changes:
  # For Docker
  docker compose down
  docker compose up -d
  # For source installation
  make stop-all
  make start-all
Important Notes:
- API_CLIENT_TIMEOUT should be HIGHER than ESPERANTO_LLM_TIMEOUT for proper error handling
- If transformations complete successfully after a refresh, you only need to increase API_CLIENT_TIMEOUT
- The first run of a model may be slower due to model loading
Related GitHub Issue: #131
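The relationship between the two variables can be sanity-checked from your .env file before restarting. This is a rough sketch, assuming simple KEY=value lines (inline comments are tolerated) and the default values of 300s and 60s documented above:

```bash
#!/usr/bin/env bash
# Read both timeout values from a .env file and warn if the API client
# timeout is not higher than the LLM provider timeout.
check_timeouts() {
  local env_file="${1:-.env}"
  local client llm
  client=$(grep -E '^API_CLIENT_TIMEOUT=' "$env_file" | tail -n1 | cut -d= -f2 | awk '{print $1}')
  llm=$(grep -E '^ESPERANTO_LLM_TIMEOUT=' "$env_file" | tail -n1 | cut -d= -f2 | awk '{print $1}')
  client="${client:-300}"  # documented default: 300s
  llm="${llm:-60}"         # documented default: 60s
  if [ "$client" -le "$llm" ]; then
    echo "WARNING: API_CLIENT_TIMEOUT ($client) should be higher than ESPERANTO_LLM_TIMEOUT ($llm)" >&2
    return 1
  fi
  echo "OK: API_CLIENT_TIMEOUT=$client > ESPERANTO_LLM_TIMEOUT=$llm"
}
```

Run it as `check_timeouts .env` after editing the file and before restarting the services.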
Memory and Performance Issues
Problem: Application running slowly or crashing due to memory issues.
Symptoms:
- Slow response times
- Out of memory errors
- Application crashes
- High CPU usage
Solutions:
- Increase Docker memory:
  # Docker Desktop → Settings → Resources → Memory
  # Increase to 4GB or more
- Monitor resource usage:
  # Check Docker stats
  docker stats
  # Check system resources
  htop
  top
- Optimize model usage:
  - Use smaller models (gpt-5-mini vs gpt-5)
  - Reduce context window size
  - Process fewer documents at once
- Clear application cache:
  # Clear Python cache
  find . -name "__pycache__" -type d -exec rm -rf {} +
  # Clear Streamlit cache
  rm -rf ~/.streamlit/cache/
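To see how much the cleanup will touch before deleting anything, a small helper (a sketch built around the same find pattern used above) counts the cache directories under a given path:

```bash
#!/usr/bin/env bash
# Count __pycache__ directories under a path (default: current directory)
# so you can gauge the cleanup before running the rm commands above.
count_pycache_dirs() {
  find "${1:-.}" -name "__pycache__" -type d | wc -l | tr -d ' '
}
```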
Background Job Failures
Problem: Background tasks (podcast generation, transformations) failing.
Symptoms:
- Jobs stuck in "processing" state
- No podcast audio generated
- Transformations not completing
Solutions:
- Check worker status:
  # Check if worker is running
  pgrep -f "surreal-commands-worker"
  # Restart worker
  make worker-restart
- Check job logs:
  # View worker logs
  docker compose logs worker
  # Check command status in database
  # (Access through UI or API)
- Verify AI provider configuration:
  - Ensure TTS/STT models are configured
  - Check API keys for required providers
  - Test models individually
- Clear stuck jobs:
  # Restart all services
  make stop-all
  make start-all
File Upload Issues
Problem: Cannot upload files or file processing fails.
Symptoms:
- Upload button not working
- File processing errors
- Unsupported file type messages
Solutions:
- Check file size limits:
  # Default Next.js limit is 200MB
  # Large files may timeout
- Verify file types:
  - PDF: Standard PDF files (not password protected)
  - Images: PNG, JPG, GIF, WebP
  - Audio: MP3, WAV, M4A
  - Video: MP4, AVI, MOV (for transcript extraction)
  - Documents: TXT, DOC, DOCX
- Check file permissions:
  # Ensure files are readable
  ls -la /path/to/file
  # Fix permissions
  chmod 644 /path/to/file
- Test with smaller files:
  - Try with a simple text file first
  - Gradually increase complexity
Performance Issues
Slow Search and Chat
Problem: Search and chat responses are very slow.
Symptoms:
- Long wait times for responses
- Timeout errors
- Poor user experience
Solutions:
- Optimize the embedding model:
  - Use faster embedding models
  - Reduce embedding dimensions
  - Process fewer documents at once
- Database optimization:
  # Check database performance
  docker compose logs surrealdb
  # Consider using RocksDB for better performance
  # (Already configured in docker-compose.yml)
- Reduce context size:
  - Limit search results
  - Use shorter prompts
  - Reduce notebook size
- Use faster models:
  - gpt-5-mini instead of gpt-5
  - claude-3-haiku instead of claude-3-opus
  - Local models for simple tasks
High Resource Usage
Problem: Application consuming too much CPU or memory.
Symptoms:
- High CPU usage in task manager
- System becoming unresponsive
- Docker containers using excessive resources
Solutions:
- Set resource limits:
  # In docker-compose.yml
  services:
    open_notebook:
      deploy:
        resources:
          limits:
            memory: 2G
            cpus: "1.0"
- Monitor and identify bottlenecks:
  # Check which service is consuming resources
  docker stats
  # Check system processes
  htop
- Optimize processing:
  - Process documents in batches
  - Use background jobs for heavy tasks
  - Limit concurrent operations
Configuration Problems
Environment Variables Not Loading
Problem: Environment variables are not being read correctly.
Symptoms:
- Default values being used instead of configured values
- API keys not recognized
- Connection errors to external services
Solutions:
- Check file names:
  # For source installation
  ls -la .env
  # For Docker
  ls -la docker.env
- Verify file format:
  # Check for invisible characters
  cat -A .env
  # Ensure no spaces around equals
  OPENAI_API_KEY=value    # Correct
  OPENAI_API_KEY = value  # Incorrect
- Check environment loading:
  # Test environment variable
  echo $OPENAI_API_KEY
  # For Docker
  docker compose config
- Restart services after changes:
  # For Docker
  docker compose down
  docker compose up -d
  # For source installation
  make stop-all
  make start-all
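Malformed assignments (spaces around the equals sign) are a common cause of variables silently not loading. A quick lint pass can catch them before you restart. This is a sketch that assumes simple KEY=value lines; it skips comment lines:

```bash
#!/usr/bin/env bash
# Flag .env lines with whitespace around '=', e.g. "KEY = value".
lint_env_file() {
  local env_file="${1:-.env}"
  # Match lines where '=' has whitespace on either side
  if grep -nE '^[^#=]*[[:space:]]+=|^[^#=]*=[[:space:]]' "$env_file"; then
    echo "WARNING: spaces around '=' will break variable loading" >&2
    return 1
  fi
  echo "No malformed assignments found in $env_file"
}
```

Run `lint_env_file .env` (or `lint_env_file docker.env` for Docker) and fix any lines it reports.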
Model Configuration Issues
Problem: AI models not working or configured incorrectly.
Symptoms:
- Model not found errors
- Incorrect responses
- Configuration not saving
Solutions:
- Check model names:
  # Use exact model names from provider documentation
  # OpenAI: gpt-5-mini, gpt-5, text-embedding-3-small
  # Anthropic: claude-3-haiku-20240307, claude-3-sonnet-20240229
- Verify provider configuration:
  - Check API keys are valid
  - Ensure models are available for your account
  - Test with simple requests first
- Reset model configuration:
  - Go to Models
  - Clear all configurations
  - Reconfigure with known working models
- Check provider status:
  - Visit provider status pages
  - Check for service outages
  - Try alternative providers
Database Schema Issues
Problem: Database schema conflicts or migration issues.
Symptoms:
- Field validation errors
- Query failures
- Data not saving correctly
Solutions:
- Check database logs:
  docker compose logs surrealdb
- Reset database (WARNING: this deletes all data):
  # Stop services
  make stop-all
  # Remove database files
  rm -rf surreal_data/
  # Restart services (will recreate the database)
  make start-all
- Manual schema update:
  # Run migrations
  uv run python -m open_notebook.database.async_migrate
- Check SurrealDB version:
  # Ensure using a compatible version
  docker compose pull surrealdb
  docker compose up -d
Getting Help
If you've tried the solutions above and are still experiencing issues:
- Collect diagnostic information:
  # System information
  uname -a
  docker version
  docker compose version
  # Service status
  make status
  # Recent logs
  docker compose logs --tail=100 > logs.txt
- Create a minimal reproduction:
  - Start with a fresh installation
  - Use minimal configuration
  - Document exact steps to reproduce
- Ask for help:
  - Discord: https://discord.gg/37XJPXfz2w
  - GitHub Issues: https://github.com/lfnovo/open-notebook/issues
  - Include all diagnostic information
Remember to remove API keys and sensitive information before sharing logs or configuration files.
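Sensitive values can be scrubbed from logs automatically before sharing. This is a rough sketch, assuming keys appear either as FOO_API_KEY=... assignments or as sk-... tokens; always review the redacted output yourself before posting:

```bash
#!/usr/bin/env bash
# Redact API-key-like values from a log or env dump read on stdin.
redact_secrets() {
  sed -E \
    -e 's/([A-Za-z_]*API_KEY[[:space:]]*=[[:space:]]*).*/\1<redacted>/' \
    -e 's/sk-[A-Za-z0-9_-]+/<redacted>/g'
}
```

Usage: `redact_secrets < logs.txt > logs_redacted.txt`, then share only the redacted file.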