Commit graph

17 commits

Author SHA1 Message Date
MSI\ModSetter
9e8929ed2d refactor: Update import path for TaskLoggingService in documents_routes.py 2025-07-21 06:20:44 -07:00
MSI\ModSetter
931fafa403 refactor: Remove deprecated document processing services and update imports
- Deleted the document_processing module and its associated docling_service.
- Updated imports in documents_routes.py and background_tasks.py to reflect the new service structure.
- Ensured compatibility with the task logging system by adjusting type hints for log entries.
2025-07-21 06:19:37 -07:00
Abdullah 3li
f117d94ef7 fix: Resolve merge conflict in documents_routes.py
- Integrated Docling ETL service with new task logging system
- Maintained consistent logging pattern across all ETL services
- Added progress and success/failure logging for Docling processing
2025-07-21 10:43:15 +03:00
Abdullah 3li
aa00822169 feat: Add Docling support as ETL_SERVICE option
- Added DOCLING as third ETL_SERVICE option (alongside UNSTRUCTURED/LLAMACLOUD)
- Implemented add_received_file_document_using_docling function
- Added Docling processing logic in documents_routes.py
- Enhanced chunking with configurable overlap support
- Added comprehensive document processing service
- Supports both CPU and GPU processing with user selection

Addresses #161 - Add Docling Support as an ETL_SERVICE
Follows same pattern as LlamaCloud integration (PR #123)
2025-07-20 11:42:55 +03:00
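Note on aa00822169: the log only says chunking gained "configurable overlap support" and does not show the implementation. The following is a minimal sketch of what overlapping chunking generally looks like, not the code from this commit. The function name chunk_text, its parameters, and the choice of character-based (rather than token-based) splitting are all assumptions made for illustration.

```python
def chunk_text(text: str, chunk_size: int = 500, overlap: int = 50) -> list[str]:
    """Split text into character chunks where consecutive chunks share `overlap` characters.

    Hypothetical helper for illustration; not taken from the repository.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")

    step = chunk_size - overlap  # how far the window advances each time
    chunks: list[str] = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        # Stop once the current window has reached the end of the text,
        # so no trailing chunk is fully contained in the previous one.
        if start + chunk_size >= len(text):
            break
    return chunks
```

A larger overlap keeps more context across chunk boundaries at the cost of producing more chunks, which is why making it configurable is useful.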
MSI\ModSetter
1eb072cc69 feat(BACKEND): Added Log Management System for better bug tracking
- Background tasks are now logged so non-technical users can easily track failure points.
2025-07-16 01:10:33 -07:00
DESKTOP-RTLN3BA\$punk
21fb231683 fix: Markdown & Text files as default support. 2025-07-07 22:55:51 -07:00
DESKTOP-RTLN3BA\$punk
a85f7920a9 feat: added configurable LLMs 2025-06-09 15:50:15 -07:00
DESKTOP-RTLN3BA\$punk
73751c0eb1 feat: Removed Hard Dependency on Unstructured.io
- Added Llamaparse Support :)
2025-05-30 19:17:19 -07:00
DESKTOP-RTLN3BA\$punk
a8080d2dc7 feat: Added Speech to Text support.
- Supports audio & video files.
- Will be useful for YouTube videos that don't have transcripts.
2025-05-13 21:13:53 -07:00
DESKTOP-RTLN3BA\$punk
1586a0bd78 chore: Added direct handling for markdown files.
- Fixed podcast imports.
2025-05-07 22:04:57 -07:00
DESKTOP-RTLN3BA\$punk
2008b07304 fix: Docs & Chats in other search spaces 2025-04-17 23:19:56 -07:00
DESKTOP-RTLN3BA\$punk
b43272a115 feat(youtube): integrate YouTube video processing connector
- Added support for processing YouTube videos, including transcript extraction and document creation.
- Implemented a new background task for adding YouTube video documents.
- Enhanced the connector service to search for YouTube videos and return relevant results.
- Updated frontend components to include YouTube video options in the dashboard and connector sources.
- Added necessary dependencies for YouTube transcript API.
2025-04-11 15:05:17 -07:00
DESKTOP-RTLN3BA\$punk
1609e59086 YouTube video processing utils 2025-04-09 18:46:10 -07:00
DESKTOP-RTLN3BA\$punk
8cd1264d3f feat: Updated the extension for SurfSense v0.0.6 2025-03-26 20:02:53 -07:00
DESKTOP-RTLN3BA\$punk
ee0c518553 not-integrated: Add DocumentHybridSearchRetriever 2025-03-20 22:56:24 -07:00
DESKTOP-RTLN3BA\$punk
709aa6f303 feat: Added Docker Support and missing dependencies. 2025-03-20 18:52:06 -07:00
DESKTOP-RTLN3BA\$punk
da23012970 feat: SurfSense v0.0.6 init 2025-03-14 18:53:14 -07:00