Commit graph

75 commits

Author SHA1 Message Date
Rohan Verma
c2030cec48
Merge pull request #275 from CREDO23/improvement/persist-refreshed-token-in-google-related-connector
[Improvement] Google connectors | Update the connector config after refreshing the token
2025-08-26 18:47:36 -07:00
CREDO23
ecbb1f27e0 clean up 2025-08-26 11:53:27 +02:00
CREDO23
85664f2ff8 update the connector config after refreshing google calendar access token 2025-08-26 11:49:31 +02:00
DESKTOP-RTLN3BA\$punk
3b87ecc3c5 fix: made notion indexing async 2025-08-21 14:43:04 -07:00
DESKTOP-RTLN3BA\$punk
f443a6636f fix: slack indexing
- Indivisual messages as Document instead of concatinating it.
2025-08-21 14:23:52 -07:00
CREDO23
9711af2b72 refresh the token when expired 2025-08-21 01:09:13 +02:00
CREDO23
b0b6df0971 updated the connector config after refreshing the token 2025-08-20 20:32:08 +02:00
DESKTOP-RTLN3BA\$punk
1c4c61eb04 feat: Fixed Document Summary Content across connectors and processors 2025-08-18 20:51:48 -07:00
CREDO23
089c9d1625 use new indexer files structureclear 2025-08-15 10:11:50 +02:00
CREDO23
69f6a0a278 fix scopes issues for google services 2025-08-15 09:12:40 +02:00
CREDO23
1de0f0309c support google mail indexing 2025-08-15 09:11:09 +02:00
DESKTOP-RTLN3BA\$punk
54374bd7be ruff format 2025-08-12 15:33:17 -07:00
DESKTOP-RTLN3BA\$punk
5aa52375c3 refactor: refactored background_tasks & indexing_tasks 2025-08-12 15:28:13 -07:00
CREDO23
edf46e4de1 update seach source connector schema 2025-08-03 12:16:40 +02:00
CREDO23
4cb00735ac add coderabbit suggestions 2025-07-30 22:25:47 +02:00
CREDO23
b1204f8ac0 add clickup indexing 2025-07-30 21:31:52 +02:00
Utkarsh-Patel-13
1148426009 Biome: Merge upstream main to incldue confluence changes 2025-07-28 09:10:29 -07:00
Utkarsh-Patel-13
bd8dc7934d Fix for podcast assitantt content 2025-07-27 14:14:15 -07:00
CREDO23
8e52a0b201 add relevant coderrabit suggestions 2025-07-27 14:03:39 +02:00
CREDO23
131d362f1e suppport confluence pages indexing 2025-07-26 14:44:26 +02:00
Utkarsh-Patel-13
2827522ebc Fixed formatting and linting post Jira connector PR 2025-07-25 10:52:34 -07:00
CREDO23
655352fc09 fix issues indexing with jira connector 2025-07-25 08:59:32 +02:00
CREDO23
4984aab3f1 update auth type 2025-07-25 08:59:32 +02:00
CREDO23
cd05a06a91 update connector indexing / update connector service 2025-07-25 08:59:32 +02:00
CREDO23
ca98693005 update connector indexing / update connector service 2025-07-25 08:59:32 +02:00
Utkarsh-Patel-13
d359a59f6d Fixed all ruff lint and formatting errors 2025-07-24 14:43:48 -07:00
MSI\ModSetter
931fafa403 refactor: Remove deprecated document processing services and update imports
- Deleted the document_processing module and its associated docling_service.
- Updated imports in documents_routes.py and background_tasks.py to reflect the new service structure.
- Ensured compatibility with the task logging system by adjusting type hints for log entries.
2025-07-21 06:19:37 -07:00
Abdullah 3li
f117d94ef7 fix: Resolve merge conflict in documents_routes.py
- Integrated Docling ETL service with new task logging system
- Maintained consistent logging pattern across all ETL services
- Added progress and success/failure logging for Docling processing
2025-07-21 10:43:15 +03:00
Abdullah 3li
aa00822169 feat: Add Docling support as ETL_SERVICE option
- Added DOCLING as third ETL_SERVICE option (alongside UNSTRUCTURED/LLAMACLOUD)
- Implemented add_received_file_document_using_docling function
- Added Docling processing logic in documents_routes.py
- Enhanced chunking with configurable overlap support
- Added comprehensive document processing service
- Supports both CPU and GPU processing with user selection

Addresses #161 - Add Docling Support as an ETL_SERVICE
Follows same pattern as LlamaCloud integration (PR #123)
2025-07-20 11:42:55 +03:00
Utkarsh-Patel-13
92781e726c Updated Streaming Service to efficently stream content\
\
- Earlier for each chunk, whole message (with all annotations included)
  were streamed. Leading to extremely large data length.
- Fixed to only stream new chunk.
- Updated ANSWER part to be streamed as message content (following
  Vercel's Stream Protocol)\
- Fixed yield typo
2025-07-18 17:43:07 -07:00
MSI\ModSetter
ba44256bf0 feat(BACKEND): Added task logging for podcast generation
- Integrated TaskLoggingService to log the start, progress, success, and failure of podcast generation tasks.
- Updated user ID handling to ensure it is consistently converted to a string across various tasks.
- Modified frontend success message to direct users to the logs tab for status updates on podcast generation.
2025-07-17 02:39:36 -07:00
MSI\ModSetter
3f62121ced feat(BACKEND): Implement task logging for connector indexing tasks
- Added TaskLoggingService to log the start, progress, success, and failure of indexing tasks for Slack, Notion, GitHub, Linear, and Discord connectors.
- Updated frontend to reflect changes in indexing status messages.
2025-07-17 02:16:53 -07:00
MSI\ModSetter
1eb072cc69 feat(BACKEND): Added Log Management System for better Bug's Tracking
- Background tasks are now logged so non tech users can effectively track the failurte points easily.
2025-07-16 01:10:33 -07:00
DESKTOP-RTLN3BA\$punk
5045b7433a refactor: Move utility services to a dedicated 'services' module
- Updated import paths for LLM, connector, query, and streaming services to reflect their new location in the 'services' module.
- Removed obsolete utility service files that have been migrated.
2025-07-06 17:51:24 -07:00
DESKTOP-RTLN3BA\$punk
d8f2c5f7cf fix: generate content hash based on search space id as well.
- Allows Reindexing in selperate seatch spaces.
2025-06-10 13:56:23 -07:00
DESKTOP-RTLN3BA\$punk
a85f7920a9 feat: added configurable LLM's 2025-06-09 15:50:15 -07:00
DESKTOP-RTLN3BA\$punk
99fa03d78b feat: Added Calender Based Indexing.
- This should stabalize manual syning.
2025-06-06 18:17:47 -07:00
DESKTOP-RTLN3BA\$punk
d7bb31f894 feat: Document Selector in Chat.
- Still need improvements but lets use it first.
2025-06-04 21:46:50 -07:00
DESKTOP-RTLN3BA\$punk
deea3fa544 chore: remove VERSION file and clean up unused imports across multiple files 2025-06-03 01:33:21 -07:00
DESKTOP-RTLN3BA\$punk
0c07898f4a feat: Added Q/A Mode in Research Agent 2025-06-03 00:10:35 -07:00
Muhamad Aji Wibisono
a3c91484ac feat: thread yielding for bot responsivity 2025-06-02 22:02:12 +07:00
Muhamad Aji Wibisono
ebfc2cb679 feat: optimized discord indexing by matching the document pattern 2025-06-02 21:04:13 +07:00
Muhamad Aji Wibisono
42cc1b8fbe feat: applied coderabbit suggestions 2025-06-02 21:03:50 +07:00
Muhamad Aji Wibisono
d11b636113 feat: removed limit on message fetching 2025-06-02 20:26:14 +07:00
Muhamad Aji Wibisono
a0f9efdb67 fix: async thread on discord but 2025-06-02 19:22:04 +07:00
Muhamad Aji Wibisono
4b3c662478 feat: added discord indexer 2025-06-02 18:30:38 +07:00
DESKTOP-RTLN3BA\$punk
0365febc8c fix for content hashing 2025-05-30 19:30:56 -07:00
DESKTOP-RTLN3BA\$punk
73751c0eb1 feat: Removed Hard Dependency on Unstructured.io
- Added Llamaparse Support :)
2025-05-30 19:17:19 -07:00
DESKTOP-RTLN3BA\$punk
a870e07cbf Merge commit 'fd6da4c4723ab28fc40605d109a9ba66cbd364e3' into dev 2025-05-28 23:52:33 -07:00
DESKTOP-RTLN3BA\$punk
5411bac8e0 feat: Added content based hashing to prevent duplicates and fix resync issues 2025-05-28 23:52:00 -07:00