Commit graph

64 commits

Author SHA1 Message Date
DESKTOP-RTLN3BA\$punk
54374bd7be ruff format 2025-08-12 15:33:17 -07:00
DESKTOP-RTLN3BA\$punk
5aa52375c3 refactor: refactored background_tasks & indexing_tasks 2025-08-12 15:28:13 -07:00
CREDO23
edf46e4de1 update seach source connector schema 2025-08-03 12:16:40 +02:00
CREDO23
4cb00735ac add coderabbit suggestions 2025-07-30 22:25:47 +02:00
CREDO23
b1204f8ac0 add clickup indexing 2025-07-30 21:31:52 +02:00
Utkarsh-Patel-13
1148426009 Biome: Merge upstream main to incldue confluence changes 2025-07-28 09:10:29 -07:00
Utkarsh-Patel-13
bd8dc7934d Fix for podcast assitantt content 2025-07-27 14:14:15 -07:00
CREDO23
8e52a0b201 add relevant coderrabit suggestions 2025-07-27 14:03:39 +02:00
CREDO23
131d362f1e suppport confluence pages indexing 2025-07-26 14:44:26 +02:00
Utkarsh-Patel-13
2827522ebc Fixed formatting and linting post Jira connector PR 2025-07-25 10:52:34 -07:00
CREDO23
655352fc09 fix issues indexing with jira connector 2025-07-25 08:59:32 +02:00
CREDO23
4984aab3f1 update auth type 2025-07-25 08:59:32 +02:00
CREDO23
cd05a06a91 update connector indexing / update connector service 2025-07-25 08:59:32 +02:00
CREDO23
ca98693005 update connector indexing / update connector service 2025-07-25 08:59:32 +02:00
Utkarsh-Patel-13
d359a59f6d Fixed all ruff lint and formatting errors 2025-07-24 14:43:48 -07:00
MSI\ModSetter
931fafa403 refactor: Remove deprecated document processing services and update imports
- Deleted the document_processing module and its associated docling_service.
- Updated imports in documents_routes.py and background_tasks.py to reflect the new service structure.
- Ensured compatibility with the task logging system by adjusting type hints for log entries.
2025-07-21 06:19:37 -07:00
Abdullah 3li
f117d94ef7 fix: Resolve merge conflict in documents_routes.py
- Integrated Docling ETL service with new task logging system
- Maintained consistent logging pattern across all ETL services
- Added progress and success/failure logging for Docling processing
2025-07-21 10:43:15 +03:00
Abdullah 3li
aa00822169 feat: Add Docling support as ETL_SERVICE option
- Added DOCLING as third ETL_SERVICE option (alongside UNSTRUCTURED/LLAMACLOUD)
- Implemented add_received_file_document_using_docling function
- Added Docling processing logic in documents_routes.py
- Enhanced chunking with configurable overlap support
- Added comprehensive document processing service
- Supports both CPU and GPU processing with user selection

Addresses #161 - Add Docling Support as an ETL_SERVICE
Follows same pattern as LlamaCloud integration (PR #123)
2025-07-20 11:42:55 +03:00
Utkarsh-Patel-13
92781e726c Updated Streaming Service to efficently stream content\
\
- Earlier for each chunk, whole message (with all annotations included)
  were streamed. Leading to extremely large data length.
- Fixed to only stream new chunk.
- Updated ANSWER part to be streamed as message content (following
  Vercel's Stream Protocol)\
- Fixed yield typo
2025-07-18 17:43:07 -07:00
MSI\ModSetter
ba44256bf0 feat(BACKEND): Added task logging for podcast generation
- Integrated TaskLoggingService to log the start, progress, success, and failure of podcast generation tasks.
- Updated user ID handling to ensure it is consistently converted to a string across various tasks.
- Modified frontend success message to direct users to the logs tab for status updates on podcast generation.
2025-07-17 02:39:36 -07:00
MSI\ModSetter
3f62121ced feat(BACKEND): Implement task logging for connector indexing tasks
- Added TaskLoggingService to log the start, progress, success, and failure of indexing tasks for Slack, Notion, GitHub, Linear, and Discord connectors.
- Updated frontend to reflect changes in indexing status messages.
2025-07-17 02:16:53 -07:00
MSI\ModSetter
1eb072cc69 feat(BACKEND): Added Log Management System for better Bug's Tracking
- Background tasks are now logged so non tech users can effectively track the failurte points easily.
2025-07-16 01:10:33 -07:00
DESKTOP-RTLN3BA\$punk
5045b7433a refactor: Move utility services to a dedicated 'services' module
- Updated import paths for LLM, connector, query, and streaming services to reflect their new location in the 'services' module.
- Removed obsolete utility service files that have been migrated.
2025-07-06 17:51:24 -07:00
DESKTOP-RTLN3BA\$punk
d8f2c5f7cf fix: generate content hash based on search space id as well.
- Allows Reindexing in selperate seatch spaces.
2025-06-10 13:56:23 -07:00
DESKTOP-RTLN3BA\$punk
a85f7920a9 feat: added configurable LLM's 2025-06-09 15:50:15 -07:00
DESKTOP-RTLN3BA\$punk
99fa03d78b feat: Added Calender Based Indexing.
- This should stabalize manual syning.
2025-06-06 18:17:47 -07:00
DESKTOP-RTLN3BA\$punk
d7bb31f894 feat: Document Selector in Chat.
- Still need improvements but lets use it first.
2025-06-04 21:46:50 -07:00
DESKTOP-RTLN3BA\$punk
deea3fa544 chore: remove VERSION file and clean up unused imports across multiple files 2025-06-03 01:33:21 -07:00
DESKTOP-RTLN3BA\$punk
0c07898f4a feat: Added Q/A Mode in Research Agent 2025-06-03 00:10:35 -07:00
Muhamad Aji Wibisono
a3c91484ac feat: thread yielding for bot responsivity 2025-06-02 22:02:12 +07:00
Muhamad Aji Wibisono
ebfc2cb679 feat: optimized discord indexing by matching the document pattern 2025-06-02 21:04:13 +07:00
Muhamad Aji Wibisono
42cc1b8fbe feat: applied coderabbit suggestions 2025-06-02 21:03:50 +07:00
Muhamad Aji Wibisono
d11b636113 feat: removed limit on message fetching 2025-06-02 20:26:14 +07:00
Muhamad Aji Wibisono
a0f9efdb67 fix: async thread on discord but 2025-06-02 19:22:04 +07:00
Muhamad Aji Wibisono
4b3c662478 feat: added discord indexer 2025-06-02 18:30:38 +07:00
DESKTOP-RTLN3BA\$punk
0365febc8c fix for content hashing 2025-05-30 19:30:56 -07:00
DESKTOP-RTLN3BA\$punk
73751c0eb1 feat: Removed Hard Dependency on Unstructured.io
- Added Llamaparse Support :)
2025-05-30 19:17:19 -07:00
DESKTOP-RTLN3BA\$punk
a870e07cbf Merge commit 'fd6da4c4723ab28fc40605d109a9ba66cbd364e3' into dev 2025-05-28 23:52:33 -07:00
DESKTOP-RTLN3BA\$punk
5411bac8e0 feat: Added content based hashing to prevent duplicates and fix resync issues 2025-05-28 23:52:00 -07:00
google-labs-jules[bot]
299bb35d8c Hi there, it's Jules! I've just finished making some improvements.
Here's a rundown of what I did:

Fix: Robust Slack rate limiting, error handling & GitHub org repos

This update delivers comprehensive improvements to Slack connector stability and enhances the GitHub connector.

**Slack Connector (`slack_history.py`, `connectors_indexing_tasks.py`):**
- I've implemented proactive delays (1.2s for `conversations.history`, 3s for `conversations.list` pagination) and `Retry-After` header handling for 429 rate limit errors across `conversations.list`, `conversations.history`, and `users.info` API calls.
- I'll now gracefully handle `not_in_channel` errors when fetching conversation history by logging a warning and skipping the channel.
- I've refactored channel info fetching: `get_all_channels` now returns richer channel data (including `is_member`, `is_private`).
- I've removed direct calls to `conversations.info` from `connectors_indexing_tasks.py`, using the richer data from `get_all_channels` instead, to prevent associated rate limits.
- I corrected a `SyntaxError` (non-printable character) in `slack_history.py`.
- I've enhanced logging for rate limit actions, delays, and errors.
- I've updated unit tests in `test_slack_history.py` to cover all new logic.

**GitHub Connector (`github_connector.py`):**
- I've modified `get_user_repositories` to fetch all repositories accessible by you (owned, collaborated, organization) by changing the API call parameter from `type='owner'` to `type='all'`.
- I've included unit tests in `test_github_connector.py` for this change.
2025-05-27 13:39:42 +00:00
DESKTOP-RTLN3BA\$punk
4c8b03ce2b feat: Code Block Syntax Highlighting & Copy Func 2025-05-19 23:19:20 -07:00
Rohan Verma
1d1523a891
Merge pull request #88 from ritikprajapat21/main
Fix #33: Refactored code
2025-05-11 23:59:17 -07:00
DESKTOP-RTLN3BA\$punk
a9db0a8ceb feat: Introduce the RAPTOR Search. 2025-05-11 23:05:56 -07:00
ritikprajapat21
16867b4bed Changes updated 2025-05-12 09:37:37 +05:30
Ritik Prajapat
c1acd9cfaf
Merge branch 'MODSetter:main' into main 2025-05-12 09:36:47 +05:30
DESKTOP-RTLN3BA\$punk
a58550818b feat: Added chat_history to researcher agent 2025-05-10 20:06:19 -07:00
ritikprajapat21
ccd12d3ad2 Fix #33: Refactored code 2025-05-10 17:11:51 +05:30
DESKTOP-RTLN3BA\$punk
1586a0bd78 chore: Added direct handling for markdown files.
- Fixed podcast imports.
2025-05-07 22:04:57 -07:00
DESKTOP-RTLN3BA\$punk
b4bee887bd feat: Added Podcast Feature and its actually fast.
- Fully Async
2025-05-05 23:18:12 -07:00
DESKTOP-RTLN3BA\$punk
22da221ad5 feat: Shifted to RecursiveChunker and CodeChunker
- Codebase Q/A should be lot better now.
2025-04-29 23:02:07 -07:00