Commit graph

31 commits

Author SHA1 Message Date
Muhamad Aji Wibisono
a0f9efdb67 fix: async thread on discord but 2025-06-02 19:22:04 +07:00
Muhamad Aji Wibisono
4b3c662478 feat: added discord indexer 2025-06-02 18:30:38 +07:00
DESKTOP-RTLN3BA\$punk
0365febc8c fix for content hashing 2025-05-30 19:30:56 -07:00
DESKTOP-RTLN3BA\$punk
73751c0eb1 feat: Removed Hard Dependency on Unstructured.io
- Added Llamaparse Support :)
2025-05-30 19:17:19 -07:00
DESKTOP-RTLN3BA\$punk
a870e07cbf Merge commit 'fd6da4c4723ab28fc40605d109a9ba66cbd364e3' into dev 2025-05-28 23:52:33 -07:00
DESKTOP-RTLN3BA\$punk
5411bac8e0 feat: Added content based hashing to prevent duplicates and fix resync issues 2025-05-28 23:52:00 -07:00
google-labs-jules[bot]
299bb35d8c Hi there, it's Jules! I've just finished making some improvements.
Here's a rundown of what I did:

Fix: Robust Slack rate limiting, error handling & GitHub org repos

This update delivers comprehensive improvements to Slack connector stability and enhances the GitHub connector.

**Slack Connector (`slack_history.py`, `connectors_indexing_tasks.py`):**
- I've implemented proactive delays (1.2s for `conversations.history`, 3s for `conversations.list` pagination) and `Retry-After` header handling for 429 rate limit errors across `conversations.list`, `conversations.history`, and `users.info` API calls.
- I'll now gracefully handle `not_in_channel` errors when fetching conversation history by logging a warning and skipping the channel.
- I've refactored channel info fetching: `get_all_channels` now returns richer channel data (including `is_member`, `is_private`).
- I've removed direct calls to `conversations.info` from `connectors_indexing_tasks.py`, using the richer data from `get_all_channels` instead, to prevent associated rate limits.
- I corrected a `SyntaxError` (non-printable character) in `slack_history.py`.
- I've enhanced logging for rate limit actions, delays, and errors.
- I've updated unit tests in `test_slack_history.py` to cover all new logic.

**GitHub Connector (`github_connector.py`):**
- I've modified `get_user_repositories` to fetch all repositories accessible by you (owned, collaborated, organization) by changing the API call parameter from `type='owner'` to `type='all'`.
- I've included unit tests in `test_github_connector.py` for this change.
2025-05-27 13:39:42 +00:00
DESKTOP-RTLN3BA\$punk
4c8b03ce2b feat: Code Block Syntax Highlighting & Copy Func 2025-05-19 23:19:20 -07:00
Rohan Verma
1d1523a891
Merge pull request #88 from ritikprajapat21/main
Fix #33: Refactored code
2025-05-11 23:59:17 -07:00
DESKTOP-RTLN3BA\$punk
a9db0a8ceb feat: Introduce the RAPTOR Search. 2025-05-11 23:05:56 -07:00
ritikprajapat21
16867b4bed Changes updated 2025-05-12 09:37:37 +05:30
Ritik Prajapat
c1acd9cfaf
Merge branch 'MODSetter:main' into main 2025-05-12 09:36:47 +05:30
DESKTOP-RTLN3BA\$punk
a58550818b feat: Added chat_history to researcher agent 2025-05-10 20:06:19 -07:00
ritikprajapat21
ccd12d3ad2 Fix #33: Refactored code 2025-05-10 17:11:51 +05:30
DESKTOP-RTLN3BA\$punk
1586a0bd78 chore: Added direct handling for markdown files.
- Fixed podcast imports.
2025-05-07 22:04:57 -07:00
DESKTOP-RTLN3BA\$punk
b4bee887bd feat: Added Podcast Feature and its actually fast.
- Fully Async
2025-05-05 23:18:12 -07:00
DESKTOP-RTLN3BA\$punk
22da221ad5 feat: Shifted to RecursiveChunker and CodeChunker
- Codebase Q/A should be lot better now.
2025-04-29 23:02:07 -07:00
DESKTOP-RTLN3BA\$punk
130f43a0fa feat: Removed GPT-Researcher in favour of own SurfSense LangGraph Agent 2025-04-20 19:19:35 -07:00
Adamsmith6300
f2f426d5eb merge conflicts 2025-04-16 21:34:51 -07:00
Adamsmith6300
ae8c74a5aa select repos when adding gh connector 2025-04-16 19:59:38 -07:00
DESKTOP-RTLN3BA\$punk
e0eb9d4b8b feat: Added Linear Connector 2025-04-15 23:10:35 -07:00
Adamsmith6300
a69bbb32f7 Merge branch 'main' of https://github.com/MODSetter/SurfSense into add-github-connector 2025-04-14 15:25:29 -07:00
Adamsmith6300
a26fac435b documents table migration, fix/update github indexing 2025-04-13 21:23:05 -07:00
DESKTOP-RTLN3BA\$punk
0b93c9dfef Fixed current agent citation issues and added sub_section_writer agent for upcoming SurfSense research agent 2025-04-13 20:47:23 -07:00
Adamsmith6300
bb198e38c0 add github connector, add alembic for db migrations, fix bug updating connectors 2025-04-13 13:56:22 -07:00
DESKTOP-RTLN3BA\$punk
b43272a115 feat(youtube): integrate YouTube video processing connector
- Added support for processing YouTube videos, including transcript extraction and document creation.
- Implemented a new background task for adding YouTube video documents.
- Enhanced the connector service to search for YouTube videos and return relevant results.
- Updated frontend components to include YouTube video options in the dashboard and connector sources.
- Added necessary dependencies for YouTube transcript API.
2025-04-11 15:05:17 -07:00
DESKTOP-RTLN3BA\$punk
77833f21f3 feat: Added Extension 2025-03-26 20:41:47 -07:00
DESKTOP-RTLN3BA\$punk
24fd873ca7 fix: Fixed Slack Reindexing 2025-03-26 17:44:38 -07:00
DESKTOP-RTLN3BA\$punk
23da404177 fix: Fixed Notion Reindexing & Updation 2025-03-26 17:19:10 -07:00
DESKTOP-RTLN3BA\$punk
2e702902e4 feat: Integrate query reformulation in stream_connector_search_results 2025-03-19 23:57:02 -07:00
DESKTOP-RTLN3BA\$punk
da23012970 feat: SurfSense v0.0.6 init 2025-03-14 18:53:14 -07:00