Help Instance Help

Data Source Connectors

The RAG server supports automatic document syncing from 30 external sources. Connectors are adapted from the open-source RAGFlow project (Apache 2.0 license) and extended with OAuth2 flows, tenant isolation, and per-datasource scheduling.

Syncs run automatically every 15 minutes by default (configurable per datasource). New, updated, and deleted documents are reflected incrementally — only changes since the last sync are processed.

Connecting a Data Source

  1. Navigate to Data Sources in the sidebar

  2. Click Add Source and select a connector

  3. For OAuth2 connectors: click Connect to authorize via the provider

  4. For API key connectors: fill in the credential fields and save

  5. Syncs start automatically after the first successful connection

Connector Reference

Cloud Storage

Connector

Auth

What it syncs

Google Drive

OAuth2 (Google)

Files and folders from My Drive and Shared Drives

OneDrive

OAuth2 (Microsoft)

Files from personal OneDrive

SharePoint

OAuth2 (Microsoft)

Documents from SharePoint sites

Box

OAuth2 (Box)

Files and folders from Box accounts

Dropbox

OAuth2 (Dropbox)

Files and folders from Dropbox

Azure Blob

API key (connection string)

Blobs from an Azure Blob Storage container

Seafile

API key (server URL + token)

Files from Seafile libraries

Collaboration & Knowledge Bases

Connector

Auth

What it syncs

Confluence

OAuth2 (Atlassian)

Pages, blog posts, and attachments from Confluence spaces

Notion

API key (integration token)

Pages and databases from Notion workspaces

Slack

OAuth2 (Slack)

Messages and threads from channels

Microsoft Teams

OAuth2 (Microsoft)

Channel messages from Teams

Asana

API key

Tasks and comments from Asana projects

Zendesk

API key (subdomain + token)

Help Center articles and support tickets

Salesforce

API key (instance URL + token)

Cases, knowledge articles, and objects

DingTalk AI Table

API key

Records from DingTalk AI Table bases

Moodle

API key (site URL + token)

Courses, resources, and forum posts from Moodle LMS

Dev Tools

Connector

Auth

What it syncs

GitHub

OAuth2 (GitHub)

Repository files, README, issues, pull requests

GitLab

API key (personal access token)

Repository files, issues, merge requests

Jira

OAuth2 (Atlassian)

Issues, epics, comments from Jira projects

Bitbucket

API key (username + app password)

Repository files, pull requests, issues

Email

Connector

Auth

What it syncs

Gmail

OAuth2 (Google)

Emails and attachments from Gmail

Outlook

OAuth2 (Microsoft)

Emails and attachments from Outlook/Exchange

IMAP

Credentials (host + user + password)

Emails from any IMAP-compatible mail server

Databases

Connector

Auth

What it syncs

Airtable

API key (personal access token)

Records from Airtable bases and tables

RDBMS

Credentials (host, port, user, password, DB)

Rows from PostgreSQL or MySQL tables, converted to text

Protocols

Connector

Auth

What it syncs

RSS

None (public feeds)

Articles from RSS/Atom feeds

REST API

Optional (Bearer token or API key header)

JSON responses from any REST endpoint

WebDAV

Credentials (server URL + user + password)

Files from WebDAV-compatible servers

OAuth2 Setup

OAuth2 connectors require provider app registration. Set these environment variables on the server:

GOOGLE_OAUTH_CLIENT_ID=... GOOGLE_OAUTH_CLIENT_SECRET=... GITHUB_OAUTH_CLIENT_ID=... GITHUB_OAUTH_CLIENT_SECRET=... SLACK_OAUTH_CLIENT_ID=... SLACK_OAUTH_CLIENT_SECRET=... BOX_OAUTH_CLIENT_ID=... BOX_OAUTH_CLIENT_SECRET=... DROPBOX_OAUTH_CLIENT_ID=... DROPBOX_OAUTH_CLIENT_SECRET=... CONFLUENCE_OAUTH_CLIENT_ID=... CONFLUENCE_OAUTH_CLIENT_SECRET=... JIRA_OAUTH_CLIENT_ID=... JIRA_OAUTH_CLIENT_SECRET=...

The OAuth2 redirect URI registered in the provider app must be:

{FRONTEND_URL}/datasources/oauth2-callback/{provider}

Origin

The connector implementations are adapted from the RAGFlow open-source project by InfiniFlow, Inc., licensed under the Apache License 2.0. Adaptations include tenant isolation, encrypted credential storage, incremental sync scheduling, and OAuth2 flow integration.

16 June 2026