
Upload documents and configure data sources for your RAG system.
Add your knowledge base content through document uploads, URL sources, or existing document libraries.
Knowledge sources are the foundation of your RAG system. The quality and relevance of your search results depend directly on the documents you provide. You can add content through direct uploads or by selecting from previously stored documents.
You can add content to your RAG project in two ways:
| Source | Description |
|---|---|
| Document Upload | Upload files directly (PDF, HTML, CSV, DOCX, MD, XLS) |
| Document Library | Select from previously uploaded documents stored in your account |
Different file formats require different processing pipelines. Understanding which format you're working with helps set expectations for text extraction quality and processing time.
| Format | Extension | Notes |
|---|---|---|
.pdf | Text extraction with optional OCR | |
| HTML | .html, .htm | Script and style tags stripped |
| CSV | .csv | Tabular data converted to text |
| DOCX | .docx | Word documents with formatting |
| Markdown | .md, .mdx | Preserves heading structure |
| Excel | .xls, .xlsx | Spreadsheet data as text |
The Document Library stores all previously uploaded documents in your account, making them available for reuse across multiple RAG projects. This eliminates the need to re-upload common reference materials and keeps your knowledge base consistent. Shared documents from team storage are also accessible through the library.
Previously uploaded documents appear in your Document Library and can be reused across projects. This avoids re-uploading the same files. Shared documents from team storage can also be accessed here.
Following these guidelines ensures optimal processing quality and retrieval performance: