
Recommended settings for deploying a Simple (Naive) RAG system.
Simple RAG uses pure vector (embedding) similarity to find relevant document chunks. It's the fastest architecture to deploy and works well for simple Q&A over a single domain.
The following settings provide a solid baseline for Simple RAG. Adjust based on your evaluation results and specific use case requirements.
Project setup defines the identity and purpose of your RAG system. For Simple RAG, keep the domain focused to maximize retrieval accuracy.
| Setting | Value |
|---|---|
| Domain | Single focused domain |
| Use Case | Q&A Chatbot or Document Search |
Add your documents through direct upload or from your Document Library. For Simple RAG, stick to a single domain and keep documents under 50 MB each for optimal processing.
| Setting | Value |
|---|---|
| Source | Document Upload or Document Library |
| Max File Size | 50 MB per file |
| Recommended Formats | PDF, Markdown, DOCX |
Document processing prepares your raw text for embedding. Simple RAG works best with straightforward preprocessing — clean the text, split into fixed-size chunks, and preserve sentence boundaries.
| Setting | Value |
|---|---|
| Chunking Method | Fixed-Size |
| Chunk Size | 500 tokens |
| Overlap | 10% |
| Respect Sentence Boundaries | Enabled |
Pipeline configuration controls how queries are processed and results are returned. Simple RAG uses pure vector search with no keyword matching or graph traversal, keeping latency low and deployment simple.
| Setting | Value |
|---|---|
| Search Method | Dense |
| Embedding Model | all-MiniLM-L6-v2 or all-mpnet-base-v2 |
| Similarity Function | Cosine |
| Retrieval Method | Custom Document Template |
| BM25 | Disabled |
| LLM Integration | Optional |
| Top K | 5 |
| Score Threshold | 0.7 |
Once configured, generate your API key and integrate using REST API, Python SDK, or MCP Protocol. Test your endpoints before deploying to production.
| Setting | Value |
|---|---|
| API Key | Generated automatically (starts with rk_) |
| Integration | REST API, Python SDK, or MCP |
| Test Endpoints | Use built-in testing tools |
Consider Hybrid RAG or Graph RAG when:
Start with Simple RAG to establish a baseline, then measure retrieval quality before upgrading to Hybrid or Graph RAG.