RAG Wizard

Simple RAG Configuration

Recommended settings for deploying a Simple (Naive) RAG system.

Simple RAG uses pure vector (embedding) similarity to find relevant document chunks. It's the fastest architecture to deploy and works well for simple Q&A over a single domain.

Configuration Flow

Recommended Settings

The following settings provide a solid baseline for Simple RAG. Adjust based on your evaluation results and specific use case requirements.

Step 1: Project Setup

Project setup defines the identity and purpose of your RAG system. For Simple RAG, keep the domain focused to maximize retrieval accuracy.

Setting	Value
Domain	Single focused domain
Use Case	Q&A Chatbot or Document Search

Step 2: Knowledge Sources

Add your documents through direct upload or from your Document Library. For Simple RAG, stick to a single domain and keep documents under 50 MB each for optimal processing.

Setting	Value
Source	Document Upload or Document Library
Max File Size	50 MB per file
Recommended Formats	PDF, Markdown, DOCX

Step 3: Document Processing

Document processing prepares your raw text for embedding. Simple RAG works best with straightforward preprocessing — clean the text, split into fixed-size chunks, and preserve sentence boundaries.

Setting	Value
Chunking Method	Fixed-Size
Chunk Size	500 tokens
Overlap	10%
Respect Sentence Boundaries	Enabled

Step 5: Pipeline Configuration

Pipeline configuration controls how queries are processed and results are returned. Simple RAG uses pure vector search with no keyword matching or graph traversal, keeping latency low and deployment simple.

Setting	Value
Search Method	Dense
Embedding Model	`all-MiniLM-L6-v2` or `all-mpnet-base-v2`
Similarity Function	Cosine
Retrieval Method	Custom Document Template
BM25	Disabled
LLM Integration	Optional
Top K	5
Score Threshold	0.7

Step 6: API Setup

Once configured, generate your API key and integrate using REST API, Python SDK, or MCP Protocol. Test your endpoints before deploying to production.

Setting	Value
API Key	Generated automatically (starts with `rk_`)
Integration	REST API, Python SDK, or MCP
Test Endpoints	Use built-in testing tools

When to Use Simple RAG

Quick proof-of-concept or prototype
Single-domain knowledge base
Straightforward Q&A over documentation
Limited compute resources
< 1000 documents

When to Upgrade

Consider Hybrid RAG or Graph RAG when:

Users search with exact keywords that vector search misses
You need to answer relationship-based questions
Your domain has complex entity relationships
Retrieval accuracy plateaus with vector search alone

Start with Simple RAG to establish a baseline, then measure retrieval quality before upgrading to Hybrid or Graph RAG.

← PreviousAPI Setup Next →Hybrid RAG Config