Logo IconGuided Mind
v2.4Sign In
RAG Wizard

Simple RAG Configuration

Recommended settings for deploying a Simple (Naive) RAG system.

Simple RAG uses pure vector (embedding) similarity to find relevant document chunks. It's the fastest architecture to deploy and works well for simple Q&A over a single domain.

Configuration Flow

The following settings provide a solid baseline for Simple RAG. Adjust based on your evaluation results and specific use case requirements.

Step 1: Project Setup

Project setup defines the identity and purpose of your RAG system. For Simple RAG, keep the domain focused to maximize retrieval accuracy.

SettingValue
DomainSingle focused domain
Use CaseQ&A Chatbot or Document Search

Step 2: Knowledge Sources

Add your documents through direct upload or from your Document Library. For Simple RAG, stick to a single domain and keep documents under 50 MB each for optimal processing.

SettingValue
SourceDocument Upload or Document Library
Max File Size50 MB per file
Recommended FormatsPDF, Markdown, DOCX

Step 3: Document Processing

Document processing prepares your raw text for embedding. Simple RAG works best with straightforward preprocessing — clean the text, split into fixed-size chunks, and preserve sentence boundaries.

SettingValue
Chunking MethodFixed-Size
Chunk Size500 tokens
Overlap10%
Respect Sentence BoundariesEnabled

Step 5: Pipeline Configuration

Pipeline configuration controls how queries are processed and results are returned. Simple RAG uses pure vector search with no keyword matching or graph traversal, keeping latency low and deployment simple.

SettingValue
Search MethodDense
Embedding Modelall-MiniLM-L6-v2 or all-mpnet-base-v2
Similarity FunctionCosine
Retrieval MethodCustom Document Template
BM25Disabled
LLM IntegrationOptional
Top K5
Score Threshold0.7

Step 6: API Setup

Once configured, generate your API key and integrate using REST API, Python SDK, or MCP Protocol. Test your endpoints before deploying to production.

SettingValue
API KeyGenerated automatically (starts with rk_)
IntegrationREST API, Python SDK, or MCP
Test EndpointsUse built-in testing tools

When to Use Simple RAG

  • Quick proof-of-concept or prototype
  • Single-domain knowledge base
  • Straightforward Q&A over documentation
  • Limited compute resources
  • < 1000 documents

When to Upgrade

Consider Hybrid RAG or Graph RAG when:

  • Users search with exact keywords that vector search misses
  • You need to answer relationship-based questions
  • Your domain has complex entity relationships
  • Retrieval accuracy plateaus with vector search alone

Start with Simple RAG to establish a baseline, then measure retrieval quality before upgrading to Hybrid or Graph RAG.