
Build a basic vector search system with embedding similarity using the RAG Wizard.
Naive RAG uses pure vector (embedding) similarity to find relevant document chunks. It's the fastest architecture to deploy and works well for simple Q&A over a single domain.
Configure your project with a focused domain:
Upload documents from a single domain for best results. Mix multiple topics and the similarity scores will drop.
| Setting | Recommended Value |
|---|---|
| Chunking Method | Fixed-Size |
| Chunk Size | 512 tokens |
| Overlap | 50 tokens |
| Respect Sentence Boundaries | Enabled |
| Setting | Value |
|---|---|
| Search Method | Dense |
| Embedding Model | text-embedding-3-small |
| Similarity Function | Cosine |
Start with text-embedding-3-small for speed. Switch to text-embedding-3-large if you notice low recall on technical content.
Generate your API key and integrate using REST API, Python SDK, or MCP Protocol:
curl -X POST "https://api.guidedmind.ai/rag/search" \
-H "X-API-Key: rk_your_key_here" \
-H "Content-Type: application/json" \
-d '{
"query": "What is the refund policy?",
"options": {
"limit": 5,
"threshold": 0.7
}
}'| Scenario | Fit |
|---|---|
| Single-domain documents | Excellent |
| Natural language queries | Excellent |
| Simple Q&A | Good |
| Keyword-heavy queries | Poor |
| Multi-hop reasoning | Poor |
Consider switching to Hybrid RAG if:
Consider Graph RAG if: