RAG Wizard

Troubleshooting

Common issues and solutions for RAG Wizard, API calls, and MCP connections.

This guide covers the most common issues you'll encounter while building and deploying RAG systems, with concrete fixes for each scenario.

Upload Failures

File Too Large

Error: File size exceeds 50MB limit

Cause: Single file exceeds the maximum allowed size.

Fix: Split large PDFs or compress images before uploading.

# Split PDF by pages (requires qpdf)
qpdf input.pdf --pages . 1-50 -- part1.pdf
qpdf input.pdf --pages . 51-100 -- part2.pdf
 
# Compress images
convert input.png -quality 80 output.png

For very large documents (100+ pages), consider uploading in sections and using the same project to keep them in one index.

Unsupported File Format

Error: Unsupported file type: .xyz

Supported formats:

Format	Extension	Max Size
PDF	`.pdf`	50MB
Markdown	`.md`, `.mdx`	10MB
Plain Text	`.txt`	10MB
CSV	`.csv`	25MB
Word	`.docx`	25MB
PowerPoint	`.pptx`	25MB

Fix: Convert unsupported formats to one of the supported types above.

Upload Timeout

Error: Request timeout after 30s

Cause: Large file upload exceeded the default timeout.

Fix: Use the POST /rag/upload-and-process endpoint with multipart form data, which streams the upload:

curl -X POST "https://api.guidedmind.ai/rag/upload-and-process" \
  -H "X-API-Key: rk_your_key_here" \
  -F "file=@large_document.pdf" \
  -F "project_id=proj_abc123"

Processing Errors

Low Similarity Scores

Symptom: Search results return scores below 0.3 for queries that should match.

Diagnosis: Use the Similarity Score Test to identify the root cause.

Common causes and fixes:

Cause	Fix
Wrong embedding model for domain	Switch to `text-embedding-3-large` for technical content
Chunks too large	Reduce `chunk_size` to 250-500
Chunks too small	Increase `chunk_size` to 500-1000
Low overlap	Increase `chunk_overlap` to 50-150
Wrong search method	Try Hybrid search for mixed content

Documents Not Appearing in Search

Checklist:

Verify document status is processed (not uploading or failed)
Confirm the document belongs to the correct project
Check that the search query uses natural language (not keywords)
Lower the threshold parameter to see more results

# Test with low threshold to see all available chunks
curl -X POST "https://api.guidedmind.ai/rag/search" \
  -H "X-API-Key: rk_your_key_here" \
  -H "Content-Type: application/json" \
  -d '{
    "query": "test",
    "options": {
      "threshold": 0.1,
      "limit": 20
    }
  }'

Duplicate Chunks

Symptom: Same content appears multiple times in search results.

Cause: Overlapping chunks with high chunk_overlap value, or duplicate documents uploaded.

Fix:

Reduce chunk_overlap to 50-100
Check for duplicate files in your data source
Use the Chunk Review Tool to preview chunk boundaries

Rate Limiting

Understanding Rate Limits

Plan	Requests/min	Requests/day
Free	30	1,000
Pro	100	10,000
Enterprise	Custom	Custom

Handling 429 Errors

Error Response:

{
  "error": "rate_limit_exceeded",
  "message": "Too many requests. Retry after 30 seconds.",
  "retry_after": 30
}

Fix: Implement exponential backoff:

import time
import requests
 
def search_with_retry(query, max_retries=3):
    for attempt in range(max_retries):
        response = requests.post(
            "https://api.guidedmind.ai/rag/search",
            headers={"X-API-Key": "rk_your_key_here"},
            json={"query": query}
        )
        
        if response.status_code == 429:
            retry_after = response.json().get("retry_after", 30)
            wait_time = retry_after * (2 ** attempt)
            print(f"Rate limited. Waiting {wait_time}s...")
            time.sleep(wait_time)
            continue
            
        return response.json()
    
    raise Exception("Max retries exceeded")

Implement exponential backoff with jitter
Cache search results for repeated queries
Batch uploads instead of individual requests
Monitor your usage in the dashboard

Don't

Retry immediately without delay
Ignore the Retry-After header
Make parallel requests to bypass limits
Share API keys across services

API Key Issues

Invalid API Key

Error:

{
  "error": "authentication_failed",
  "message": "Invalid or expired API key"
}

Fix:

Verify your API key starts with rk_
Check for trailing whitespace when copying
Regenerate from the dashboard if lost

API Access Disabled

Error:

{
  "error": "access_denied",
  "message": "API access is not enabled for this project"
}

Fix: Enable API access in Step 5 of the RAG Wizard (API Endpoints tab), or contact support to enable it for existing projects.

Key Permissions

Each API key is scoped to a specific project. You cannot use Project A's key to access Project B's data.

# This will fail if the key belongs to a different project
curl -X POST "https://api.guidedmind.ai/rag/search" \
  -H "X-API-Key: rk_wrong_project_key" \
  -H "Content-Type: application/json" \
  -d '{"query": "test"}'
# → 403 Forbidden

MCP Connection Issues

SSE Connection Fails

Error: Failed to establish SSE connection

Checklist:

Verify your base URL includes /api/v1/mcp/sse
Confirm your API key is passed as a query parameter
Check that your client supports SSE (Server-Sent Events)

# Correct connection format
curl -N "https://api.guidedmind.ai/api/v1/mcp/sse?api_key=rk_your_key_here"
 
# Expected response
# event: open
# data: {"session_id": "uuid-here", "message_endpoint": "/api/v1/mcp/messages?session_id=uuid-here"}

Messages Not Received

Symptom: SSE connection opens but no tool results return.

Fix: Ensure you're sending messages to the correct message_endpoint from the SSE open event:

# Step 1: Get session_id from SSE open event
# (message_endpoint from step 1)
 
# Step 2: Send to the correct endpoint
curl -X POST "https://api.guidedmind.ai/api/v1/mcp/messages?session_id=YOUR_SESSION_ID" \
  -H "Content-Type: application/json" \
  -d '{
    "jsonrpc": "2.0",
    "id": 1,
    "method": "tools/call",
    "params": {
      "name": "rag_search",
      "arguments": {
        "query": "What is RAG?"
      }
    }
  }'

JSON-RPC Errors

Error: {"jsonrpc": "2.0", "error": {"code": -32601, "message": "Method not found"}}

Cause: Invalid method name in JSON-RPC request.

Valid methods:

Method	Purpose
`tools/list`	List available tools
`tools/call`	Call a specific tool
`initialize`	Initialize MCP session

Search Quality Issues

Irrelevant Results

Symptom: Search returns chunks unrelated to the query.

Iteration steps:

Test different embedding models:
- text-embedding-3-small — Fast, good for general content
- text-embedding-3-large — Best quality, use for technical/legal
- all-MiniLM-L6-v2 — Lightweight, good for simple queries
Adjust chunking:
- Smaller chunks (250-500) — More precise matches
- Larger chunks (1000-2000) — Better context retention
Try Hybrid search: Combines dense (embedding) + sparse (BM25) for best coverage

Empty Results

Symptom: Search returns zero results.

Checklist:

Documents are processed (not stuck in uploading state)
Threshold is not too high (try 0.3 as starting point)
Query uses natural language, not single keywords
Project has at least one processed document

# Diagnostic query with minimal filtering
curl -X POST "https://api.guidedmind.ai/rag/search" \
  -H "X-API-Key: rk_your_key_here" \
  -H "Content-Type: application/json" \
  -d '{
    "query": "anything",
    "options": {
      "threshold": 0.0,
      "limit": 5
    }
  }'

Getting Help

If you're still stuck after trying the fixes above:

Check your project logs in the dashboard for processing errors
Use the Pipeline Test Tool to isolate configuration issues
Contact support with your project ID and a description of the issue

When contacting support, include: your project ID, the API endpoint you're calling, a sample request payload, and the full error response. This helps us diagnose issues faster.

← PreviousTesting & Evaluation Next →Naive RAG