
How documents are processed with knowledge graph extraction.
Graph RAG indexing adds knowledge graph extraction alongside vector embeddings, creating a triple-index system: vector DB, (optional) BM25, and Knowledge Graph DB.
Same as Simple/Hybrid RAG: extract text and apply preprocessing settings.
Chunks are created for both embedding and graph extraction.
Converts chunks to dense vectors for semantic search.
The LLM processes chunks to extract:
Identifies and classifies named entities from the text.
Determines relationships between extracted entities.
Stores entities and relationships in a graph database (Neo4j).