RAG is the technology behind Perplexity, ChatGPT Search, Google AI Overviews, and most AI systems connected to the web. Instead of relying only on training knowledge, the LLM consults documents in real time and cites the sources in the answer.
How it works
- User asks a question.
- System converts the question into an embedding (numeric vector).
- Semantic search in a vector index finds the most relevant chunks.
- Retrieved chunks are injected into the LLM's context.
- LLM generates an answer citing the sources.
Implications for SEO/GEO
To be retrieved by RAG, your content needs to be:
- Chunkable: self-contained 200-500 word passages with their own meaning.
- Semantically clear: one central idea per paragraph.
- Indexable: accessible HTML, no heavy JavaScript dependency.
- Citable: source authority is checked before citation.