# How Each Index Works
This guide describes how each index works with diagrams.
Some terminology:

- **Node**: Corresponds to a chunk of text from a Document. LlamaIndex takes in Document objects and internally parses/chunks them into Node objects.
- **Response Synthesis**: Our module which synthesizes a response given the retrieved Nodes. You can see how to specify different response modes.
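
As a rough sketch, the Document-to-Node parsing step might look like the following (assuming the `llama_index.core` import layout; exact module paths can differ between versions):

```python
from llama_index.core import Document
from llama_index.core.node_parser import SentenceSplitter

# Wrap raw text in Document objects.
documents = [Document(text="Long source text that will be chunked into Nodes...")]

# Parse/chunk each Document into Node objects.
parser = SentenceSplitter(chunk_size=512, chunk_overlap=20)
nodes = parser.get_nodes_from_documents(documents)
```

The `nodes` produced here are reused in the sketches below.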
## Summary Index (formerly List Index)
The summary index simply stores Nodes as a sequential chain.
### Querying
During query time, if no other query parameters are specified, LlamaIndex simply loads all Nodes in the list into our Response Synthesis module.
The summary index also offers numerous ways of querying, from an embedding-based query that fetches the top-k neighbors, to the addition of a keyword filter, as sketched below:
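
The following is a minimal sketch of the default and embedding-based modes, assuming the `llama_index.core` import layout, a configured LLM and embedding model, and the `nodes` parsed above:

```python
from llama_index.core import SummaryIndex

# Nodes are stored as a sequential chain.
index = SummaryIndex(nodes)

# Default mode: every Node in the list is sent to Response Synthesis.
query_engine = index.as_query_engine()

# Embedding-based mode: retrieve only the top-k most similar Nodes first.
embedding_query_engine = index.as_query_engine(
    retriever_mode="embedding",
    similarity_top_k=3,
)

response = embedding_query_engine.query("What did the author do growing up?")
```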
## Vector Store Index
The vector store index stores each Node and a corresponding embedding in a Vector Store.
### Querying
Querying a vector store index involves fetching the top-k most similar Nodes, and passing those into our Response Synthesis module.
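
For example, a minimal sketch (assuming the `llama_index.core` import layout, a configured embedding model, and the `nodes` parsed above):

```python
from llama_index.core import VectorStoreIndex

# Build the index: each Node is embedded and stored in the vector store.
index = VectorStoreIndex(nodes)

# Query: embed the query, fetch the top-k most similar Nodes,
# then pass them to Response Synthesis.
query_engine = index.as_query_engine(similarity_top_k=3)
response = query_engine.query("What did the author do growing up?")
```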
## Tree Index
The tree index builds a hierarchical tree from a set of Nodes (which become leaf nodes in this tree).
### Querying
Querying a tree index involves traversing from root nodes down to leaf nodes. By default (`child_branch_factor=1`), a query chooses one child node given a parent node. If `child_branch_factor=2`, a query chooses two child nodes per level.
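
A minimal sketch of both settings (assuming the `llama_index.core` import layout, a configured LLM to build the tree's summaries, and the `nodes` parsed above):

```python
from llama_index.core import TreeIndex

# Build the index: parent nodes summarize their children, up to the root.
index = TreeIndex(nodes)

# Default traversal: one child node is chosen per level.
query_engine = index.as_query_engine()

# Wider traversal: two child nodes are chosen per level.
wide_query_engine = index.as_query_engine(child_branch_factor=2)

response = wide_query_engine.query("What did the author do growing up?")
```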
## Keyword Table Index
The keyword table index extracts keywords from each Node and builds a mapping from each keyword to the Nodes containing that keyword.
### Querying
During query time, we extract relevant keywords from the query and match those with pre-extracted Node keywords to fetch the corresponding Nodes. The retrieved Nodes are then passed to our Response Synthesis module.
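
A minimal sketch (assuming the `llama_index.core` import layout, a configured LLM for keyword extraction, and the `nodes` parsed above):

```python
from llama_index.core import KeywordTableIndex

# Build the index: keywords are extracted from each Node and mapped to it.
index = KeywordTableIndex(nodes)

# Query: keywords extracted from the query are matched against the table,
# and the matching Nodes are passed to Response Synthesis.
query_engine = index.as_query_engine()
response = query_engine.query("What did the author do growing up?")
```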