Advanced RAG 05: Exploring Semantic Chunking

introducing principles and applications of semantic chunking

Florian June
Towards AI
Published in
14 min readFeb 27, 2024

--

After parsing the document, we can obtain structured or semi-structured data. The main task now is to break them down into smaller chunks to extract detailed features, and then embed these features to represent their semantics. Its position in RAG is shown in Figure 1.

Figure 1 : The position of the Chunking process(red box) in RAG. Image by author.

--

--

AI researcher, focusing on LLMs, RAG, Agent, Document AI, Data Structures. Find the newest AI article in my newsletter: https://florianjune.substack.com/