Query Decomposition
Advanced RAG: Query Decomposition and Reasoning
This is part one of the Advanced Use Cases series:
1️⃣ Extract Metadata from Queries to Improve Retrieval cookbook & full article
2️⃣ Query Expansion cookbook & full article
3️⃣ Query Decomposition & the full article
Query decomposition is a technique for breaking a complex query into simpler sub-questions, answering each sub-question independently, and then having an LLM reason about the final answer based on the answers to those sub-questions.
In this recipe, we're using OpenAI's structured output functionality (currently in beta) to construct a `Questions` object that lists the sub-questions derived from the original question and keeps track of the intermediate answer to each one.
📺 Code Along
You'll need an OpenAI API key and a Cohere API key to follow along.
Load the Dataset
For this demo, we're using the Tuana/game-of-thrones dataset on Hugging Face, embedding the contents with a Cohere embedding model.
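At its core, the indexing step is a loop: embed each document's content and write the document, with its embedding, into a document store. Below is a minimal, framework-agnostic sketch of that loop; the stub `embed` function stands in for the Cohere embedding model, and a plain list stands in for the in-memory document store used in the recipe.

```python
from typing import Callable, Dict, List


def index_documents(
    contents: List[str],
    embed: Callable[[str], List[float]],  # stand-in for a Cohere embedding model
) -> List[Dict]:
    """Embed each document and collect it with its embedding, ready for a store."""
    store = []
    for content in contents:
        store.append({"content": content, "embedding": embed(content)})
    return store


# Toy embedder: a one-dimensional character-count "embedding", purely for illustration.
docs = index_documents(
    ["Winter is coming", "A Lannister always pays his debts"],
    embed=lambda text: [float(len(text))],
)
print(len(docs))  # 2 documents indexed
```

In the recipe itself, Haystack's document embedder and document store handle batching and progress reporting (hence the progress bar in the output above).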
Calculating embeddings: 100%|██████████| 74/74 [01:24<00:00, 1.15s/it]
2357
🧪 Experimental Addition to the OpenAIGenerator for Structured Output Support
🚀 This step is a completely optional advanced step
Let's extend the OpenAIGenerator to make use of the structured output option by OpenAI. Below, we extend the class to call `self.client.beta.chat.completions.parse` if the user has provided a `response_format` in `generation_kwargs`. This allows us to provide a Pydantic model to the generator and request structured outputs that adhere to that Pydantic schema.
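The routing logic can be sketched as follows. Note this is an illustrative stand-alone class assuming an OpenAI-style client, not Haystack's actual `OpenAIGenerator` subclass; the class name and constructor are assumptions for the sketch.

```python
from typing import Any, Dict, Optional


class StructuredOutputGenerator:
    """Illustrative wrapper: routes to OpenAI's beta structured-output endpoint
    when a `response_format` (e.g. a Pydantic model) is supplied."""

    def __init__(self, client: Any, model: str = "gpt-4o-mini"):
        self.client = client
        self.model = model

    def run(self, prompt: str, generation_kwargs: Optional[Dict] = None) -> Dict:
        kwargs = dict(generation_kwargs or {})
        messages = [{"role": "user", "content": prompt}]
        if "response_format" in kwargs:
            # Structured output: the reply is parsed into the given Pydantic schema.
            completion = self.client.beta.chat.completions.parse(
                model=self.model, messages=messages, **kwargs
            )
            return {"structured_reply": completion.choices[0].message.parsed}
        # Default path: a plain text completion.
        completion = self.client.chat.completions.create(
            model=self.model, messages=messages, **kwargs
        )
        return {"replies": [completion.choices[0].message.content]}
```

In the actual recipe this logic lives inside a subclass of Haystack's `OpenAIGenerator`, so the extended generator slots into a `Pipeline` unchanged.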
Define the Pydantic Model
For query decomposition, we want to keep track of intermediate questions that can be answered independently. So, below we define a `Questions` schema. Each `Question` is made up of a `question` and an `answer`.
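A minimal version of that schema looks like this (the field names match the `questions=[Question(question=..., answer=None)]` output shown later in the recipe):

```python
from typing import List, Optional

from pydantic import BaseModel


class Question(BaseModel):
    question: str
    answer: Optional[str] = None  # filled in after retrieval + answering


class Questions(BaseModel):
    questions: List[Question]


decomposed = Questions(questions=[Question(question="How many siblings does Jamie have?")])
print(decomposed.questions[0].answer)  # None until the sub-question is answered
```

Because `Questions` is a Pydantic model, it can be passed directly as the `response_format` for structured outputs.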
Define the Query Decomposition Prompt
The first step in our application is to decompose a question. We define our first `splitter_prompt`, instructing an LLM to take the question step by step and produce multiple sub-questions.
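A prompt along these lines can be expressed as a Jinja template for Haystack's `PromptBuilder`. The exact wording below is an assumption for illustration, not the recipe's verbatim prompt:

```python
# Illustrative splitter prompt; `{{ question }}` is filled in by PromptBuilder.
splitter_prompt = """
You are a helpful assistant that prepares queries for retrieval.
Think step by step and decompose the question below into the smallest set of
sub-questions that can each be answered independently.
If the question is already simple, keep it as a single sub-question.

Question: {{ question }}
"""
```

Paired with the structured-output generator and the `Questions` schema, this step turns one complex query into a list of answerable sub-questions.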
Define a Multi Text Embedder and Retriever
Since we will likely need to answer multiple questions, let's define a multi-question embedder and retriever. This way, one component can accept multiple questions and embed them, and another can retrieve the documents that relate to each question.
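Under the hood, both components are simple loops: one embeds every sub-question, the other runs one retrieval per embedding and pairs each question with its documents. A framework-agnostic sketch, with stub `embed` and `retrieve` callables standing in for the Cohere embedder and the in-memory embedding retriever:

```python
from typing import Callable, Dict, List

Vector = List[float]


def embed_questions(questions: List[str], embed: Callable[[str], Vector]) -> List[Vector]:
    """One embedding per sub-question."""
    return [embed(q) for q in questions]


def retrieve_per_question(
    questions: List[str],
    embeddings: List[Vector],
    retrieve: Callable[[Vector], List[str]],  # embedding -> top-k document contents
) -> List[Dict]:
    """Pair every sub-question with the documents retrieved for it."""
    return [
        {"question": q, "documents": retrieve(e)}
        for q, e in zip(questions, embeddings)
    ]
```

The recipe's `CohereMultiTextEmbedder` and `MultiQueryInMemoryEmbeddingRetriever` wrap exactly this pattern as Haystack components, emitting the `question_context_pairs` consumed by the next prompt.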
Define the Prompt to Answer Multiple Questions
Once we have our decomposed questions, we need to instruct an LLM to answer each question based on the context for each question.
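As a Jinja sketch (wording assumed for illustration), the prompt loops over the question–context pairs produced by the retriever:

```python
# Illustrative multi-question answering prompt for PromptBuilder.
answer_prompt = """
Answer each question below using only its accompanying context.

{% for pair in question_context_pairs %}
Question: {{ pair.question }}
Context:
{% for doc in pair.documents %}
- {{ doc.content }}
{% endfor %}
{% endfor %}
Respond with each question followed by its answer.
"""
```

The LLM answering this prompt also uses the `Questions` structured output, so each intermediate `answer` field gets populated.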
Define the Prompt to Reason About the Final Answer
Our final step will be to instruct the LLM to reason about the final answer based on the decomposed questions and answers to each.
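A sketch of that reasoning prompt (wording assumed; the `question_answer_pair` input name matches the pipeline connections shown in the output below):

```python
# Illustrative final reasoning prompt for PromptBuilder.
reasoning_prompt = """
You answered a complex question by splitting it into simpler sub-questions.
Original question: {{ question }}

Sub-questions and their answers:
{% for pair in question_answer_pair.questions %}
- {{ pair.question }}: {{ pair.answer }}
{% endfor %}

Reason step by step over these answers and state the final answer to the
original question.
"""
```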
Final Step: Construct the Pipeline
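Before wiring the Haystack components together, it helps to see the pipeline's data flow as plain function composition. In this sketch the stub callables replace the LLM, embedder, and retriever calls; in the recipe each stage is a component connected with `pipeline.connect`:

```python
from typing import Callable, Dict, List


def decompose_and_answer(
    question: str,
    split: Callable[[str], List[str]],        # LLM: question -> sub-questions
    retrieve: Callable[[str], List[str]],     # embed + retrieve per sub-question
    answer: Callable[[str, List[str]], str],  # LLM: answer from retrieved context
    reason: Callable[[str, List[Dict]], str], # LLM: final reasoning over Q/A pairs
) -> str:
    """End-to-end flow: decompose, answer each sub-question, reason over the results."""
    sub_questions = split(question)
    qa_pairs = [{"question": q, "answer": answer(q, retrieve(q))} for q in sub_questions]
    return reason(question, qa_pairs)
```

The real pipeline follows the same shape, with structured outputs (`Questions`) carrying the sub-questions and their answers between stages.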
questions=[Question(question='How many siblings does Jamie have?', answer=None), Question(question='How many siblings does Sansa have?', answer=None)]
<haystack.core.pipeline.pipeline.Pipeline object at 0x7cc0585b3760>
🚅 Components
  - prompt: PromptBuilder
  - llm: OpenAIGenerator
  - embedder: CohereMultiTextEmbedder
  - multi_query_retriever: MultiQueryInMemoryEmbeddingRetriever
  - multi_query_prompt: PromptBuilder
  - query_resolver_llm: OpenAIGenerator
  - reasoning_prompt: PromptBuilder
  - reasoning_llm: OpenAIGenerator
🛤️ Connections
  - prompt.prompt -> llm.prompt (str)
  - llm.structured_reply -> embedder.questions (BaseModel)
  - llm.structured_reply -> multi_query_retriever.queries (BaseModel)
  - embedder.embeddings -> multi_query_retriever.query_embeddings (List[List[float]])
  - multi_query_retriever.question_context_pairs -> multi_query_prompt.question_context_pairs (List[Dict])
  - multi_query_prompt.prompt -> query_resolver_llm.prompt (str)
  - query_resolver_llm.structured_reply -> reasoning_prompt.question_answer_pair (BaseModel)
  - reasoning_prompt.prompt -> reasoning_llm.prompt (str)
The original query was split and resolved:

question='How many siblings does Jamie have?' answer='Jaime Lannister has one sibling, Cersei Lannister, who is his twin sister.'

question='How many siblings does Sansa have?' answer='Sansa Stark has five siblings: one older brother (Robb), one younger sister (Arya), and two younger brothers (Bran and Rickon), plus one older illegitimate half-brother (Jon Snow).'

So the original query is answered as follows:

To determine who has more siblings between Jaime and Sansa, we can analyze the information provided in the answers to the simpler questions.

- Jaime Lannister has **1 sibling**, which is Cersei Lannister.
- Sansa Stark has **5 siblings**, which include Robb, Arya, Bran, Rickon, and Jon Snow (considered a half-brother).

Since 5 (Sansa's siblings) is greater than 1 (Jaime's siblings), we can conclude that Sansa has more siblings than Jaime.

**Final Answer:** Sansa has more siblings than Jaime.