cohere-ai · trentfowlercohere · Oct 29, 2024 · Oct 29, 2024
@@ -40,9 +40,15 @@ This tutorial is split into six parts, with each part focusing on one use case:
     - Multi-step tool calling
     - Multi-step, parallel tool calling
     - Self-correction
-- Part 4: Generating multi-faceted queries
-    - *coming next*
-- Part 5: Querying structured data (tables)
-    - *coming next*
-- Part 6: Querying structured data (databases)
-    - *coming next*
+- [Part 4: Generating multi-faceted queries](/v2/docs/generating-multi-faceted-queries)
+    - Multi-faceted data querying
+    - Setting up the tool to generate multi-faceted queries
+    - Performing multi-faceted queries
+- [Part 5: Querying structured data (tables)](/v2/docs/querying-structured-data-tables)
+    - Python tool for querying tabular data
+    - Setting up the tool to generate pandas queries
+    - Performing queries over structured data (table)
+- [Part 6: Querying structured data (databases)](/v2/docs/querying-structured-data-sql)
+    - Setting up a database
+    - Setting up the tool to generate SQL queries
+    - Performing queries over structured data (SQL)
@@ -0,0 +1,373 @@
+---
+title: Generating Multi-Faceted Queries
+slug: /v2/docs/generating-multi-faceted-queries
+
+description: "Build a system that generates multi-faceted queries to capture the full intent of a user's request."
+image: "../../../../assets/images/f1cc130-cohere_meta_image.jpg"  
+keywords: "Cohere, RAG, agents, function calling,tool use"
+---
+
+<a target="_blank" href="https://colab.research.google.com/github/cohere-ai/notebooks/blob/main/notebooks/guides/agentic-rag/agentic_rag_pt4_multi_faceted_queries.ipynb">Open in Colab</a>
+
+Consider a RAG system that needs to search through a large database of code examples and tutorials. A user might ask for "Python examples using the chat endpoint" or "JavaScript tutorials for text summarization".
+
+In a basic RAG setup, these queries would be passed as-is to a search function, potentially missing important context or failing to leverage the structured nature of the data. For example, the code examples database might consist of metadata such as the programming language, the created time, the tech stack used, and so on.
+
+It would be great if we could design a system that could leverage this metadata as a filter to retrieve only the relevant results.
+
+We can achieve this using a tool use approach. Here, we can build a system that generates multi-faceted queries to capture the full intent of a user's request. This allows for more precise and relevant results by utilizing the semi-structured nature of the data.
+
+Here are some examples of how this approach can be applied:
+1. E-commerce product searches: Filtering by price range, category, brand, customer ratings, and availability.
+2. Academic research databases: Narrowing results by publication year, field of study, citation count, and peer-review status.
+3. Job search platforms: Refining job listings by location, experience level, salary range, and required skills.
+
+In this tutorial, we'll cover:
+- Defining the function for data querying
+- Creating the tool for generating multi-faceted queries
+- Building an agent for performing multi-faceted queries
+- Running the agent
+
+We'll build an agent that helps developers find relevant code examples and tutorials for using Cohere.
+
+## Setup
+
+To get started, first we need to install the `cohere` library and create a Cohere client.
+
+
+```python PYTHON
+import json
+import os
+import cohere
+
+co = cohere.ClientV2("COHERE_API_KEY") # Get your free API key: https://dashboard.cohere.com/api-keys
+```
+
+Important: the source code for tool definitions can be [found here](https://github.com/cohere-ai/notebooks/blob/main/notebooks/guides/agentic-rag/tool_def.py). Make sure to have the tool_def.py file in the same directory as this notebook for the imports to work correctly.
+
+## Defining the function for data querying
+
+We'll remove the other tools from Part 1 and just use one – `search_code_examples`.
+
+Now, instead of just the `query` parameter, we'll add two more parameters: `programming_language` and `endpoints`:
+- `programming_language`: The programming language of the code example or tutorial.
+- `endpoints`: The Cohere endpoints used in the code example or tutorial.
+
+We'll use these parameters as the metadata to filter the code examples and tutorials.
+
+Let's rename the function to `search_code_examples_detailed` to reflect this change.
+
+And as in Part 1, for simplicity, we create `query` as just a mock parameter and no actual search logic will be performed based on it.
+
+
+```python PYTHON
+from tool_def import (
+    search_code_examples_detailed,
+    search_code_examples_detailed_tool,
+)
+```
+
+
+```python PYTHON
+
+functions_map = {
+    "search_code_examples_detailed": search_code_examples_detailed,
+}
+```
+
+## Creating the tool for generating multi-faceted queries
+
+With the `search_code_examples` modified, we now need to modify the tool definition as well. Here, we are adding two new properties to the tool definition:
+- `programming_language`: This is a string property which we provide a list of options for the model to choose. We do this by adding "Possible enum values" to the description, which in our case is `py, js`.
+- `endpoints`: We want the model to be able to choose from more than one endpoint, and so here we define an array property. When defining an array property, we need to specify the type of the items in the array using the `items` key, which in our case is `string`. We also provide a list of endpoint options for the model to choose from, which is `chat, embed, rerank, classify`.
+
+We set only the `query` parameter as required, while the other two parameters are optional.
+
+```python
+search_code_examples_detailed_tool = {
+        "type": "function",
+        "function": {
+            "name": "search_code_examples_detailed",
+            "description": "Searches code examples or tutorials of using Cohere.",
+            "parameters": {
+                "type": "object",
+                "properties": {
+                    "query": {
+                        "type": "string",
+                        "description": "The search query."
+                    },
+                    "programming_language": {
+                        "type": "string",
+                        "description": "The programming language of the code example or tutorial. Only use this property when specified by the user. Possible enum values: py, js."
+                    },
+                    "endpoints": {
+                        "type": "array",
+                        "items": {"type": "string"},
+                        "description": "The Cohere endpoints used in the code example or tutorial. Only use this property when asked by the user. Possible enum values: chat, embed, rerank, classify."
+                    }
+                },
+                "required": ["query"]
+            }
+        }
+}
+```
+```python PYTHON
+tools = [search_code_examples_detailed_tool]
+```
+
+## Building an agent for performing multi-faceted queries
+
+Next, let's create a `run_agent` function to run the agentic RAG workflow, the same as in Part 1.
+
+The only change we are making here is to make the system message simpler and more specific since the agent now only has one tool.
+
+
+```python PYTHON
+system_message="""## Task and Context
+You are an assistant who helps developers find code examples and tutorials on using Cohere."""
+```
+
+
+```python PYTHON
+model = "command-r-plus-08-2024"
+
+def run_agent(query, messages=None):
+    if messages is None:
+        messages = []
+
+    if "system" not in {m.get("role") for m in messages}:
+        messages.append({"role": "system", "content": system_message})
+
+    # Step 1: get user message
+    print(f"QUESTION:\n{query}")
+    print("="*50)
+
+    messages.append({"role": "user", "content": query})
+
+    # Step 2: Generate tool calls (if any)
+    response = co.chat(
+        model=model,
+        messages=messages,
+        tools=tools,
+        temperature=0.1
+    )
+
+    while response.message.tool_calls:
+
+        print("TOOL PLAN:")
+        print(response.message.tool_plan,"\n")
+        print("TOOL CALLS:")
+        for tc in response.message.tool_calls:
+            print(f"Tool name: {tc.function.name} | Parameters: {tc.function.arguments}")
+        print("="*50)
+
+        messages.append({"role": "assistant", "tool_calls": response.message.tool_calls, "tool_plan": response.message.tool_plan})        
+
+        # Step 3: Get tool results
+        for tc in response.message.tool_calls:
+            tool_result = functions_map[tc.function.name](**json.loads(tc.function.arguments))
+            tool_content = []
+            for data in tool_result:
+                tool_content.append({"type": "document", "document": {"data": json.dumps(data)}})
+                # Optional: add an "id" field in the "document" object, otherwise IDs are auto-generated
+            messages.append({"role": "tool", "tool_call_id": tc.id, "content": tool_content})
+
+        # Step 4: Generate response and citations 
+        response = co.chat(
+            model=model,
+            messages=messages,
+            tools=tools,
+            temperature=0.1
+        )
+
+    messages.append({"role": "assistant", "content": response.message.content[0].text})
+
+    # Print final response
+    print("RESPONSE:")
+    print(response.message.content[0].text)
+    print("="*50)
+
+    # Print citations (if any)
+    verbose_source = False # Change to True to display the contents of a source
+    if response.message.citations:
+        print("CITATIONS:\n")
+        for citation in response.message.citations:
+            print(f"Start: {citation.start}| End:{citation.end}| Text:'{citation.text}' ")
+            print("Sources:")
+            for idx, source in enumerate(citation.sources):
+                print(f"{idx+1}. {source.id}")
+                if verbose_source:
+                    print(f"{source.tool_output}")
+            print("\n")
+
+    return messages
+```
+
+## Running the agent
+
+Let's start with a broad query about "RAG code examples".
+
+Since it's broad, this query shouldn't require any metadata filtering.
+
+And this is shown by the agent's response, which provides only one parameter, `query`, in its tool call.
+
+
+```python PYTHON
+messages = run_agent("Do you have any RAG code examples")
+# Tool name: search_code_examples | Parameters: {"query":"RAG code examples"}
+```
+```
+QUESTION:
+Do you have any RAG code examples
+==================================================
+TOOL PLAN:
+I will search for RAG code examples. 
+
+TOOL CALLS:
+Tool name: search_code_examples_detailed | Parameters: {"query":"RAG"}
+==================================================
+RESPONSE:
+Yes, we have a code example for RAG with Chat Embed and Rerank via Pinecone.
+==================================================
+CITATIONS:
+
+Start: 32| End:76| Text:'RAG with Chat Embed and Rerank via Pinecone.' 
+Sources:
+1. search_code_examples_detailed_bqm9ww9cp4nw:2
+```
+
+
+
+Let's try a more specific query about "javascript tutorials on text summarization". 
+
+This time, the agent uses the `programming_language` parameter and passed the value `js` to it.
+
+
+```python PYTHON
+messages = run_agent("Javascript tutorials on summarization")
+# Tool name: search_code_examples | Parameters: {"programming_language":"js","query":"..."}
+```
+```
+QUESTION:
+Javascript tutorials on summarization
+==================================================
+TOOL PLAN:
+I will search for 'summarization' in the 'js' programming language. 
+
+TOOL CALLS:
+Tool name: search_code_examples_detailed | Parameters: {"programming_language":"js","query":"summarization"}
+==================================================
+RESPONSE:
+Here is a tutorial on building a Chrome extension to summarize web pages using JavaScript.
+==================================================
+CITATIONS:
+
+Start: 22| End:72| Text:'building a Chrome extension to summarize web pages' 
+Sources:
+1. search_code_examples_detailed_hf9rv18987vp:0
+``` 
+
+
+
+Let's now try a query that involves filtering based on the endpoints. Here, the user asks for "code examples of using embed and rerank endpoints".
+
+And since we have set up the `endpoints` parameter to be an array, the agent is able to call the tool with a list of endpoints as its argument.
+
+
+```python PYTHON
+messages = run_agent("Code examples of using embed and rerank endpoints.")
+
+# Tool name: search_code_examples | Parameters: {"endpoints":["embed","rerank"],"query":"..."}
+```
+```
+QUESTION:
+Code examples of using embed and rerank endpoints.
+==================================================
+TOOL PLAN:
+I will search for code examples of using the embed and rerank endpoints. 
+
+TOOL CALLS:
+Tool name: search_code_examples_detailed | Parameters: {"endpoints":["embed","rerank"],"query":"embed and rerank"}
+==================================================
+RESPONSE:
+Here are some code examples of using the embed and rerank endpoints:
+- Wikipedia Semantic Search with Cohere Embedding Archives
+- RAG With Chat Embed and Rerank via Pinecone
+- Build Chatbots That Know Your Business with MongoDB and Cohere
+==================================================
+CITATIONS:
+
+Start: 71| End:127| Text:'Wikipedia Semantic Search with Cohere Embedding Archives' 
+Sources:
+1. search_code_examples_detailed_d3br7bwd05bp:0
+
+
+Start: 130| End:173| Text:'RAG With Chat Embed and Rerank via Pinecone' 
+Sources:
+1. search_code_examples_detailed_d3br7bwd05bp:1
+
+
+Start: 176| End:238| Text:'Build Chatbots That Know Your Business with MongoDB and Cohere' 
+Sources:
+1. search_code_examples_detailed_d3br7bwd05bp:2
+```
+
+
+
+Finally, let's try a query that involves filtering based on both the programming language and the endpoints. Here, the user asks for "Python examples of using the chat endpoint".
+
+And the agent correctly uses both parameters to query the code examples.
+
+
+```python PYTHON
+messages = run_agent("Python examples of using the chat endpoint.")
+
+#Tool name: search_code_examples | Parameters: {"endpoints":["chat"],"programming_language":"py","query":"..."}
+```
+```
+QUESTION:
+Python examples of using the chat endpoint.
+==================================================
+TOOL PLAN:
+I will search for Python examples of using the chat endpoint. 
+
+TOOL CALLS:
+Tool name: search_code_examples_detailed | Parameters: {"endpoints":["chat"],"programming_language":"py","query":"Python examples of using the chat endpoint"}
+==================================================
+RESPONSE:
+Here are some Python examples of using the chat endpoint:
+- Calendar Agent with Native Multi Step Tool
+- RAG With Chat Embed and Rerank via Pinecone
+- Build Chatbots That Know Your Business with MongoDB and Cohere
+==================================================
+CITATIONS:
+
+Start: 60| End:102| Text:'Calendar Agent with Native Multi Step Tool' 
+Sources:
+1. search_code_examples_detailed_kp25g4k49ej5:0
+
+
+Start: 105| End:148| Text:'RAG With Chat Embed and Rerank via Pinecone' 
+Sources:
+1. search_code_examples_detailed_kp25g4k49ej5:2
+
+
+Start: 151| End:213| Text:'Build Chatbots That Know Your Business with MongoDB and Cohere' 
+Sources:
+1. search_code_examples_detailed_kp25g4k49ej5:3
+```  
+
+
+## Summary
+
+In this tutorial, we learned about:
+- How to define the function for data querying
+- How to create the tool for generating multi-faceted queries
+- How to build an agent for performing multi-faceted queries
+- How to run the agent
+
+By implementing multi-faceted queries over semi-structured data, we've enhanced our RAG system to handle more specific and targeted searches. This approach allows for better utilization of metadata and more precise filtering of results, which is particularly useful when dealing with large collections of code examples and tutorials.
+
+While this tutorial demonstrates how to work with semi-structured data, the agentic RAG approach can be applied to structured data as well. That means we can build agents that can translate natural language queries into queries for tables or relational databases.
+
+In Part 5, we'll learn how to perform RAG over structured data (tables).