In this tutorial, we explore the concept and implementation of a ReAct Agent.
ReAct Agent stands for Reasoning + Action, meaning that the LLM explicitly goes through a reasoning phase, utilizes tools (or actions), and then generates the final answer based on the obtained results.
Throughout this tutorial, we will implement a ReAct Agent by covering the following:
Tool Setup: Utilizing various tools such as web search, file management, document search based on VectorStore, etc.
Agent Creation: Practice how to use the ReAct Agent in LangChain.
Graph Execution: Execute the agent to observe the answers to queries.
Please follow the sections below to go through the entire process.
A ReAct Agent uses various tools to solve problems. In this tutorial, we will set up and use the following tools.
Web Search
We use the TavilySearch tool for searching the latest information from the web. The code below creates a web search tool and tests it by retrieving some search results.
from langchain_community.tools.tavily_search import TavilySearchResults
# Create an instance of TavilySearchResults with k=5 for retrieving up to 5 search results
web_search = TavilySearchResults(k=5)
# Test the web search tool.
result = web_search.invoke(
"Please find information related to the major AI-related from CES 2025"
)
result
File Management
Using FileManagementToolkit, you can create, delete, and modify files.
from langchain_community.agent_toolkits import FileManagementToolkit
# Set 'tmp' as the working directory.
working_directory = "tmp"
file_management_tools = FileManagementToolkit(
root_dir=str(working_directory)
).get_tools()
file_management_tools
Retriever Tool
To search for information within documents such as PDFs, we create a Retriever. First, we load the PDF, split the text, then embed it into a VectorStore.
Tesla's Revenue Forecast Based on Business Model and Financial Statement Analysis
Please copy the downloaded file to the data folder for practice.
from langchain_text_splitters import RecursiveCharacterTextSplitter
from langchain_community.vectorstores import FAISS
from langchain_openai import OpenAIEmbeddings
from langchain_community.document_loaders import PyMuPDFLoader
# Example PDF file path (adjust according to your environment)
pdf_file_path = "data/shsconf_icdeba2023_02022.pdf"
# Load the PDF using PyMuPDFLoader
loader = PyMuPDFLoader(pdf_file_path)
# Split text into smaller chunks
text_splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=100)
split_docs = loader.load_and_split(text_splitter)
# Create FAISS VectorStore
vector = FAISS.from_documents(split_docs, OpenAIEmbeddings())
# Create a retriever from the VectorStore
pdf_retriever = vector.as_retriever()
from langchain_core.tools.retriever import create_retriever_tool
from langchain_core.prompts import PromptTemplate
# Create a tool for PDF-based search
retriever_tool = create_retriever_tool(
retriever=pdf_retriever,
name="pdf_retriever",
description="use this tool to search for information in Tesla PDF file",
document_prompt=PromptTemplate.from_template(
"<document><context>{page_content}</context><metadata><source>{source}</source><page>{page}</page></metadata></document>"
),
)
We will now create a ReAct Agent and visualize the agent graph. In LangChain, a ReAct agent generates answers through step-by-step reasoning and tool usage.
The code below uses create_react_agent from langgraph to easily build a ReAct Agent and visualize its structure as a graph.
from langchain_openai import ChatOpenAI
from langgraph.checkpoint.memory import MemorySaver
from langgraph.prebuilt import create_react_agent
# Memory and model configuration
memory = MemorySaver()
model = ChatOpenAI(model_name="gpt-4o-mini")
# Create ReAct Agent
agent = create_react_agent(model, tools=tools, checkpointer=memory)
The code below visualizes the agent's graph structure.
from IPython.display import Image, display
display(
Image(agent.get_graph().draw_mermaid_png(output_file_path="11-React-Agent.png"))
)
def print_stream(stream):
for s in stream:
message = s["messages"][-1]
if isinstance(message, tuple):
print(message)
else:
message.pretty_print()
Execute Agent
Let's execute the created ReAct Agent. We can track the step-by-step process of generating an answer to a query.
The example code below uses stream_graph to stream the agent's execution process. We place the user's query in the messages key, and the agent's reasoning will be displayed.
config = {"configurable": {"thread_id": "1"}}
inputs = {"messages": [("human", "Hello, my name is Teddy.")]}
print_stream(agent.stream(inputs, config=config, stream_mode="values"))
# Another example - maintaining chat flow
config = {"configurable": {"thread_id": "1"}}
inputs = {"messages": [("human", "What was my name again?")]}
print_stream(agent.stream(inputs, config=config, stream_mode="values"))
config = {"configurable": {"thread_id": "1"}}
inputs = {
"messages": [("human", "Please summarize Tesla from shsconf_icdeba2023_02022.pdf.")]
}
print_stream(agent.stream(inputs, config=config, stream_mode="values"))
# Example of using web search + file management
config = {"configurable": {"thread_id": "1"}}
inputs = {
"messages": [
(
"human",
"Search for news about American writer John Smith's Pulitzer Prize and draft a brief report based on it.",
)
]
}
print_stream(agent.stream(inputs, config=config, stream_mode="values"))
# A more concrete scenario example
instruction = """
Your task is to write a 'press release.'
----
Please process the following steps in order:
1. Search for news about American writer John Smith's Pulitzer Prize.
2. Based on the Pulitzer Prize news, write a press release/report.
3. Actively use markdown table format to summarize key points.
4. Save the output to a file named `agent_press_release.md`.
"""
config = {"configurable": {"thread_id": "1"}}
inputs = {"messages": [("human", instruction)]}
print_stream(agent.stream(inputs, config=config, stream_mode="values"))