ConversationTokenBufferMemory

Author: Kenny Jung
Design: Kenny Jung
Peer Review : Wooseok Jeong, JeongGi Park
Proofread : Juni Lee
This is a part of LangChain Open Tutorial

Overview

ConversationTokenBufferMemory stores recent conversation history in a buffer memory and determines when to flush conversation content based on token length rather than the number of conversations.

Key parameters:

max_token_limit: Sets the maximum token length for storing conversation content
return_messages: When True, returns the messages in chat format. When False, returns a string
human_prefix: Prefix to add before human messages (default: "Human")
ai_prefix: Prefix to add before AI messages (default: "AI")

References

LangChain Python API Reference > langchain: 0.3.13 > memory > ConversationTokenBufferMemory

Environment Setup

Set up the environment. You may refer to Environment Setup for more details.

[Note]

langchain-opentutorial is a package that provides a set of easy-to-use environment setup, useful functions and utilities for tutorials.
You can checkout the langchain-opentutorial for more details.

%%capture --no-stderr
%pip install langchain-opentutorial

# Install required packages
from langchain_opentutorial import package

package.install(
    [
        "langsmith",
        "langchain",
        "langchain_core",
        "langchain-anthropic",
        "langchain_community",
        "langchain_text_splitters",
        "langchain_openai",
    ],
    verbose=False,
    upgrade=False,
)

# Set environment variables
from langchain_opentutorial import set_env

set_env(
    {
        "OPENAI_API_KEY": "",
        "LANGCHAIN_API_KEY": "",
        "LANGCHAIN_TRACING_V2": "true",
        "LANGCHAIN_ENDPOINT": "https://api.smith.langchain.com",
        "LANGCHAIN_PROJECT": "ConversationTokenBufferMemory",
    }
)

Environment variables have been set successfully.

You can alternatively set OPENAI_API_KEY in .env file and load it.

[Note] This is not necessary if you've already set OPENAI_API_KEY in previous steps.

from dotenv import load_dotenv

load_dotenv(override=True)

True

Limiting Maximum Token Length to 50

This section demonstrates how to limit the conversation memory to 50 tokens

from langchain.memory import ConversationTokenBufferMemory
from langchain_openai import ChatOpenAI


# Create LLM model
llm = ChatOpenAI(model_name="gpt-4o-mini")

# Configure memory
memory = ConversationTokenBufferMemory(
    llm=llm,
    max_token_limit=50,
    return_messages=True,  # Limit maximum token length to 50
)

/var/folders/1h/lrydr5_50zx8thkmv_czrbzc0000gn/T/ipykernel_59195/361976439.py:9: LangChainDeprecationWarning: Please see the migration guide at: https://python.langchain.com/docs/versions/migrating_memory/
      memory = ConversationTokenBufferMemory(

# Add arbitrary conversations
memory.save_context(
    inputs={
        "human": "Hello, I recently purchased a machine tool from your company. Could you tell me how to install it?"
    },
    outputs={
        "ai": "Hello! Thank you for your purchase. Could you please tell me the machine model number?"
    },
)
memory.save_context(
    inputs={"human": "Yes, the model number is XG-200."},
    outputs={
        "ai": "Thank you. I'll help you with the installation guide for the XG-200 model. First, please check the power supply status at the installation site. The machine requires 220V power."
    },
)
memory.save_context(
    inputs={"human": "I've checked the power. What's the next step?"},
    outputs={
        "ai": "Good. Next, please place the machine on a flat and stable surface. Then, proceed with cable connections according to the provided user manual."
    },
)
memory.save_context(
    inputs={"human": "How do I make the connections?"},
    outputs={
        "ai": "Please refer to page 5 of the manual. There are detailed instructions for cable connections. If you have any difficulties with this process, I'll be happy to help further."
    },
)
memory.save_context(
    inputs={"human": "What should I do after the installation is complete?"},
    outputs={
        "ai": "Once the installation is complete, please turn on the power and perform the initial operation test. The test procedure is explained on page 10 of the manual. If there are any issues with the machine or if you need additional support, please don't hesitate to contact us."
    },
)
memory.save_context(
    inputs={"human": "Thank you, this has been very helpful!"},
    outputs={
        "ai": "We're always ready to help. If you have any additional questions or need support, please feel free to ask. Have a great day!"
    },
)

# Check the conversation history
memory.load_memory_variables({})["history"]

[HumanMessage(content='Thank you, this has been very helpful!', additional_kwargs={}, response_metadata={}),
     AIMessage(content="We're always ready to help. If you have any additional questions or need support, please feel free to ask. Have a great day!", additional_kwargs={}, response_metadata={})]

Setting Maximum Token Length to 150

Let's check how the conversation is stored when we set the maximum token length to 150.

# Memory configuration
memory = ConversationTokenBufferMemory(
    llm=llm,
    max_token_limit=150,
    return_messages=True,  # Limit maximum token length to 150
)

# Add arbitrary conversations
memory.save_context(
    inputs={
        "human": "Hello, I recently purchased a machine tool from your company. Could you tell me how to install it?"
    },
    outputs={
        "ai": "Hello! Thank you for your purchase. Could you please tell me the machine model number?"
    },
)
memory.save_context(
    inputs={"human": "Yes, the model number is XG-200."},
    outputs={
        "ai": "Thank you. I'll help you with the installation guide for the XG-200 model. First, please check the power supply status at the installation site. The machine requires 220V power."
    },
)
memory.save_context(
    inputs={"human": "I've checked the power. What's the next step?"},
    outputs={
        "ai": "Good. Next, please place the machine on a flat and stable surface. Then, proceed with cable connections according to the provided user manual."
    },
)
memory.save_context(
    inputs={"human": "How do I make the connections?"},
    outputs={
        "ai": "Please refer to page 5 of the manual. There are detailed instructions for cable connections. If you have any difficulties with this process, I'll be happy to help further."
    },
)
memory.save_context(
    inputs={"human": "What should I do after the installation is complete?"},
    outputs={
        "ai": "Once the installation is complete, please turn on the power and perform the initial operation test. The test procedure is explained on page 10 of the manual. If there are any issues with the machine or if you need additional support, please don't hesitate to contact us."
    },
)
memory.save_context(
    inputs={"human": "Thank you, this has been very helpful!"},
    outputs={
        "ai": "We're always ready to help. If you have any additional questions or need support, please feel free to ask. Have a great day!"
    },
)

# Check the conversation history
memory.load_memory_variables({})["history"]

[HumanMessage(content='What should I do after the installation is complete?', additional_kwargs={}, response_metadata={}),
     AIMessage(content="Once the installation is complete, please turn on the power and perform the initial operation test. The test procedure is explained on page 10 of the manual. If there are any issues with the machine or if you need additional support, please don't hesitate to contact us.", additional_kwargs={}, response_metadata={}),
     HumanMessage(content='Thank you, this has been very helpful!', additional_kwargs={}, response_metadata={}),
     AIMessage(content="We're always ready to help. If you have any additional questions or need support, please feel free to ask. Have a great day!", additional_kwargs={}, response_metadata={})]

PreviousConversationBufferWindowMemory NextConversationEntityMemory

Last updated 2 months ago

Overview

Table of Contents

References

Environment Setup

Limiting Maximum Token Length to 50

Setting Maximum Token Length to 150