LangChain部分利用LangChain中的内存和存储：综合指南

首页 > 编程 > LangChain部分利用LangChain中的内存和存储：综合指南

LangChain部分利用LangChain中的内存和存储：综合指南

发布于2024-08-25

LangChain Part Leveraging Memory and Storage in LangChain: A Comprehensive Guide

LangChain 第 4 部分 - 在 LangChain 中利用内存和存储：综合指南

代码可以在这里找到：GitHub - jamesbmour/blog_tutorials:

在对话式人工智能和语言模型不断发展的世界中，维护上下文和有效管理信息流是构建智能应用程序的关键组成部分。 LangChain 是一个强大的框架，专为处理大型语言模型 (LLM) 而设计，为内存管理和数据持久性提供了强大的工具，从而能够创建上下文感知系统。

在本指南中，我们将深入探讨利用 LangChain 中的内存和存储来构建更智能、响应速度更快的应用程序的细微差别。

1. 在 LangChain 中使用内存

LangChain中的内存管理允许应用程序保留上下文，使交互更加连贯和上下文相关。让我们探索不同的内存类型及其用例。

1.1.记忆的类型

LangChain提供了多种内存类型来应对不同的场景。在这里，我们将重点关注两种关键类型：

ConversationBufferMemory

这种记忆类型非常适合短期上下文保留、捕捉和回忆对话中最近的互动。

from langchain.memory import ConversationBufferMemory

memory = ConversationBufferMemory()
memory.save_context({"input": "Hi, I'm Alice"}, {"output": "Hello Alice, how can I help you today?"})
memory.save_context({"input": "What's the weather like?"}, {"output": "I'm sorry, I don't have real-time weather information. Is there anything else I can help you with?"})

print(memory.load_memory_variables({}))

对话摘要内存

对于较长的对话，ConversationSummaryMemory 是一个不错的选择。它总结了要点，保持了上下文，但没有过多的细节。

from langchain.memory import ConversationSummaryMemory
from langchain.llms import Ollama 

llm = Ollama(model='phi3',temperature=0)
memory = ConversationSummaryMemory(llm=llm)
memory.save_context({"input": "Hi, I'm Alice"}, {"output": "Hello Alice, how can I help you today?"})
memory.save_context({"input": "I'm looking for a good Italian restaurant"}, {"output": "Great! I'd be happy to help you find a good Italian restaurant. Do you have any specific preferences or requirements, such as location, price range, or specific dishes you're interested in?"})

print(memory.load_memory_variables({}))

1.2.为您的使用案例选择正确的内存类型

选择合适的内存类型取决于几个因素：

持续时间和复杂性：短会话受益于 ConversationBufferMemory 的详细上下文保留，而长期交互可能需要通过 ConversationSummaryMemory 进行总结。
细节与概述：确定详细的交互历史记录还是高级摘要对您的应用程序更有价值。
性能：考虑内存大小和检索速度之间的权衡。

用例：

ConversationBufferMemory：快速客户支持或常见问题解答式交互的理想选择。
ConversationSummaryMemory：最适合长期参与，例如项目管理或持续的客户互动。

1.3.将内存集成到链和代理中

内存可以无缝集成到LangChain链和代理中，以增强会话能力。

from langchain.chains import ConversationChain  
from langchain.memory import ConversationBufferMemory
# llm = OpenAI(temperature=0)
memory = ConversationBufferMemory()
conversation = ConversationChain(
    llm=llm,
    memory=memory,
    verbose=True
)

conversation.predict(input="Hi, I'm Alice")
conversation.predict(input="What's my name?")

此示例说明了如何使用 ConversationBufferMemory 记住以前的交互，从而实现更自然的对话。

2. 保存和检索数据

持久存储可确保跨会话维护对话历史记录和上下文，从而实现交互的连续性。

2.1.存储对话历史记录和状态

对于基本持久性，您可以将基于文件的存储与 JSON 结合使用：

import json

class PersistentMemory:
    def __init__(self, file_path):
        self.file_path = file_path
        self.load_memory()

    def load_memory(self):
        try:
            with open(self.file_path, 'r') as f:
                self.chat_memory = json.load(f)
        except FileNotFoundError:
            self.chat_memory = {'messages': []}

    def save_memory(self):
        with open(self.file_path, 'w') as f:
            json.dump({'messages': self.chat_memory['messages']}, f)

# Usage
memory = PersistentMemory(file_path='conversation_history.json')
print(memory.chat_memory)

此方法允许您以简单、人类可读的格式保存对话历史记录。

2.2.与数据库和存储系统集成

为了实现更可扩展和更高效的存储，建议与 SQLite 等数据库集成：

import sqlite3

class SQLiteMemory:
    def __init__(self, db_path):
        self.db_path = db_path
        self.conn = sqlite3.connect(db_path)
        self.create_table()

    def create_table(self):
        cursor = self.conn.cursor()
        cursor.execute('''
            CREATE TABLE IF NOT EXISTS conversations
            (id INTEGER PRIMARY KEY, input TEXT, output TEXT)
        ''')
        self.conn.commit()

    def save_context(self, inputs, outputs):
        cursor = self.conn.cursor()
        cursor.execute('INSERT INTO conversations (input, output) VALUES (?, ?)',
                       (inputs['input'], outputs['output']))
        self.conn.commit()

    def load_memory_variables(self, inputs):
        cursor = self.conn.cursor()
        cursor.execute('SELECT input, output FROM conversations ORDER BY id DESC LIMIT 10')
        rows = cursor.fetchall()
        history = "\\n".join([f"Human: {row[0]}\\nAI: {row[1]}" for row in reversed(rows)])
        return {"history": history }

# Usage
memory = SQLiteMemory('conversation_history.db')

print(memory.load_memory_variables({}))

3 优化内存使用和性能

为了确保您的应用程序保持响应，请考虑以下优化策略：

高效数据结构：使用双端队列等结构来管理固定大小的缓冲区。
缓存策略：通过对经常访问的数据实施缓存来减少数据库查询。
数据修剪：定期修剪或汇总旧数据以保持可管理的内存大小。

这是具有基本缓存的内存类的示例：

import time

class CachedSQLiteMemory(SQLiteMemory):
    def __init__(self, db_path, cache_ttl=60):
        super().__init__(db_path)
        self.cache = None
        self.cache_time = 0
        self.cache_ttl = cache_ttl

    def load_memory_variables(self, inputs):
        current_time = time.time()
        if self.cache is None or (current_time - self.cache_time) > self.cache_ttl:
            var = self.cache
            self.cache = super().load_memory_variables(inputs)
            self.cache_time = current_time
            return self.cache

memory = CachedSQLiteMemory('conversation_history.db', cache_ttl=30)

此实现会缓存指定时间的数据库查询结果，减少数据库的负载并提高频繁访问内存数据的应用程序的性能。

结论

有效的内存管理是构建智能、上下文感知的对话式人工智能应用程序的基石。 LangChain提供了灵活而强大的内存管理框架，允许开发人员根据特定用例定制内存类型，实现持久存储解决方案，并优化大规模应用程序的性能。

通过选择正确的内存类型、集成持久存储并利用自定义内存类别和缓存策略等先进技术，您可以构建复杂的 AI 系统，即使在规模和复杂性方面也能维护上下文、改善用户体验并高效运行互动增加。

有了这些可用的工具和技术，您就可以充分利用 LangChain 的全部潜力来创建响应式、智能和情境感知的人工智能应用程序。无论您是开发客户支持机器人、虚拟助理还是复杂的对话系统，掌握 LangChain 中的内存和存储都将是您成功的关键因素。

如果您想支持我的写作或给我买啤酒：
https://buymeacoffee.com/bmours

版本声明本文转载于：https://dev.to/jamesbmour/langchain-part-4-leveraging-memory-and-storage-in-langchain-a-comprehensive-guide-h4m?1如有侵犯，请联系[email protected]删除