In the Model I/O stage, LangChain abstracts three main components: Language Model, Prompts, and Output Parser.
Each is introduced below.
⚠️ Note: all code examples below assume OPENAI_API_KEY and OPENAI_BASE_URL are configured in advance. OPENAI_API_KEY is the API key for OpenAI (or an OpenAI-compatible proxy service), and OPENAI_BASE_URL is the base URL of that proxy service.
Language Model
The Language Model component is what actually interacts with the LLM / ChatModel, and it can be used directly like an ordinary openai client. LangChain mainly works with three kinds of Language Model: LLM, Chat Model, and Embedding.
LLM: the most basic kind of Language Model, used in a 'text in ➡️ text out' fashion. LangChain also ships integrations for a large number of such models.
from langchain.llms import OpenAI
llm = OpenAI(model_name="text-ada-001", openai_api_key=OPENAI_API_KEY, openai_api_base=OPENAI_BASE_URL)
llm("What day comes after Friday?")
# '\n\nSaturday.'
Chat Model: a variant of LLM that abstracts the chat usage pattern, turning 'text in ➡️ text out' into 'chat messages in ➡️ chat message out', where a chat message is text plus a message type (System, Human, AI).
from langchain.chat_models import ChatOpenAI
from langchain.schema import HumanMessage, SystemMessage
chat = ChatOpenAI(model_name="gpt-4-0613", temperature=1, openai_api_key=OPENAI_API_KEY, openai_api_base=OPENAI_BASE_URL)
chat(
[
SystemMessage(content="You are an expert on large language models and can answer any questions related to large language models."),
HumanMessage(content="What's the difference between Generic Language Models, Instruction Tuned Models and Dialog Tuned Models")
]
)
# AIMessage(content='Generic Language Models, Instruction-Tuned Models, and Dialog-Tuned Models are all various types of language models that have been trained according to different datasets and methodologies...')
Embedding: the Language Model that turns text into a vector of floats ('text in ➡️ vector out'), which is the basis of the similarity search used later in the Data connection section.
from langchain.embeddings import OpenAIEmbeddings
embeddings = OpenAIEmbeddings(openai_api_key=OPENAI_API_KEY, openai_api_base=OPENAI_BASE_URL)
text_embedding = embeddings.embed_query("To embed text(it can have any length)")
print (f"Your embedding's length: {len(text_embedding)}")
print (f"Here's a sample: {text_embedding[:5]}...")
'''
Your embedding's length: 1536
Here's a sample: [-0.03194352, 0.009228715, 0.00807182, 0.0077545005, 0.008256923]...
'''
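An embedding by itself carries no meaning; it becomes useful when compared against other embeddings, since texts with similar meaning map to nearby vectors. As a minimal plain-Python sketch (toy 3-dimensional vectors stand in for the real 1536-dimensional ones), similarity is typically measured with cosine similarity:

```python
import math

def cosine_similarity(a: list[float], b: list[float]) -> float:
    # cos(theta) = dot(a, b) / (||a|| * ||b||)
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Toy vectors: imagine these came from embeddings.embed_query(...)
query = [0.1, 0.9, 0.0]
doc_similar = [0.2, 0.8, 0.1]
doc_unrelated = [0.9, -0.1, 0.4]

print(cosine_similarity(query, doc_similar) > cosine_similarity(query, doc_unrelated))
# True: the first document points in roughly the same direction as the query
```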
Prompts
A prompt is the set of instructions and inputs supplied by the user. It is the sole input that determines what the Language Model produces, and it mainly helps the model understand the context and generate relevant, coherent output, such as answering questions, continuing sentences, or summarizing text. The related components in LangChain are Prompt Template and Example selectors, along with some other components introduced later that assist or supplement prompts.
Prompt Template: a predefined prompt template composed of instructions and input parameters. It supports flexible inputs, such as output instructions (directives for the output format), partial input (pre-filling some of the input parameters), and examples (input/output demonstrations). LangChain provides many ways to create a Prompt Template, and this layer of abstraction lets the same template be reused heavily across different Language Models and Chains. The Example selectors and Output Parser components described below can also take part in a Prompt Template.
Example selectors: in many scenarios, a plain instruction + input prompt is not enough for the LLM to produce a high-quality answer, and the prompt needs to be supplemented with examples targeted at the concrete problem. LangChain abstracts this into the Example selectors component, which can select examples by keyword or by similarity (usually computed with MMR / cosine similarity / n-gram overlap, discussed later in the vector database section). To keep the final prompt within the Language Model's token limit (see the table below for each model's limit), LangChain also provides LengthBasedExampleSelector, which caps the number of examples by length: for longer inputs it builds prompts with fewer examples, and for shorter inputs it includes more.
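To make the length-based selection concrete, here is a rough plain-Python sketch of the idea behind LengthBasedExampleSelector; the helper below is hypothetical and simply counts words, while the real selector formats each example with a PromptTemplate and uses a configurable length function:

```python
def select_examples_by_length(examples, user_input, max_words):
    # Spend the word budget on the user input first, then greedily add
    # examples in order until the next one would exceed the budget.
    budget = max_words - len(user_input.split())
    selected = []
    for example in examples:
        cost = len(example.split())
        if cost > budget:
            break
        selected.append(example)
        budget -= cost
    return selected

examples = [
    "Input: happy -> Output: sad",
    "Input: tall -> Output: short",
    "Input: energetic -> Output: lethargic",
]
short_input = "big"
long_input = " ".join(["word"] * 12)
print(len(select_examples_by_length(examples, short_input, max_words=20)))  # 3
print(len(select_examples_by_length(examples, long_input, max_words=20)))   # 1
```

The shorter input leaves room for all three examples, while the longer one only leaves room for the first.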
Output Parser
We usually want the Language Model's output in a fixed format so that it can be parsed into structured data. LangChain abstracts this need into the Output Parser component and provides a series of predefined parsers, such as the commonly used Structured output parser and List parser, plus the Auto-fixing parser and Retry parser, which come into play when the LLM output cannot be parsed.
from langchain.chat_models import ChatOpenAI
from langchain.prompts import PromptTemplate
from langchain.output_parsers import StructuredOutputParser, ResponseSchema
from langchain.chains import LLMChain
llm = ChatOpenAI(temperature=0.5, model_name="gpt-3.5-turbo-16k-0613", openai_api_key=OPENAI_API_KEY, openai_api_base=OPENAI_BASE_URL)
template = """
## Input
{text}
## Instruction
Please summarize the piece of text in the input part above.
Respond in a manner that a 5 year old would understand.
{format_instructions}
YOUR RESPONSE:
"""
# Create an Output Parser with two output fields, each with a type and a description
output_parser = StructuredOutputParser.from_response_schemas(
[
ResponseSchema(name="keywords", type="list", description="keywords of the text"),
ResponseSchema(name="summary", type="string", description="summary of the text"),
]
)
# Create the Prompt Template, injecting the Output Parser's format instructions as format_instructions via partial_variables
prompt = PromptTemplate(
input_variables=["text"],
template=template,
partial_variables={"format_instructions": output_parser.get_format_instructions()},
)
# Create the Chain, binding the Prompt Template and the Output Parser (the chain will automatically parse the llm output with it)
summarize_chain = LLMChain(llm=llm, verbose=True, prompt=prompt, output_parser=output_parser)
to_summarize_text = 'Abstract. Text-to-SQL aims at generating SQL queries for the given natural language questions and thus helping users to query databases. Prompt learning with large language models (LLMs) has emerged as a recent approach, which designs prompts to lead LLMs to understand the input question and generate the corresponding SQL. However, it faces challenges with strict SQL syntax requirements. Existing work prompts the LLMs with a list of demonstration examples (i.e. question-SQL pairs) to generate SQL, but the fixed prompts can hardly handle the scenario where the semantic gap between the retrieved demonstration and the input question is large.'
output = summarize_chain.predict(text=to_summarize_text)
import json
print (json.dumps(output, indent=4))
The output is as follows:
{
    "keywords": [
        "Text-to-SQL",
        "SQL queries",
        "natural language questions",
        "databases",
        "prompt learning",
        "large language models",
        "LLMs",
        "SQL syntax requirements",
        "demonstration examples",
        "semantic gap"
    ],
    "summary": "Text-to-SQL is a method that helps users generate SQL queries for their questions about databases. One approach is to use large language models to understand the question and generate the SQL. However, this approach faces challenges with strict SQL syntax rules. Existing methods use examples to teach the language models, but they struggle when the examples are very different from the question."
}
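Under the hood, output_parser.get_format_instructions() asks the model to respond with a fenced json code block matching the declared schema, and parsing essentially extracts that block and deserializes it. A rough plain-Python sketch of that step (the helper name is hypothetical):

```python
import json
import re

def parse_structured_output(llm_output: str) -> dict:
    # Extract the fenced json block the format instructions asked for;
    # fall back to treating the whole output as JSON.
    match = re.search(r"```json\s*(.*?)```", llm_output, re.DOTALL)
    payload = match.group(1) if match else llm_output
    return json.loads(payload)

raw = (
    "Here is the result:\n"
    "```json\n"
    '{"keywords": ["Text-to-SQL", "LLMs"], "summary": "Text-to-SQL turns questions into SQL."}\n'
    "```"
)
parsed = parse_structured_output(raw)
print(parsed["keywords"])  # ['Text-to-SQL', 'LLMs']
```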
Data connection
As mentioned in the "What is LangChain" section at the beginning of this article, integrating external data into the Language Model is one of LangChain's core capabilities, and one of the keys behind many successful LLM applications (YouTube video summarization assistants, for example). In LangChain, the Data connection layer consists mainly of four abstract components, all of which appear in the example below: Document loaders, Text splitters, Text embedding models, and Vector stores (queried through a Retriever):
from langchain.chat_models import ChatOpenAI
from langchain.vectorstores import FAISS
from langchain.chains import RetrievalQA
from langchain.document_loaders import TextLoader
from langchain.embeddings.openai import OpenAIEmbeddings
from langchain.text_splitter import RecursiveCharacterTextSplitter
# Initialize the LLM
llm = ChatOpenAI(temperature=0.5, model_name="gpt-3.5-turbo-16k-0613", openai_api_key=OPENAI_API_KEY, openai_api_base=OPENAI_BASE_URL)
# Load the document
loader = TextLoader('path/to/related/document')
doc = loader.load()
print (f"You have {len(doc)} document")
print (f"You have {len(doc[0].page_content)} characters in that document")
# Split the document into chunks
text_splitter = RecursiveCharacterTextSplitter(chunk_size=3000, chunk_overlap=400)
docs = text_splitter.split_documents(doc)
num_total_characters = sum([len(x.page_content) for x in docs])
print (f"Now you have {len(docs)} documents that have an average of {num_total_characters / len(docs):,.0f} characters (smaller pieces)")
# Initialize the embedding model
embeddings = OpenAIEmbeddings(openai_api_key=OPENAI_API_KEY, openai_api_base=OPENAI_BASE_URL)
# Embed the Documents and store them in a vector database (binding each vector to its Document's metadata); here we use FAISS, the most common local choice
# Note: this sends requests to OpenAI and therefore incurs cost
doc_search = FAISS.from_documents(docs, embeddings)
qa = RetrievalQA.from_chain_type(llm=llm, chain_type="stuff", retriever=doc_search.as_retriever(), verbose=True)
query = "Specific questions to be asked"
qa.run(query)
With verbose on, the run logs the following:
You have 1 document
You have 74663 characters in that document
Now you have 29 documents that have an average of 2,930 characters (smaller pieces)
> Entering new chain…
Prompt after formatting:
System: Use the following pieces of context to answer the users question. If you don't know the answer, just say that you don't know, don't try to make up an answer.
Human: What does the author describe as good work?
The author describes working on things that aren't prestigious as a sign of good work. They believe that working on unprestigious types of work can lead to the discovery of something real and that it indicates having the right kind of motives. The author also mentions that working on things that last, such as paintings, is considered good work.
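The chunk_size / chunk_overlap mechanics can be sketched without LangChain. The simplified splitter below (a hypothetical helper) cuts at fixed character offsets only; the real RecursiveCharacterTextSplitter additionally tries to break at separators such as paragraph breaks, line breaks, and spaces before cutting mid-word:

```python
def split_text(text: str, chunk_size: int, chunk_overlap: int) -> list:
    # Each chunk starts (chunk_size - chunk_overlap) characters after the
    # previous one, so consecutive chunks share chunk_overlap characters.
    step = chunk_size - chunk_overlap
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break
    return chunks

doc = "x" * 10000
chunks = split_text(doc, chunk_size=3000, chunk_overlap=400)
print(len(chunks))  # 4
# Consecutive chunks share their 400-character overlap:
print(chunks[0][-400:] == chunks[1][:400])  # True
```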
Chains
Next comes the star of LangChain: Chains. A Chain is the key abstraction for linking multiple interactions with the Language Model. It can combine several components into a single, coherent task, nest multiple Chains inside one another, or mix Chains with other components to build more complex Chains.
from langchain.chains.router import MultiPromptChain
from langchain.llms import OpenAI
from langchain.chains import ConversationChain
from langchain.chains.llm import LLMChain
from langchain.prompts import PromptTemplate
from langchain.chains.router.llm_router import LLMRouterChain, RouterOutputParser
from langchain.chains.router.multi_prompt_prompt import MULTI_PROMPT_ROUTER_TEMPLATE
# Define the prompts to route between
physics_template = """You are a very smart physics professor. You are great at answering questions about physics in a concise and easy to understand manner. When you don't know the answer to a question you admit that you don't know.
Here is a question:
{input}"""
math_template = """You are a very good mathematician. You are great at answering math questions. You are so good because you are able to break down hard problems into their component parts, answer the component parts, and then put them together to answer the broader question.
Here is a question:
{input}"""
# Organize the prompts and their related info
prompt_infos = [
{
"name": "physics",
"description": "Good for answering questions about physics",
"prompt_template": physics_template,
},
{
"name": "math",
"description": "Good for answering math questions",
"prompt_template": math_template,
},
]
llm = OpenAI(temperature=0.5, openai_api_key=OPENAI_API_KEY, openai_api_base=OPENAI_BASE_URL+"/v1")
destination_chains = {}
for p_info in prompt_infos:
    # Create a destination_chain from each prompt (verbose on)
    name = p_info["name"]
    prompt_template = p_info["prompt_template"]
    prompt = PromptTemplate(template=prompt_template, input_variables=["input"])
    chain = LLMChain(llm=llm, prompt=prompt, verbose=True)
    destination_chains[name] = chain
# Create a default chain to fall back to when no other chain matches the route
default_chain = ConversationChain(llm=llm, output_key="text")
destinations = [f"{p['name']}: {p['description']}" for p in prompt_infos]
destinations_str = "\n".join(destinations)
# Build the router_prompt from the name-to-description mapping in prompt_infos
router_template = MULTI_PROMPT_ROUTER_TEMPLATE.format(destinations=destinations_str)
router_prompt = PromptTemplate(
template=router_template,
input_variables=["input"],
output_parser=RouterOutputParser(),
)
# Create the router_chain (verbose on)
router_chain = LLMRouterChain(llm_chain=LLMChain(llm=llm, prompt=router_prompt, verbose=True), verbose=True)
# Combine router_chain, destination_chains, and default_chain into a MultiPromptChain (verbose on)
chain = MultiPromptChain(
router_chain=router_chain,
destination_chains=destination_chains,
default_chain=default_chain,
verbose=True,
)
# run
chain.run("What is black body radiation?")
With verbose on, the run logs the following:
> Entering new chain…
Prompt after formatting:
Given a raw text input to a language model select the model prompt best suited for the input. You will be given the names of the available prompts and a description of what the prompt is best suited for. You may also revise the original input if you think that revising it will ultimately lead to a better response from the language model.
<< FORMATTING >>
Return a markdown code snippet with a JSON object formatted to look like:
{
    "destination": string \ name of the prompt to use or "DEFAULT"
    "next_inputs": string \ a potentially modified version of the original input
}
REMEMBER: 'destination' MUST be one of the candidate prompt names specified below OR it can be 'DEFAULT' if the input is not well suited for any of the candidate prompts. REMEMBER: 'next_inputs' can just be the original input if you don't think any modifications are needed.
<< CANDIDATE PROMPTS >>
physics: Good for answering questions about physics
math: Good for answering math questions
<< INPUT >>
What is black body radiation?
<< OUTPUT >>
> Finished chain.
physics: {'input': 'What is black body radiation?'}
> Entering new chain…
Prompt after formatting:
You are a very smart physics professor. You are great at answering questions about physics in a concise and easy to understand manner. When you don't know the answer to a question you admit that you don't know.
Here is a question:
What is black body radiation?
Finished chain.
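The routing step can be sketched offline by swapping the LLM-based router for a trivial keyword lookup. Only the result shape below ("destination" / "next_inputs") mirrors what RouterOutputParser produces; the keyword table and helper are hypothetical:

```python
# Keyword table standing in for the LLM's judgment (hypothetical).
KEYWORDS = {
    "physics": ["radiation", "quantum", "force"],
    "math": ["derivative", "integral", "prime"],
}

def route(question: str) -> dict:
    q = question.lower()
    for name, kws in KEYWORDS.items():
        if any(kw in q for kw in kws):
            # The real router may also revise the input ("next_inputs");
            # here it is passed through unchanged.
            return {"destination": name, "next_inputs": question}
    # No destination matched: fall back to the default chain.
    return {"destination": "DEFAULT", "next_inputs": question}

print(route("What is black body radiation?")["destination"])  # physics
print(route("Tell me a joke")["destination"])                 # DEFAULT
```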
Memory
Memory supplies the Language Model with context from past interactions. Memory in LangChain is a somewhat fuzzy term: it can be as simple as remembering what you have chatted about before, it can combine with a vector database for more sophisticated retrieval of history, or it can even maintain specific information about relevant entities and their relationships; it all depends on the application.
from langchain.chains import ConversationChain
from langchain.memory import ConversationSummaryBufferMemory
from langchain.llms import OpenAI
from langchain.schema import SystemMessage, AIMessage, HumanMessage
from langchain.memory.prompt import SUMMARY_PROMPT
from langchain.prompts import PromptTemplate
llm = OpenAI(temperature=0.7, openai_api_key=OPENAI_API_KEY, openai_api_base=OPENAI_BASE_URL+"/v1")
# ConversationSummaryBufferMemory uses langchain.memory.prompt.SUMMARY_PROMPT as its summary PromptTemplate by default
# If you have specific requirements for the summary's format/content, you can define your own PromptTemplate (in practice the default summary reads like a bland play-by-play)
prompt_template_str = """
## Instruction
Progressively summarize the lines of conversation provided, adding onto the previous summary returning a new concise and detailed summary.
Don't repeat the conversation directly in the summary, extract key information instead.
## EXAMPLE
Current summary:
The human asks what the AI thinks of artificial intelligence. The AI thinks artificial intelligence is a force for good.
New lines of conversation:
Human: Why do you think artificial intelligence is a force for good?
AI: Because artificial intelligence will help humans reach their full potential.
New summary:
The human inquires about the AI's opinion on artificial intelligence. The AI believes that it is a force for good as it can help humans reach their full potential.
## Current summary
{summary}
## New lines of conversation
{new_lines}
## New summary
"""
prompt = PromptTemplate(
input_variables=SUMMARY_PROMPT.input_variables, # keep the same input_variables as SUMMARY_PROMPT
template=prompt_template_str, # swap in the prompt_template_str rewritten above
)
memory = ConversationSummaryBufferMemory(llm=llm, prompt=prompt, max_token_limit=60)
# Seed the memory with history: the first SystemMessage is the summary of the earlier conversation, and the HumanMessage and AIMessage that follow are the final turns of that conversation
memory.chat_memory.add_message(SystemMessage(content="The human asks what the AI thinks of artificial intelligence. The AI thinks artificial intelligence is a force for good because it will help humans reach their full potential. The human then asks the difference between python and golang in short. The AI responds that python is a high-level interpreted language with an emphasis on readability and code readability, while golang is a statically typed compiled language with a focus on concurrency and performance. Python is typically used for general-purpose programming, while golang is often used for building distributed systems."))
memory.chat_memory.add_user_message("Then if I want to build a distributed system, which language should I choose?")
memory.chat_memory.add_ai_message("If you want to build a distributed system, I would recommend golang as it is a statically typed compiled language that is designed to facilitate concurrency and performance.")
# Call memory.prune() to ensure the conversation held in chat_memory stays within max_token_limit
memory.prune()
conversation_with_summary = ConversationChain(
llm=llm,
# We set a very low max_token_limit for the purposes of testing.
memory=memory,
verbose=True,
)
# memory.prune() runs automatically after each call to predict()
conversation_with_summary.predict(input="Is there any well-known distributed system built with golang?")
conversation_with_summary.predict(input="Is there a substitutes for Kubernetes in python?")
With verbose on, the run logs the following:
> Entering new chain…
Prompt after formatting:
The following is a friendly conversation between a human and an AI. The AI is talkative and provides lots of specific details from its context. If the AI does not know the answer to a question, it truthfully says it does not know.
Current conversation:
System: The human asks the AI about its opinion on artificial intelligence and is told that it is a force for good that can help humans reach their full potential. The human then inquires about the differences between python and golang, with the AI explaining that python is a high-level interpreted language for general-purpose programming, while golang is a statically typed compiled language often used for building distributed systems.
Human: Then if I want to build a distributed system, which language should I choose?
AI: If you want to build a distributed system, I would recommend golang as it is a statically typed compiled language that is designed to facilitate concurrency and performance.
Human: Is there any well-known distributed system built with golang?
AI:
> Finished chain.
> Entering new chain…
Prompt after formatting:
The following is a friendly conversation between a human and an AI. The AI is talkative and provides lots of specific details from its context. If the AI does not know the answer to a question, it truthfully says it does not know.
Current conversation:
System: The human asks the AI about its opinion on artificial intelligence and is told that it is a force for good that can help humans reach their full potential. The human then inquires about the differences between python and golang, with the AI explaining that python is a high-level interpreted language for general-purpose programming, while golang is a statically typed compiled language designed to facilitate concurrency and performance, thus better suited for distributed systems. The AI recommends golang for building distributed systems.
Human: Is there any well-known distributed system built with golang?
AI: Yes, there are several well-known distributed systems built with golang. These include Kubernetes, Docker, and Consul.
Human: Is there a substitutes for Kubernetes in python?
AI:
'Yes, there are several substitutes for Kubernetes in python. These include Dask, Apache Mesos and Marathon, and Apache Aurora.'
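The prune() behavior can be sketched in plain Python: while the buffered turns exceed the token budget, the oldest ones are popped and folded into the running summary. Here "tokens" are approximated by words, and summarize() is a stub standing in for the LLM call driven by SUMMARY_PROMPT; all names are hypothetical:

```python
def count_tokens(message: str) -> int:
    # Crude stand-in for real model tokenization.
    return len(message.split())

def summarize(summary: str, dropped: list) -> str:
    # Stand-in for the LLM summarization call.
    return summary + " " + " | ".join(dropped)

def prune(summary: str, buffer: list, max_token_limit: int):
    dropped = []
    while buffer and sum(count_tokens(m) for m in buffer) > max_token_limit:
        dropped.append(buffer.pop(0))  # oldest message first
    if dropped:
        summary = summarize(summary, dropped)
    return summary, buffer

summary, buffer = prune(
    "Human and AI compared python and golang.",
    [
        "Human: python vs golang?",
        "AI: golang for distributed systems.",
        "Human: why?",
    ],
    max_token_limit=8,
)
print(len(buffer))  # 2: the oldest turn was folded into the summary
```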
Agent
LangChain abstracts an Agent's process of "deciding what to do" into "planning what to do" and "executing the sub tasks" (an approach drawn from a paper). The "planning what to do" step is usually handled entirely by the LLM, while "executing the sub tasks" is usually carried out by Tools.
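That split can be sketched as a tiny plan-and-execute loop. The planner below is a stub standing in for the LLM, and both tools are hypothetical:

```python
# Tools the executor can call (hypothetical stand-ins).
TOOLS = {
    "search": lambda q: f"top results for '{q}'",
    "summarize": lambda q: f"summary of findings on '{q}'",
}

def plan(goal: str) -> list:
    # "planning what to do": a real agent would ask the LLM to decompose
    # the goal into sub tasks; here the plan is hard-coded.
    return [("search", goal), ("summarize", goal)]

def execute(steps: list) -> list:
    # "executing the sub tasks" with the Tools.
    return [TOOLS[tool](arg) for tool, arg in steps]

observations = execute(plan("black body radiation"))
print(observations[0])  # top results for 'black body radiation'
```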