基于 LangChain 从零搭建 AI Agent 实战指南 | 极客日志

PythonAI算法

基于 LangChain 从零搭建 AI Agent 实战指南

综述由AI生成LLM 存在知识滞后和幻觉问题，引入 Agent 机制可借助外部工具增强能力。基于 LangChain 框架，通过 ReAct 模式（推理 + 行动）实现了一个火车票预订智能体。内容涵盖 Agent 核心组件解析、Prompt 模板设计、自定义工具开发及完整运行流程演示，帮助开发者理解如何构建具备规划与执行能力的 AI 应用。

墨染流年发布于 2026/3/29更新于 2026/6/1219 浏览

为什么大模型需要 Agent 化

虽然大语言模型（LLM）能力强大，但受限于训练数据的时间窗口，它们存在明显的短板：知识滞后、幻觉问题、对时事了解有限，以及复杂推理和计算能力的不足。

举个例子，当用户让 LLM 买高铁票时，模型虽然理解了'买票'这个行为意图，但它本身并不掌握实时时刻表、价格或用户所在的城市信息。这时候，基于大模型的 Agent（LLM-based Agent）就能通过调用外部工具来弥补这些缺陷。

Agent 架构示意图

ReAct Agent 核心原理

ReAct 是近年来非常经典的 Agent 范式，论文链接：ReAct Agent 论文。

LLM Agent 的演进路线

从早期的直接回答（Standard IO），到思维链（CoT），再到函数调用（Function calling），现在的趋势是 Reason + Action。

ReAct = Reasoning（推理）+ Action（行动）。这种模式允许模型在采取行动前先进行思考，根据观察结果再决定下一步。

ReAct 流程

ReAct Agent 的组成部分

在使用 LangChain 构建时，主要包含以下模块：

Models：负责生成文本的 LLM。
Prompts：定义 Agent 的行为指令和约束条件。
Memory：记录 Action 的执行状态，缓存已知信息。
Indexes：用于结构化文档，方便模型检索交互。
Chains：LangChain 的核心链路，串联各个组件。
Agent：协调上述组件的主体。

Prompt 模板设计

Prompt 的质量直接影响 Agent 的表现。我们需要明确告诉模型如何思考、使用什么工具以及输出格式。

from langchain_core.prompts import PromptTemplate

template = '''Answer the following questions as best you can. You have access to the following tools: {tools}
Use the following format:
Question: the input question you must answer
Thought: you should always think about what to do
Action: the action to take, should be one of [{tool_names}]
Action Input: the input to the action
Observation: the result of the action
...
(this Thought/Action/Action Input/Observation can repeat N times)
Thought: I now know the final answer
Final Answer: the final answer to the original input question
Begin!
Question: {input}
Thought:{agent_scratchpad}'''

prompt = PromptTemplate.from_template(template)

相关免费在线工具

加密/解密文本
使用加密算法（如AES、TripleDES、Rabbit或RC4）加密和解密文本明文。在线工具，加密/解密文本在线工具，online
RSA密钥对生成器
生成新的随机RSA私钥和公钥pem证书。在线工具，RSA密钥对生成器在线工具，online
Mermaid 预览与可视化编辑
基于 Mermaid.js 实时预览流程图、时序图等图表，支持源码编辑与即时渲染。在线工具，Mermaid 预览与可视化编辑在线工具，online
随机西班牙地址生成器
随机生成西班牙地址（支持马德里、加泰罗尼亚、安达卢西亚、瓦伦西亚筛选），支持数量快捷选择、显示全部与下载。在线工具，随机西班牙地址生成器在线工具，online
Gemini 图片去水印
基于开源反向 Alpha 混合算法去除 Gemini/Nano Banana 图片水印，支持批量处理与下载。在线工具，Gemini 图片去水印在线工具，online
curl 转代码
解析常见 curl 参数并生成 fetch、axios、PHP curl 或 Python requests 示例代码。在线工具，curl 转代码在线工具，online

import json
import sys
from typing import List, Optional, Dict, Any, Tuple, Union
from uuid import UUID

import pydantic
from langchain.memory import ConversationTokenBufferMemory
from langchain.tools.render import render_text_description
from langchain_core.callbacks import BaseCallbackHandler
from langchain_core.language_models import BaseChatModel
from langchain_core.output_parsers import PydanticOutputParser, StrOutputParser
from langchain_core.outputs import GenerationChunk, ChatGenerationChunk, LLMResult
from langchain_core.prompts import PromptTemplate
from langchain_core.tools import StructuredTool
from langchain_openai import ChatOpenAI
from pydantic import BaseModel, Field, ValidationError

from typing import List
from langchain_core.tools import StructuredTool

def search_train_ticket(
    origin: str,
    destination: str,
    date: str,
    departure_time_start: str,
    departure_time_end: str
) -> List[dict[str, str]]:
    """按指定条件查询火车票"""
    # mock train list
    return [
        {
            "train_number": "G1234",
            "origin": "北京",
            "destination": "上海",
            "departure_time": "2024-06-01 8:00",
            "arrival_time": "2024-06-01 12:00",
            "price": "100.00",
            "seat_type": "商务座",
        },
        {
            "train_number": "G5678",
            "origin": "北京",
            "destination": "上海",
            "departure_time": "2024-06-01 18:30",
            "arrival_time": "2024-06-01 22:30",
            "price": "100.00",
            "seat_type": "商务座",
        },
        {
            "train_number": "G9012",
            "origin": "北京",
            "destination": "上海",
            "departure_time": "2024-06-01 19:00",
            "arrival_time": "2024-06-01 23:00",
            "price": "100.00",
            "seat_type": "商务座",
        }
    ]

def purchase_train_ticket(train_number: str) -> dict:
    """购买火车票"""
    return {
        "result": "success",
        "message": "购买成功",
        "data": {
            "train_number": "G1234",
            "seat_type": "商务座",
            "seat_number": "7-17A"
        }
    }

search_train_ticket_tool = StructuredTool.from_function(
    func=search_train_ticket,
    name="查询火车票",
    description="查询指定日期可用的火车票。",
)
purchase_train_ticket_tool = StructuredTool.from_function(
    func=purchase_train_ticket,
    name="购买火车票",
    description="购买火车票。会返回购买结果 (result), 和座位号 (seat_number)",
)
finish_placeholder = StructuredTool.from_function(
    func=lambda: None,
    name="FINISH",
    description="用于表示任务完成的占位符工具"
)

tools = [search_train_ticket_tool, purchase_train_ticket_tool, finish_placeholder]

prompt_text = """ 你是强大的 AI 火车票助手，可以使用工具与指令查询并购买火车票
你的任务是：{task_description}
你可以使用以下工具或指令，它们又称为动作或 actions: {tools}
当前的任务执行记录：{memory}
按照以下格式输出：
任务：你收到的需要执行的任务
思考：观察你的任务和执行记录，并思考你下一步应该采取的行动
然后，根据以下格式说明，输出你选择执行的动作/工具：{format_instructions}
"""

final_prompt = """ 你的任务是：{task_description}
以下是你的思考过程和使用工具与外部资源交互的结果。
{memory}
你已经完成任务。
现在请根据上述结果简要总结出你的最终答案。
直接给出答案。不用再解释或分析你的思考过程。
"""

class Action(BaseModel):
    """结构化定义工具的属性"""
    name: str = Field(description="工具或指令名称")
    args: Optional[Dict[str, Any]] = Field(description="工具或指令参数，由参数名称和参数值组成")

class MyPrintHandler(BaseCallbackHandler):
    """自定义 LLM CallbackHandler，用于打印大模型返回的思考过程"""
    def __init__(self):
        super().__init__()

    def on_llm_new_token(
        self,
        token: str,
        *,
        chunk: Optional[Union[GenerationChunk, ChatGenerationChunk]] = None,
        run_id: UUID,
        parent_run_id: Optional[UUID] = None,
        **kwargs: Any,
    ) -> Any:
        end = ""
        content = token + end
        sys.stdout.write(content)
        sys.stdout.flush()
        return token

    def on_llm_end(self, response: LLMResult, **kwargs: Any) -> Any:
        end = ""
        content = "\n" + end
        sys.stdout.write(content)
        sys.stdout.flush()
        return response

class MyAgent:
    def __init__(
        self,
        llm: BaseChatModel = ChatOpenAI(
            model="gpt-4-turbo",
            temperature=0,
            model_kwargs={"seed": 42},
        ),
        tools=None,
        prompt: str = "",
        final_prompt: str = "",
        max_thought_steps: Optional[int] = 10,
    ):
        if tools is None:
            tools = []
        self.llm = llm
        self.tools = tools
        self.final_prompt = PromptTemplate.from_template(final_prompt)
        self.max_thought_steps = max_thought_steps  # 最多思考步数，避免死循环
        self.output_parser = PydanticOutputParser(pydantic_object=Action)
        self.prompt = self.__init_prompt(prompt)
        self.llm_chain = self.prompt | self.llm | StrOutputParser()  # 主流程的 LCEL
        self.verbose_printer = MyPrintHandler()

    def __init_prompt(self, prompt):
        return PromptTemplate.from_template(prompt).partial(
            tools=render_text_description(self.tools),
            format_instructions=self.__chinese_friendly(
                self.output_parser.get_format_instructions(),
            ),
        )

    def run(self, task_description):
        """Agent 主流程"""
        thought_step_count = 0
        agent_memory = ConversationTokenBufferMemory(
            llm=self.llm,
            max_token_limit=4000,
        )
        agent_memory.save_context({"input": "\ninit"}, {"output": "\n开始"})

        while thought_step_count < self.max_thought_steps:
            print(f">>>>Round: {thought_step_count}<<<<")
            action, response = self.__step(task_description=task_description, memory=agent_memory)

            # 如果是结束指令，执行最后一步
            if action.name == "FINISH":
                break

            # 执行动作
            observation = self.__exec_action(action)
            print(f"----\nObservation:\n{observation}")

            # 更新记忆
            self.__update_memory(agent_memory, response, observation)
            thought_step_count += 1

        if thought_step_count >= self.max_thought_steps:
            reply = "抱歉，我没能完成您的任务。"
        else:
            final_chain = self.final_prompt | self.llm | StrOutputParser()
            reply = final_chain.invoke({
                "task_description": task_description,
                "memory": agent_memory
            })
        return reply

    def __step(self, task_description, memory) -> Tuple[Action, str]:
        """执行一步思考"""
        response = ""
        for s in self.llm_chain.stream(
            {"task_description": task_description, "memory": memory},
            config={"callbacks": [self.verbose_printer]}
        ):
            response += s
        action = self.output_parser.parse(response)
        return action, response

    def __exec_action(self, action: Action) -> str:
        observation = "没有找到工具"
        for tool in self.tools:
            if tool.name == action.name:
                try:
                    observation = tool.run(action.args)
                except ValidationError as e:
                    observation = f"Validation Error in args: {str(e)}, args: {action.args}"
                except Exception as e:
                    observation = f"Error: {str(e)}, {type(e).__name__}, args: {action.args}"
        return observation

    @staticmethod
    def __update_memory(agent_memory, response, observation):
        agent_memory.save_context(
            {"input": response},
            {"output": "\n返回结果:\n" + str(observation)}
        )

    @staticmethod
    def __chinese_friendly(string) -> str:
        lines = string.split('\n')
        for i, line in enumerate(lines):
            if line.startswith('{') and line.endswith('}'):
                try:
                    lines[i] = json.dumps(json.loads(line), ensure_ascii=False)
                except:
                    pass
        return '\n'.join(lines)

if __name__ == "__main__":
    my_agent = MyAgent(
        tools=tools,
        prompt=prompt_text,
        final_prompt=final_prompt,
    )
    task = "帮我买 24 年 6 月 1 日早上去上海的火车票"
    reply = my_agent.run(task)
    print(reply)

>>>>Round: 0<<<< 
任务：帮我买 24 年 6 月 1 日早上去上海的火车票 
思考：根据任务需求，首先需要查询 2024 年 6 月 1 日早上从当前位置到上海的火车票。这需要使用'查询火车票'工具，指定出发地、目的地、日期以及早上的时间范围。
动作/工具：{"name":"查询火车票", "args":{"origin":"当前位置", "destination":"上海", "date":"2024-06-01", "departure_time_start":"00:00", "departure_time_end":"12:00"}} 
---- Observation:
[{'train_number':'G1234', 'origin':'北京', 'destination':'上海', 'departure_time':'2024-06-01 8:00', 'arrival_time':'2024-06-01 12:00', 'price':'100.00', 'seat_type':'商务座'}, {'train_number':'G5678', 'origin':'北京', 'destination':'上海', 'departure_time':'2024-06-01 18:30', 'arrival_time':'2024-06-01 22:30', 'price':'100.00', 'seat_type':'商务座'}, {'train_number':'G9012', 'origin':'北京', 'destination':'上海', 'departure_time':'2024-06-01 19:00', 'arrival_time':'2024-06-01 23:00', 'price':'100.00', 'seat_type':'商务座'}]

>>>>Round: 1<<<< 
任务：帮我买 24 年 6 月 1 日早上去上海的火车票 
思考：根据查询结果，有三个车次可供选择，但只有车次 G1234 符合早上出发的要求，因此应选择购买这个车次的票。
动作/工具：{"name":"购买火车票", "args":{"train_number":"G1234"}} 
---- Observation:
{'result':'success', 'message':'购买成功', 'data':{'train_number':'G1234', 'seat_type':'商务座', 'seat_number':'7-17A'}}

>>>>Round: 2<<<< 
任务：帮我买 24 年 6 月 1 日早上去上海的火车票 
思考：根据执行记录，已经成功购买了 2024 年 6 月 1 日早上从北京到上海的火车票（车次 G1234）。因此，接下来的任务是完成这个购票任务。
动作/工具：{"name":"FINISH"} 
购买成功。您已成功购买 2024 年 6 月 1 日早上从北京出发前往上海的火车票，车次为 G1234，座位类型为商务座，座位号为 7-17A。

基于 LangChain 从零搭建 AI Agent 实战指南

为什么大模型需要 Agent 化

ReAct Agent 核心原理

LLM Agent 的演进路线

ReAct Agent 的组成部分

Prompt 模板设计

更多推荐文章

相关免费在线工具

核心代码实现

环境准备与依赖

定义工具（Tools）

Prompt 配置

主要任务 Prompt

最终回复 Prompt

辅助工具类

定义 Agent 主类

测试运行

运行结果解析

第一轮思考

第二轮思考

第三轮思考

更多推荐文章

相关免费在线工具

基于 LangChain 从零搭建 AI Agent 实战指南

为什么大模型需要 Agent 化

ReAct Agent 核心原理

LLM Agent 的演进路线

ReAct Agent 的组成部分

Prompt 模板设计

微信扫一扫，关注极客日志

更多推荐文章

相关免费在线工具

核心代码实现

环境准备与依赖

定义工具（Tools）

Prompt 配置

主要任务 Prompt

最终回复 Prompt

辅助工具类

定义 Agent 主类

测试运行

运行结果解析

第一轮思考

第二轮思考

第三轮思考

微信扫一扫，关注极客日志

更多推荐文章

相关免费在线工具