AG-UI: A Unified Protocol for AI Front-End Interaction

enum EventType {
  TEXT_MESSAGE_START = "TEXT_MESSAGE_START",
  TEXT_MESSAGE_CONTENT = "TEXT_MESSAGE_CONTENT",
  TEXT_MESSAGE_END = "TEXT_MESSAGE_END",
  TOOL_CALL_START = "TOOL_CALL_START",
  TOOL_CALL_ARGS = "TOOL_CALL_ARGS",
  TOOL_CALL_END = "TOOL_CALL_END",
  TOOL_CALL_RESULT = "TOOL_CALL_RESULT",
  STATE_SNAPSHOT = "STATE_SNAPSHOT",
  STATE_DELTA = "STATE_DELTA",
  MESSAGES_SNAPSHOT = "MESSAGES_SNAPSHOT",
  RAW = "RAW",
  CUSTOM = "CUSTOM",
  RUN_STARTED = "RUN_STARTED",
  RUN_FINISHED = "RUN_FINISHED",
  RUN_ERROR = "RUN_ERROR",
  STEP_STARTED = "STEP_STARTED",
  STEP_FINISHED = "STEP_FINISHED",
}

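Event category	Event type	Event name	Description	Typical use
Lifecycle events (5): flow control, error handling	RUN_STARTED	Run started	Marks the start of an agent run	Initialize UI state and show a loading indicator
	STEP_STARTED	Step started	Marks the start of a single step	Show the step currently executing
	STEP_FINISHED	Step finished	Marks the completion of a single step	Update step status and show progress
	RUN_FINISHED	Run finished	Marks the completion of the whole run	Clean up state and show the final result
	RUN_ERROR	Run error	Marks an error during execution	Handle the error and show an error message
Text message events (3): real-time chat, streaming output	TEXT_MESSAGE_START	Message start	Starts a new text message	Create the message container
	TEXT_MESSAGE_CONTENT	Message content	Streams the message content	Show a live typing effect
	TEXT_MESSAGE_END	Message end	Marks the end of message transmission	Finish rendering the message
Tool call events (4): extensibility, transparency	TOOL_CALL_START	Tool call start	Starts a call to an external tool	Show the tool call status
	TOOL_CALL_ARGS	Tool arguments	Transmits the tool call arguments	Show the call arguments
	TOOL_CALL_RESULT	Tool call result	Returns the result of the tool execution	Show the data returned by the tool
	TOOL_CALL_END	Tool call end	Marks the tool call as complete	Show the call result
State management events (3): data sync, consistency	STATE_SNAPSHOT	State snapshot	Complete state data	Sync the full application state
	STATE_DELTA	State delta	Incremental state update	Update part of the state efficiently
	MESSAGES_SNAPSHOT	Messages snapshot	Complete message history	Sync the conversation history
Special events (2): system integration, customization	RAW	Raw event	Unprocessed raw data	Debugging and extension
	CUSTOM	Custom event	User-defined event	Special business logic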
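The two state rows in the table (full snapshot versus incremental delta) can be sketched as follows. This is a minimal illustration, not the protocol's reference implementation. It assumes that a STATE_SNAPSHOT event carries the state in a `snapshot` field and that a STATE_DELTA event carries a list of JSON Patch (RFC 6902) operations in a `delta` field; the table above does not spell out either detail. The helper names `apply_state_event` and `_apply_patch_op` are hypothetical, and only `add`, `replace`, and `remove` on dictionary keys are handled.

```python
import copy
from typing import Any


def apply_state_event(state: dict[str, Any], event: dict[str, Any]) -> dict[str, Any]:
    """Return the new state after a STATE_SNAPSHOT or STATE_DELTA event."""
    if event["type"] == "STATE_SNAPSHOT":
        # A snapshot replaces the whole state (field name "snapshot" is an assumption).
        return copy.deepcopy(event["snapshot"])
    if event["type"] == "STATE_DELTA":
        # A delta is applied on top of a copy of the current state.
        new_state = copy.deepcopy(state)
        for op in event["delta"]:
            _apply_patch_op(new_state, op)
        return new_state
    # Any other event type leaves the state unchanged.
    return state


def _apply_patch_op(doc: dict[str, Any], op: dict[str, Any]) -> None:
    """Apply one add/replace/remove operation addressed by a slash-separated path."""
    # Simplification: dictionary keys only, no array indices and no "~" escapes.
    keys = op["path"].lstrip("/").split("/")
    parent = doc
    for key in keys[:-1]:
        parent = parent[key]
    if op["op"] in ("add", "replace"):
        parent[keys[-1]] = op["value"]
    elif op["op"] == "remove":
        del parent[keys[-1]]
    else:
        raise ValueError(f"unsupported op: {op['op']}")
```

A front end would typically keep the latest state in memory, replace it on every snapshot, and patch it on every delta, which is why deltas are the cheaper option for small changes.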

sequenceDiagram
    participant User as 👤 User
    participant Frontend as 🖥️ Front end
    participant Agent as 🤖 AI Agent
    participant Tool as 🛠️ Tool
    User->>Frontend: Send message
    Frontend->>Agent: User input
    Agent->>Frontend: RUN_STARTED
    Agent->>Frontend: STEP_STARTED
    Agent->>Frontend: TEXT_MESSAGE_START
    Agent->>Frontend: TEXT_MESSAGE_CONTENT (streamed)
    Agent->>Frontend: TEXT_MESSAGE_CONTENT (streamed)
    Agent->>Frontend: TEXT_MESSAGE_END
    Agent->>Frontend: TOOL_CALL_START
    Agent->>Frontend: TOOL_CALL_ARGS
    Agent->>Tool: Execute tool
    Tool->>Agent: Tool result
    Agent->>Frontend: TOOL_CALL_RESULT
    Agent->>Frontend: TOOL_CALL_END
    Agent->>Frontend: STATE_SNAPSHOT
    Agent->>Frontend: STEP_FINISHED
    Agent->>Frontend: RUN_FINISHED
    Frontend->>User: Show the complete response
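The front-end side of this flow can be sketched as a small reducer that folds incoming events into what the UI should display. This is only an illustration under stated assumptions: events are plain dictionaries whose `type`, `messageId`, and `delta` fields match the names used elsewhere in this article, and `ChatView` and `reduce_event` are hypothetical names, not part of any SDK.

```python
from dataclasses import dataclass, field


@dataclass
class ChatView:
    """What the UI needs to render: a busy flag and the text of each message."""
    running: bool = False
    messages: dict[str, str] = field(default_factory=dict)  # messageId -> text so far
    finished_ids: list[str] = field(default_factory=list)   # messages fully received


def reduce_event(view: ChatView, event: dict) -> ChatView:
    """Update the view in place for one event and return it."""
    kind = event["type"]
    if kind == "RUN_STARTED":
        view.running = True                      # show the loading indicator
    elif kind == "TEXT_MESSAGE_START":
        view.messages[event["messageId"]] = ""   # create the message container
    elif kind == "TEXT_MESSAGE_CONTENT":
        view.messages[event["messageId"]] += event["delta"]  # append streamed text
    elif kind == "TEXT_MESSAGE_END":
        view.finished_ids.append(event["messageId"])
    elif kind in ("RUN_FINISHED", "RUN_ERROR"):
        view.running = False                     # hide the loading indicator
    # Other event types are ignored in this sketch.
    return view
```

Because each event only touches one small part of the view, the UI can re-render after every event and still show a smooth typing effect.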

[
  // User
  { id: "msg_1", role: "user", content: "What's the weather in New York?" },
  // Assistant response with tool call
  {
    id: "msg_2",
    role: "assistant",
    content: "Let me check the weather for you.",
    toolCalls: [
      {
        id: "call_1",
        type: "function",
        function: {
          name: "get_weather",
          arguments: '{"location": "New York", "unit": "celsius"}'
        }
      }
    ]
  },
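  // Note: the agent only announces the tool call to the front end. The user decides there whether to run the tool, and the result is passed back to the agent so it can continue.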
  // Tool result
  {
    id: "result_1",
    role: "tool",
    content: '{"temperature": 22, "condition": "Partly Cloudy", "humidity": 65}',
    toolCallId: "call_1"
  },
  // Assistant's final response using tool results
  {
    id: "msg_3",
    role: "assistant",
    content: "The weather in New York is partly cloudy with a temperature of 22°C and 65% humidity."
  }
]

import { EventType, ToolCallStartEvent, ToolCallArgsEvent, ToolCallResultEvent, ToolCallEndEvent } from "@ag-ui/core";

// 1. Start the tool call: the agent tells the front end that a tool needs to be called
const toolStart: ToolCallStartEvent = {
  type: EventType.TOOL_CALL_START,
  toolCallId: "tool_456",
  toolCallName: "get_weather",
  parentMessageId: "msg_123"
};

// 2. Transmit the arguments: show the user the arguments of the tool call
const toolArgs: ToolCallArgsEvent = {
  type: EventType.TOOL_CALL_ARGS,
  toolCallId: "tool_456",
  delta: '{"location": "New York", "unit": "celsius"}'
};

// 3. Return the result: after the user confirms, the front end runs the tool and returns the result
const toolResult: ToolCallResultEvent = {
  type: EventType.TOOL_CALL_RESULT,
  messageId: "msg_124",
  toolCallId: "tool_456",
  content: '{"temperature": 22, "condition": "Partly Cloudy", "humidity": 65}',
  role: "tool"
};

// 4. End the call: the tool call is complete
const toolEnd: ToolCallEndEvent = {
  type: EventType.TOOL_CALL_END,
  toolCallId: "tool_456"
};
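A receiver has to pair these events by `toolCallId` before it can act on a call. The sketch below shows one way to do that. It assumes the arguments may arrive as several TOOL_CALL_ARGS fragments that only form valid JSON once concatenated (the example above sends a single fragment), and `ToolCallBuffer` is a hypothetical helper name rather than an SDK class.

```python
import json
from typing import Optional


class ToolCallBuffer:
    """Collects TOOL_CALL_* events and yields complete calls."""

    def __init__(self) -> None:
        self._names: dict[str, str] = {}         # toolCallId -> tool name
        self._chunks: dict[str, list[str]] = {}  # toolCallId -> argument fragments

    def feed(self, event: dict) -> Optional[tuple[str, dict]]:
        """Consume one event; return (tool name, parsed args) on TOOL_CALL_END, else None."""
        call_id = event.get("toolCallId")
        if event["type"] == "TOOL_CALL_START":
            self._names[call_id] = event["toolCallName"]
            self._chunks[call_id] = []
        elif event["type"] == "TOOL_CALL_ARGS":
            self._chunks[call_id].append(event["delta"])
        elif event["type"] == "TOOL_CALL_END":
            # Join the fragments and parse them only once the call is closed.
            raw = "".join(self._chunks.pop(call_id))
            return self._names.pop(call_id), json.loads(raw or "{}")
        # TOOL_CALL_RESULT and unrelated events are ignored by this buffer.
        return None
```

Keying the buffers by `toolCallId` keeps interleaved calls apart, so two tools streaming their arguments at the same time do not corrupt each other.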

Framework	Status	AG-UI Resources
No-framework	✅ Supported	➡️ Docs coming soon
LangGraph	✅ Supported	➡️ Demo
Mastra	✅ Supported	➡️ Demo
CrewAI	✅ Supported	➡️ Demo
AG2	✅ Supported	➡️ Demo
Agno	✅ Supported	➡️ Docs
LlamaIndex	✅ Supported	➡️ Docs
Pydantic AI	🛠️ In Progress	–
Vercel AI SDK	🛠️ In Progress	–
Google ADK	🛠️ In Progress	–
OpenAI Agent SDK	💡 Open to Contributions	–
AWS Bedrock Agents	💡 Open to Contributions	–
Cloudflare Agents	💡 Open to Contributions	–
Strands Agents SDK	💡 Open to Contributions	–

Language SDK	Status	AG-UI Resources
.NET	🛠️ In Progress	➡️ PR
Nim	🛠️ In Progress	➡️ PR
Rust	🛠️ In Progress	–

pip install copilotkit ...

import os
from dotenv import load_dotenv
load_dotenv()

from fastapi import FastAPI
import uvicorn
from copilotkit.integrations.fastapi import add_fastapi_endpoint
from copilotkit import CopilotKitRemoteEndpoint, LangGraphAgent
from sample_agent.agent import graph

app = FastAPI()
sdk = CopilotKitRemoteEndpoint(
    agents=[
        LangGraphAgent(
            name="sample_agent",
            description="A sample agent",
            graph=graph,
        )
    ],
)
add_fastapi_endpoint(app, sdk, "/copilotkit")

def main():
    port = int(os.getenv("PORT", "8080"))
    uvicorn.run(
        "sample_agent.demo:app",
        host="0.0.0.0",
        port=port,
        reload=True,
    )

if __name__ == "__main__":
    main()

npx create-next-app@latest

npm install @copilotkit/react-ui @copilotkit/react-core
npm install @copilotkit/runtime class-validator

...
const runtime = new CopilotRuntime({
  remoteEndpoints: [{ url: "http://localhost:8080/copilotkit" }],
});

export const POST = async (req: NextRequest) => {
  const { handleRequest } = copilotRuntimeNextJSAppRouterEndpoint({
    runtime,
    serviceAdapter,
    endpoint: "/api/copilotkit",
  });
  return handleRequest(req);
};

<CopilotKit
  agent="sample_agent"
  runtimeUrl="/api/copilotkit"
  showDevConsole={false}
>
  {children}
</CopilotKit>

import { CopilotSidebar } from "@copilotkit/react-ui";
import { useCoAgent, useCoAgentStateRender, useCopilotAction, useLangGraphInterrupt } from "@copilotkit/react-core";

export default function App() {
  return (
    <>
      <Home />
      <CopilotSidebar
        defaultOpen={true}
        instructions={"Help the user as much as you can. Answer in the best way possible based on the data you have."}
        labels={{
          title: "Smart AI Copilot",
          initial: `# 👋 Hello! I am your smart Copilot.

Demo features:
- **Shared state**: search history shown in real time
- **Front-end tools**: call a front-end tool to say hello
- **Generative UI**: fetch weather information and show it as a card
- **HITL flow**: human review of tool calls`,
        }}
      />
    </>
  );
}

from langchain_mcp_adapters.client import MultiServerMCPClient
...
async def get_all_tools():
    """Prepare all tools in one place so the MCP client is initialized only once.

    Returns:
        list: all available tools.
    """
    global _all_tools
    # Already initialized: return the cached list
    if _all_tools is not None:
        return _all_tools
    # Create an MCP client to obtain the search tools
    try:
        client = MultiServerMCPClient(
            {
                "tavily-mcp": {
                    "command": "npx",
                    "args": ["-y", "tavily-mcp"],
                    "env": {**os.environ},
                    "transport": "stdio",
                }
            }
        )
        # Fetch the MCP tools
        mcp_tools = await client.get_tools()
        _all_tools = mcp_tools + [get_weather]
        logger.info(f"Tools initialized; available tools: {[tool.name for tool in _all_tools]}")
    except Exception as e:
        logger.warning(f"⚠️ MCP tool initialization failed: {e}")
        # If the MCP tools fail, fall back to the weather tool only
        _all_tools = [get_weather]
        logger.info(f"Using fallback tools: {[tool.name for tool in _all_tools]}")
    return _all_tools

...
useCopilotAction({
  name: "get_weather",
  description: "Get the weather for a given location.",
  available: "disabled", // keep disabled so it is not treated as a front-end tool
  render: ({ status, args, result }) => {
    return (
      <p className="text-gray-500 mt-2">
        {status !== "complete" && "Calling weather API..."}
        {status === "complete" && <WeatherCard location={args.location} result={result} themeColor="#3b82f6" />}
      </p>
    );
  },
});

useCopilotAction({
  name: "sayHello", // action name; the agent calls the tool by this name
  description: "Say hello to the given user", // tells the agent what the action is for
  parameters: [
    // parameter list
    { name: "name", type: "string", description: "Name of the person to greet" },
  ],
  render: "Sending greeting...", // (optional) text shown in the chat while the action runs
  handler: async ({ name }) => {
    // the actual logic (async is supported)
    alert(`Hello, ${name}!`); // show a browser alert
    return "Greeting sent to " + name; // result returned to the agent
  },
});
...

def should_continue(state: AgentState):
    last_message = state["messages"][-1]
    if not hasattr(last_message, 'tool_calls') or not last_message.tool_calls:
        return END
    # Check whether the tool call targets the front end or the back end
    tool_call_name = last_message.tool_calls[0].get("name")
    frontend_actions = state["copilotkit"]["actions"]
    is_frontend_action = any(action.get("name") == tool_call_name for action in frontend_actions)
    # Front-end action: end here and let the CopilotKit front end handle it
    if is_frontend_action:
        return END
    else:
        # Otherwise, route to the back-end tool node
        return "tool_node"
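The routing rule above has three outcomes, which the following standalone sketch makes explicit without requiring LangGraph or CopilotKit. Everything here is a simplified stand-in: `route` is a hypothetical function, tool calls are plain dictionaries, and the local `END` constant merely plays the role of the `END` marker used in the graph code.

```python
END = "__end__"  # local stand-in for the graph's END marker


def route(tool_calls: list[dict], frontend_action_names: set[str]) -> str:
    """Decide where the graph goes after the model has responded."""
    if not tool_calls:
        # No tool requested: the run is complete.
        return END
    if tool_calls[0]["name"] in frontend_action_names:
        # Front-end action: stop so the front end can execute it.
        return END
    # Any other tool is executed by the back-end tool node.
    return "tool_node"


# Example: "sayHello" is registered in the browser, "get_weather" on the server.
print(route([{"name": "sayHello"}], {"sayHello"}))
print(route([{"name": "get_weather"}], {"sayHello"}))
```

Ending the graph for front-end actions is what hands control back to the browser, where the matching handler runs and returns its result to the agent.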

class AgentState(CopilotKitState):
    search_history: list[dict] = []

...
    # Update the state
    updated_state = {"messages": response}
    # If a search tool was called, record it in the search history (search start phase)
    if response.tool_calls and response.tool_calls[0].get("name") in ["tavily-search", "tavily-extract", "tavily-crawl"]:
        search_history = state.get("search_history", [])
        search_query = response.tool_calls[0].get("args", {})
        # Create the history record, marked as not completed at the start
        search_record = {
            "query": search_query.get("query", ""),
            "completed": False,
            "timestamp": time.strftime("%Y-%m-%d %H:%M:%S"),
            "tool_name": response.tool_calls[0].get("name"),
        }
        logger.info(f"🔍 Adding search query to history (start): {search_record}")
        search_history.append(search_record)
        updated_state["search_history"] = search_history
    return updated_state
...

...
const { state } = useCoAgent<AgentState>({
    name: "sample_agent",
    initialState: { search_history: [] },
});
...
useCoAgentStateRender<AgentState>({
    name: "sample_agent",
    render: ({ status, state, nodeName }) => {
        return (
            <div>
                {state.search_history?.map((search, index) => (
                    <div key={index}>
                        {search.completed ? "✅" : "❌"} Running: {search.query} {search.completed ? "" : "..."}
                    </div>
                ))}
            </div>
        );
    },
});
...
const [localHistory, setLocalHistory] = useState<AgentState['search_history']>([]);
useEffect(() => {
    if (state.search_history && state.search_history.length > 0 ) {
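        // The back end appends new records, so the latest search is the last element
        const latestSearch = state.search_history[state.search_history.length - 1];
        if (latestSearch.query.length === 0) { return; }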
        setLocalHistory(prevHistory => {
            if (prevHistory.length > 0 && prevHistory[prevHistory.length - 1].query === latestSearch.query) {
                const newHistory = [...prevHistory];
                newHistory[prevHistory.length - 1] = latestSearch;
                return newHistory;
            } else {
                return [...prevHistory, latestSearch];
            }
        });
    }
}, [state.search_history]);

approval_request = {
    "type": "tool_approval_request",
    "tool_name": tool_call.get("name"),
    "tool_args": tool_call.get("args", {}),
    "tool_id": tool_call.get("id"),
    "timestamp": "2025-07-08"
}
# Pause the graph and wait for the human reviewer's decision
approve_status = interrupt(approval_request)
# Rejected
if approve_status in ["rejected", "reject"]:
    ....
# Approved: go ahead with the tool call
elif approve_status in ["approved", "approve"]:
    ....

useLangGraphInterrupt({
    render: ({ event, resolve }) => {
        const { tool_name, tool_args } = event.value;
        return (
            <div className="bg-gradient-to-br from-blue-50 to-indigo-50 border border-blue-200 rounded-2xl p-6 my-4 shadow-lg">
                {/* Title */}
                <div className="flex items-center gap-3 mb-4">
                    <div className="w-10 h-10 bg-blue-100 rounded-full flex items-center justify-center">
                        <span className="text-lg">🔧</span>
                    </div>
                    <div>
                        <h3 className="text-lg font-bold text-gray-800">Tool Call Review</h3>
                        <p className="text-sm text-gray-600">Please confirm whether to run the following tool call</p>
                    </div>
                </div>
                {/* Tool information */}
                <div className="bg-white rounded-xl p-4 mb-4 border border-gray-100">
                    <div className="grid grid-cols-1 gap-3">
                        <div>
                            <label className="block text-xs font-medium text-gray-500 mb-1">Tool name</label>
                            <div className="bg-gray-50 px-3 py-2 rounded-lg">
                                <code className="text-blue-600 font-mono text-sm">{tool_name}</code>
                            </div>
                        </div>
                        <div>
                            <label className="block text-xs font-medium text-gray-500 mb-1">Arguments</label>
                            <div className="bg-gray-50 px-3 py-2 rounded-lg max-h-24 overflow-y-auto">
                                <pre className="text-xs text-gray-700 whitespace-pre-wrap font-mono">{JSON.stringify(tool_args, null, 2)}</pre>
                            </div>
                        </div>
                    </div>
                </div>
                {/* Action buttons */}
                <div className="mt-4">
                    <div className="flex gap-2">
                        <button type="button" onClick={() => resolve("approve")} className="flex-1 bg-green-500 hover:bg-green-600 text-white font-medium py-2 px-4 rounded-lg transition-colors duration-200 flex items-center justify-center gap-2 text-sm" >
                            <span>✅</span> Approve
                        </button>
                        <button type="button" onClick={() => resolve("reject")} className="flex-1 bg-red-500 hover:bg-red-600 text-white font-medium py-2 px-4 rounded-lg transition-colors duration-200 flex items-center justify-center gap-2 text-sm" >
                            <span>❌</span> Reject
                        </button>
                    </div>
                </div>
            </div>
        );
    }
});

import express, {Request, Response} from 'express';
import dotenv from 'dotenv';
dotenv.config();
import {RunAgentInputSchema, RunAgentInput, EventType, Message} from '@ag-ui/core';
import {EventEncoder} from '@ag-ui/encoder';
import {v4 as uuidv4} from 'uuid';
import OpenAI from 'openai';

const app = express();
app.use(express.json());

app.post('/awp', async (req: Request, res: Response) => {
    console.log('app.post > req:');
    const encoder = new EventEncoder();
    try {
        // Parse and validate the request body
        const input: RunAgentInput = RunAgentInputSchema.parse(req.body);
        // Set the SSE headers
        res.setHeader('Content-Type', 'text/event-stream');
        res.setHeader('Cache-Control', 'no-cache');
        res.setHeader('Connection', 'keep-alive');
        // Emit the RUN_STARTED event
        const runStarted = { type: EventType.RUN_STARTED, threadId: input.threadId, runId: input.runId };
        res.write(encoder.encode(runStarted));
        // Initialize the OpenAI client
        const client = new OpenAI({ apiKey: process.env.OPENAI_API_KEY });
        // Convert the AG-UI messages sent by the front end into the OpenAI message format
        const requestMessages = input.messages
            .filter((msg: Message) => ['user', 'system', 'assistant'].includes(msg.role))
            .map((msg: Message) => ({ role: msg.role as 'user' | 'system' | 'assistant', content: msg.content || '' }));
        // Generate a message ID
        const messageId = uuidv4();
        // Emit the TEXT_MESSAGE_START event
        const textMessageStart = { type: EventType.TEXT_MESSAGE_START, messageId, role: 'assistant' };
        res.write(encoder.encode(textMessageStart));
        // Create a streaming chat completion request
        const stream = await client.chat.completions.create({ model: 'gpt-4o', messages: requestMessages, stream: true });
        console.log('🚀 > app.post > stream:', stream);
        // Consume the stream and emit a TEXT_MESSAGE_CONTENT event for each chunk
        for await (const chunk of stream) {
            if (chunk.choices[0]?.delta?.content) {
                const content = chunk.choices[0].delta.content;
                const textMessageContent = { type: EventType.TEXT_MESSAGE_CONTENT, messageId, delta: content };
                res.write(encoder.encode(textMessageContent));
            }
        }
        // Emit the TEXT_MESSAGE_END event
        const textMessageEnd = { type: EventType.TEXT_MESSAGE_END, messageId };
        res.write(encoder.encode(textMessageEnd));
        console.log('🚀 > app.post > TEXT_MESSAGE_END:');
        // Emit the RUN_FINISHED event
        const runFinished = { type: EventType.RUN_FINISHED, threadId: input.threadId, runId: input.runId };
        res.write(encoder.encode(runFinished));
        console.log('🚀 > app.post > RUN_FINISHED:');
        // End the response
        res.end();
    } catch (error) {
        const message = error instanceof Error ? error.message : String(error);
        if (!res.headersSent) {
            // Nothing has been streamed yet, e.g. the request body failed validation
            res.status(422).json({error: message});
        } else {
            // The stream is already open, so report the failure as a RUN_ERROR event
            const runError = { type: EventType.RUN_ERROR, message };
            res.write(encoder.encode(runError));
            res.end();
        }
    }
});

app.listen(3001, () => {
    console.log('Server running on http://localhost:3001');
});
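A client of this endpoint has to split the response body back into events. The sketch below shows the idea under one explicit assumption: that the encoder writes each event as a Server-Sent Events frame of the form `data: <JSON>` followed by a blank line, which the server code above does not show directly. `parse_sse` is a hypothetical helper, and it works on text chunks so it can be tried without a running server.

```python
import json
from typing import Iterable, Iterator


def parse_sse(chunks: Iterable[str]) -> Iterator[dict]:
    """Yield one decoded JSON event per complete SSE frame, buffering partial chunks."""
    buffer = ""
    for chunk in chunks:
        buffer += chunk
        # A blank line ("\n\n") terminates a frame; anything after it stays buffered.
        while "\n\n" in buffer:
            frame, buffer = buffer.split("\n\n", 1)
            data_lines = [
                line[5:].lstrip()
                for line in frame.split("\n")
                if line.startswith("data:")
            ]
            if data_lines:
                yield json.loads("\n".join(data_lines))


# Network chunks may cut a frame in half, so the second event spans two chunks here.
sample = [
    'data: {"type": "RUN_STARTED"}\n\ndata: {"type": "TEXT_MES',
    'SAGE_CONTENT", "delta": "Hi"}\n\n',
]
for event in parse_sse(sample):
    print(event["type"])
```

Buffering until the blank line arrives is the important part: chunk boundaries on the wire carry no meaning, so decoding each chunk on its own would fail whenever a frame is split.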

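Introduction

1. What Is AG-UI?

1.1 Core Definition

1.2 Protocol Positioning

2. Why Is AG-UI Needed?

Technical fragmentation

Difficulty of real-time interaction

Lack of human-AI collaboration

3. AG-UI Core Architecture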

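3.1 Overall Architecture Design

3.2 Core Features

Unified event stream

Real-time interaction

Tool orchestration

Shared state

Concurrency and cancellation

Security boundaries

3.3 Events

EventType enum definition

AG-UI event categories at a glance

Event flow example

3.4 Agents

3.5 Messages

3.6 State Management

3.7 Tools

4. Technical Advantages of AG-UI

4.1 Flexibility and Compatibility

Flexible event structure

4.2 Developer Friendliness

Rich SDK support

Ready-made integrations

5. Demos

5.1 CopilotKit Demo

5.1.1 Back End: Python + CopilotKit LangGraph SDK

5.1.2 Front End: React + CopilotKit React UI

5.1.3 Tool Calls (back-end tools such as search, database access, and MCP, as well as front-end UI "tools" such as changing styles)

5.1.4 State Sharing

5.1.5 HITL (Human-in-the-Loop) Review Flow

5.2 Node.js Example

6. Summary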
