大模型 API 实战：打造带 RAG 的电商客服机器人 | 极客日志

PythonAI

大模型 API 实战：打造带 RAG 的电商客服机器人

通过标准化大模型 API 快速注册、创建密钥，在 Windows 上用 Flask 搭建集成 RAG 知识库的电商客服机器人。包括完整 Python 代码、内存会话管理、RAG 知识库配置与本地部署流程。

清酒独酌发布于 2026/6/300 浏览

调用大模型 API 开发智能客服，听起来复杂，但用标准化接口其实并不难。我从注册账号到本地部署一个带 RAG 的电商客服机器人，全程在 Windows 上搞定，下面记录一下。

平台能做什么

这个平台相当于一个模型超市，把各种大模型封装成统一的 API，不用去国外官网挨个注册，也不用管网络问题。按量计费，先试用再付费，对个人开发者很友好。它主要优势：

一个 API Key 就能调用多个模型，省事；
支持文本、图像、视频等多模态，国内外的模型都有；
接口兼容原厂参数，不用重新学；稳定性有 SLA 保障；
控制台提供用量统计、账单、RAG 知识库管理，适合小团队运营。

接入步骤

注册与开通模型

访问控制台，用手机号注册登录。新用户登录后一般会收到体验金，可以直接抵扣。在控制台里，你可以选'先用后付'或买优惠量包。去模型广场挑需要的模型，比如 GPT-5，按流程支付开通后，在'开通管理'里能看到状态变为'运行中'就可以调用了。

创建 API Key

这是调用凭证，得保管好。进入「API Key」→「新增 API Key」，选择类型：

标准模式：只调用基础模型 API；
融合模式：可以绑定知识库，调用时自动检索上下文。

我们先用标准模式练手。填个名称（比如'GPT-5 客服项目 Key'），保存后得到一串 key。千万别硬编码在代码里，建议用环境变量，也别提交到 Git。

安全建议

测试环境和生产环境分开创建 Key；
用环境变量加载，别直接写进代码；
不在邮件或 IM 里传递 Key；
通过控制台的'调用统计'监控用量，发现异常立刻停用。

第一次调用：文本对话

环境准备：Python 3.7+，安装 requests 和 python-dotenv。

在项目里创建 .env 文件：

AIONLY_API_KEY=你的key
AIONLY_CHAT_URL=https://api.example.com/v1/chat/completions

调用代码：

import os
import requests
from dotenv import load_dotenv

load_dotenv()
API_KEY = os.getenv("AIONLY_API_KEY")
API_URL = os.getenv("AIONLY_CHAT_URL")
HEADERS = {
    "Authorization": f"Bearer {API_KEY}",
    "Content-Type": "application/json"
}

def build_chat_payload(user_message, system_prompt="你是专业的 AI 助手，回答简洁准确"):
    return {
        "model": ,
        : [
            {: , : system_prompt},
            {: , : user_message}
        ],
        : ,
        : 
    }

 ():
    payload = build_chat_payload(user_message)
    :
        response = requests.post(API_URL, headers=HEADERS, json=payload)
        response.raise_for_status()
        result = response.json()
        reply = result[][][][]
        token_usage = result[]
         {
            : ,
            : reply,
            : token_usage[],
            : token_usage[],
            : token_usage[]
        }
     requests.exceptions.RequestException  e:
        error_msg = (e)
           ():
            error_msg += 
         {: , : error_msg}

 __name__ == :
    user_input = 
    result = call_aionly_chat(user_input)
     result[]:
        (, result[])
        ()
    :
        (, result[])

相关免费在线工具

RSA密钥对生成器
生成新的随机RSA私钥和公钥pem证书。在线工具，RSA密钥对生成器在线工具，online
Mermaid 预览与可视化编辑
基于 Mermaid.js 实时预览流程图、时序图等图表，支持源码编辑与即时渲染。在线工具，Mermaid 预览与可视化编辑在线工具，online
随机西班牙地址生成器
随机生成西班牙地址（支持马德里、加泰罗尼亚、安达卢西亚、瓦伦西亚筛选），支持数量快捷选择、显示全部与下载。在线工具，随机西班牙地址生成器在线工具，online
curl 转代码
解析常见 curl 参数并生成 fetch、axios、PHP curl 或 Python requests 示例代码。在线工具，curl 转代码在线工具，online
Base64 字符串编码/解码
将字符串编码和解码为其 Base64 格式表示形式即可。在线工具，Base64 字符串编码/解码在线工具，online
Base64 文件转换器
将字符串、文件或图像转换为其 Base64 表示形式。在线工具，Base64 文件转换器在线工具，online

错误码	可能原因	解决办法
401	Key 不对或已停用	检查 Key，确认状态
403	模型没开通或 Key 类型不匹配	去开通管理确认，或重新创建对应类型 Key
429	请求频率太高	降速或联系客服提升 QPS

# 电商客服 FAQ
1. 退款申请后多久到账？
答：退款将在 1-3 个工作日内原路返回，具体到账时间以银行为准。
2. 订单发货时间？
答：普通商品 48 小时内发货，预售商品以详情页标注时间为准。
3. 如何修改收货地址？
答：订单发货前可在'我的订单'→'修改地址'中操作；已发货需联系快递拦截。

ecommerce-ai-chatbot/
├── app.py
├── api_client.py
├── .env
├── requirements.txt
└── templates/
    └── index.html

flask==2.3.3
requests==2.31.0
python-dotenv==1.0.0

import os
import requests
from dotenv import load_dotenv

user_history = {}  # 内存字典，key 为 user_id
load_dotenv()

class ApiClient:
    def __init__(self):
        self.api_key = os.getenv("AIONLY_API_KEY")
        self.chat_url = os.getenv("AIONLY_CHAT_URL")
        self.headers = {
            "Authorization": f"Bearer {self.api_key}",
            "Content-Type": "application/json"
        }

    def retrieve_knowledge(self, user_message):
        pass  # 融合模式 Key 会自动检索，不用另写

    def get_chat_reply(self, user_id, user_message):
        if user_id not in user_history:
            user_history[user_id] = []
        messages = [{"role": "system", "content": "你是电商 AI 客服，请根据知识库内容和用户问题，友好地回答。"}] + user_history[user_id] + [{"role": "user", "content": user_message}]
        payload = {
            "model": "gpt-5",
            "messages": messages,
            "temperature": 0.6,
            "max_tokens": 1024
        }
        try:
            response = requests.post(self.chat_url, headers=self.headers, json=payload)
            response.raise_for_status()
            result = response.json()
            reply = result["choices"][0]["message"]["content"]
            token_used = result["usage"]["total_tokens"]
            # 更新历史，保留最近 20 条（10 轮）
            user_history[user_id].append({"role": "user", "content": user_message})
            user_history[user_id].append({"role": "assistant", "content": reply})
            user_history[user_id] = user_history[user_id][-20:]
            return {
                "success": True,
                "reply": reply,
                "token_used": token_used
            }
        except Exception as e:
            error_msg = str(e)
            if 'response' in locals():
                error_msg += f" | {response.json()}"
            return {"success": False, "error": error_msg}

    def clear_user_history(self, user_id):
        if user_id in user_history:
            del user_history[user_id]

from flask import Flask, request, jsonify, render_template
import uuid
from api_client import ApiClient

app = Flask(__name__)
ai_client = ApiClient()

@app.route("/")
def index():
    return render_template("index.html")

@app.route("/api/chat", methods=["POST"])
def chat():
    data = request.json
    user_id = data.get("user_id")
    user_message = data.get("message", "").strip()
    if not user_id:
        user_id = str(uuid.uuid4())
    if not user_message:
        return jsonify({"success": False, "error": "请输入有效消息"})
    result = ai_client.get_chat_reply(user_id, user_message)
    result["user_id"] = user_id
    return jsonify(result)

@app.route("/api/clear-history", methods=["POST"])
def clear_history():
    data = request.json
    user_id = data.get("user_id")
    if user_id:
        ai_client.clear_user_history(user_id)
        return jsonify({"success": True})
    return jsonify({"success": False, "error": "user_id 不能为空"}), 400

if __name__ == "__main__":
    app.run(host="127.0.0.1", port=5000, debug=True)

<!DOCTYPE html>
<html lang="zh-CN">
<head>
    <meta charset="UTF-8">
    <meta name="viewport" content="width=device-width, initial-scale=1.0">
    <title>电商 AI 客服</title>
    <style>
        * { margin: 0; padding: 0; box-sizing: border-box; }
        body { font-family: Arial, sans-serif; max-width: 800px; margin: 0 auto; padding: 20px; }
        .chat-container { border: 1px solid #eee; border-radius: 8px; overflow: hidden; }
        .chat-header { background: #2272f9; color: white; padding: 16px; font-size: 18px; }
        .chat-history { height: 500px; overflow-y: auto; padding: 16px; background: #fafafa; }
        .message { margin: 8px 0; max-width: 70%; padding: 12px; border-radius: 8px; line-height: 1.5; }
        .user-message { background: #2272f9; color: white; margin-left: auto; }
        .ai-message { background: #fff; border: 1px solid #eee; margin-right: auto; }
        .system-message { color: #666; font-size: 12px; text-align: center; margin: 8px 0; }
        .input-container { display: flex; border-top: 1px solid #eee; }
        #message-input { flex: 1; padding: 12px 16px; border: none; outline: none; font-size: 14px; }
        #send-btn { padding: 0 24px; background: #2272f9; color: white; border: none; cursor: pointer; font-size: 14px; }
        #clear-btn { padding: 0 16px; background: #ff4444; color: white; border: none; cursor: pointer; font-size: 14px; }
    </style>
</head>
<body>
<div class="chat-container">
    <div class="chat-header">API 平台电商 AI 客服（7×24 小时在线）</div>
    <div class="chat-history" id="chat-history">
        <div class="system-message">欢迎咨询，我可以帮您查询订单、处理售后问题~</div>
    </div>
    <div class="input-container">
        <input type="text" id="message-input" placeholder="请输入您的问题（如：退款多久到账？）">
        <button id="clear-btn">清除历史</button>
        <button id="send-btn">发送</button>
    </div>
</div>
<script>
let userId = localStorage.getItem("ecommerce_chat_userid");
const chatHistory = document.getElementById("chat-history");
const messageInput = document.getElementById("message-input");
const sendBtn = document.getElementById("send-btn");
const clearBtn = document.getElementById("clear-btn");

function addMessage(content, isUser = false) {
    const messageDiv = document.createElement("div");
    messageDiv.className = isUser ? "message user-message" : "message ai-message";
    messageDiv.textContent = content;
    chatHistory.appendChild(messageDiv);
    chatHistory.scrollTop = chatHistory.scrollHeight;
}

async function sendMessage() {
    const message = messageInput.value.trim();
    if (!message) return;
    addMessage(message, isUser = true);
    messageInput.value = "";
    try {
        const response =  (, {
            : ,
            : { :  },
            : .({ : userId, : message })
        });
         result =  response.();
        userId = result.;
        .(, userId);
         (result.) {
            (result.);
        }  {
            (, isUser = );
        }
    }  (e) {
        (, isUser = );
    }
}

  () {
     (userId) {
         (, {
            : ,
            : { :  },
            : .({ : userId })
        });
    }
    chatHistory. = ;
    .();
    userId = ;
}

sendBtn.(, sendMessage);
messageInput.(,  {
     (e. === ) ();
});
clearBtn.(, clearHistory);
</script>
</body>
</html>

场景	效果	耗时	准确率
FAQ 匹配（'退款到账时间'）	直接返回知识库答案	<200ms	100%
多轮对话（'查订单→改地址'）	基于内存历史理解上下文	300-400ms	90%
复杂问题（'推荐性价比高的商品'）	调用 GPT-5 生成推荐	400-600ms	85%

大模型 API 实战：打造带 RAG 的电商客服机器人

平台能做什么

接入步骤

注册与开通模型

创建 API Key

安全建议

第一次调用：文本对话

更多推荐文章

相关免费在线工具

更多推荐文章

相关免费在线工具

实战：本地电商客服机器人

需求与架构

准备知识库

创建融合模式 Key

项目结构

代码实现

Windows 上运行

测试效果

局限性

最后

大模型 API 实战：打造带 RAG 的电商客服机器人

平台能做什么

接入步骤

注册与开通模型

创建 API Key

安全建议

第一次调用：文本对话

微信扫一扫，关注极客日志

更多推荐文章

相关免费在线工具

微信扫一扫，关注极客日志

更多推荐文章

相关免费在线工具

实战：本地电商客服机器人

需求与架构

准备知识库

创建融合模式 Key

项目结构

代码实现

Windows 上运行

测试效果

局限性

最后