Hands-On LLM Application Development: From Environment Setup to an API Service | 极客日志

python -m pip install --upgrade pip

from modelscope import snapshot_download

# Specify the model repository ID and the local cache directory
model_dir = snapshot_download(
    model_id='ZhipuAI/chatglm3-6b',
    cache_dir=r'D:\Transformers'
)
print(f"Model downloaded to: {model_dir}")
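The download step hard-codes a Windows path. As a minimal sketch of one way to make that location configurable, the helper below reads the cache directory from an environment variable and falls back to a folder in the user's home directory. The variable name `MODEL_CACHE_DIR`, the helper name `resolve_cache_dir`, and the fallback folder are assumptions made for illustration; they are not part of ModelScope.

```python
import os
from pathlib import Path


def resolve_cache_dir(env_var: str = "MODEL_CACHE_DIR") -> Path:
    """Return the model cache directory, creating it if it does not exist.

    MODEL_CACHE_DIR is a hypothetical variable name chosen for this sketch.
    When it is unset, fall back to a "Transformers" folder in the home directory.
    """
    raw = os.environ.get(env_var)
    cache_dir = Path(raw) if raw else Path.home() / "Transformers"
    cache_dir.mkdir(parents=True, exist_ok=True)
    return cache_dir
```

The result could then be passed to the download call, for example `snapshot_download(model_id='ZhipuAI/chatglm3-6b', cache_dir=str(resolve_cache_dir()))`.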

from modelscope import AutoTokenizer, AutoModel
import torch

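# Load the model and tokenizer.
# If this path does not match your setup, use the model_dir value printed by snapshot_download.
model_path = r'D:\Transformers\chatglm3-6b'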
tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)
model = AutoModel.from_pretrained(model_path, device_map="auto", trust_remote_code=True).eval()

# Generate a reply (the prompt "你好" means "Hello")
response, history = model.chat(tokenizer, "你好", history=[])
print(response)
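The call above returns both the reply and an updated `history`, which suggests how a multi-turn conversation could be carried forward. The following is a minimal sketch under that assumption: `run_conversation` is a hypothetical helper, and it relies only on the `model.chat(tokenizer, prompt, history=...)` call shape shown in the example.

```python
def run_conversation(model, tokenizer, prompts):
    """Send prompts in order, feeding each turn's history into the next call."""
    history = []
    replies = []
    for prompt in prompts:
        # model.chat returns the reply plus the updated history,
        # following the call shape used in the single-turn example.
        response, history = model.chat(tokenizer, prompt, history=history)
        replies.append(response)
    return replies, history
```

With the model and tokenizer loaded as above, `replies, history = run_conversation(model, tokenizer, ["你好", "你是谁？"])` would run two turns in the same conversation.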

conda install pytorch torchvision torchaudio cpuonly -c pytorch

pip install fastapi uvicorn

from fastapi import FastAPI
from pydantic import BaseModel
import uvicorn
from modelscope import AutoTokenizer, AutoModel

app = FastAPI()

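# Load the model once at module level so it is not reloaded on every request
model_path = r'D:\Transformers\chatglm3-6b'
tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)
model = AutoModel.from_pretrained(model_path, device_map="auto", trust_remote_code=True).eval()

class Message(BaseModel):
    text: str  # the user's prompt

@app.post("/chat")
async def chat_endpoint(message: Message):
    # Each request starts a fresh conversation (empty history); the returned history is discarded.
    # model.chat is a blocking call, so requests are effectively handled one at a time.
    response, _ = model.chat(tokenizer, message.text, history=[])
    return {"reply": response}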

if __name__ == "__main__":
    uvicorn.run(app, host="0.0.0.0", port=7866)

pip install langchain requests

import requests

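# Address of the FastAPI service started above
url = "http://localhost:7866/chat"
data = {"text": "你是谁？"}  # prompt: "Who are you?"
response = requests.post(url, json=data)
response.raise_for_status()  # fail loudly on a non-2xx status
print(response.json())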
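Loading a 6B model just to check that a client sends the right request can be slow. As a sketch of one way around that, the block below starts a small stand-in server that answers `POST /chat` with a canned reply in the same `{"reply": ...}` shape, using only the Python standard library. `StubChatHandler`, `start_stub_server`, and `post_chat` are hypothetical names introduced for this example, and the stub is not a substitute for the FastAPI service.

```python
import json
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer
from urllib import request


class StubChatHandler(BaseHTTPRequestHandler):
    """Answers POST /chat with a canned echo reply instead of calling a model."""

    def do_POST(self):
        if self.path != "/chat":
            self.send_error(404)
            return
        length = int(self.headers.get("Content-Length", 0))
        payload = json.loads(self.rfile.read(length) or b"{}")
        body = json.dumps({"reply": f"stub: {payload.get('text', '')}"}).encode("utf-8")
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, format, *args):
        pass  # keep the console quiet


def start_stub_server(port=0):
    """Start the stub on a background thread; port=0 lets the OS pick a free port."""
    server = HTTPServer(("127.0.0.1", port), StubChatHandler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    return server


def post_chat(url, text):
    """POST {"text": ...} as JSON and return the decoded JSON response."""
    data = json.dumps({"text": text}).encode("utf-8")
    req = request.Request(url, data=data, headers={"Content-Type": "application/json"})
    opener = request.build_opener(request.ProxyHandler({}))  # bypass any system proxy for local calls
    with opener.open(req, timeout=10) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

After `server = start_stub_server()`, the chosen port is available as `server.server_address[1]`, and the `requests`-based client above should work unchanged once its `url` points at that port. Call `server.shutdown()` when finished.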

pip install gradio

import gradio as gr
import requests

def predict(text):
    url = "http://localhost:7866/chat"
    try:
        res = requests.post(url, json={"text": text})
        return res.json().get("reply", "Error")
    except Exception as e:
        return f"Error: {str(e)}"

with gr.Blocks() as demo:
    gr.Markdown("# LLM Chat Demo")
    input_text = gr.Textbox(label="Input")
    output_text = gr.Textbox(label="Model reply")
    btn = gr.Button("Send")
    btn.click(predict, inputs=input_text, outputs=output_text)

demo.launch()

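Tool Preparation

1. MiniConda

2. PyCharm

Environment Setup

Creating a New Project

Downloading the Model

Using the Model

Providing API Support

Writing the Client

Summary and Outlook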
