Fine-Tuning Qwen-7B with AgentFabric to Build an Interactive Agent Application

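This article presents a technical approach to fine-tuning the Qwen-7B-Chat model within the AgentFabric framework to build an interactive agent application. To address the weak tool-calling ability of small models, it covers environment setup, dataset format conversion (from MS-Agent to AgentFabric), the LoRA fine-tuning workflow, and model deployment. A custom prompt-format conversion script, combined with real tool-call data mixed into the training set, improves the model's tool-calling accuracy, its summarization of tool results, and its stopping behavior in the AgentFabric environment. The result is a local LLM agent with reliable tool calling that can be deployed at low cost on consumer-grade hardware.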

Author: 嘘 · Published 2025/2/7 · Updated 2026/6/22

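1. Introduction

When building an LLM agent application, the choice of base model matters a great deal. Large models such as Qwen-Max (served through DashScope) or the open-source Qwen-72B usually deliver good tool-calling and role-play results. However, models of this size are hard to deploy locally on consumer-grade machines, and inference is expensive.

Small models such as Qwen-7B-Chat are easy to deploy, but out of the box they handle complex tool calls poorly. To run an agent application efficiently under resource constraints, the smaller model has to be fine-tuned for the target scenario (here, AgentFabric) so that its tool calling becomes reliable.

This article shows how to use AgentFabric's tool-calling scenario, together with dataset conversion and LoRA fine-tuning, so that a locally deployed Qwen-7B-Chat can handle tool-calling tasks well.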

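2. Environment Setup and Preparation

2.1 Base environment configuration

First, set up a Python environment and install the required dependencies. A virtual environment is recommended to avoid dependency conflicts.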

# Set a global pip mirror (speeds up downloads)
pip config set global.index-url https://mirrors.aliyun.com/pypi/simple/

# Create and activate a virtual environment
python -m venv swift_env
source swift_env/bin/activate  # Linux/Mac
# swift_env\Scripts\activate   # Windows

2.2 Installing the Swift framework

The ModelScope community provides the Swift framework, which simplifies training and deploying large models.

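# Clone the repository
git clone https://github.com/modelscope/swift.git
cd swift

# Install dependencies (quoted so that shells such as zsh do not expand the brackets)
pip install -e '.[llm]'

# Align the environment (if you hit runtime errors, try upgrading these dependencies)
pip install -r requirements/framework.txt -U
pip install -r requirements/llm.txt -U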

Make sure a CUDA driver is installed and that the PyTorch version matches the CUDA version. CUDA 11.8 or later is recommended.
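A quick way to confirm that PyTorch and CUDA agree is to ask PyTorch directly. The following is a minimal sketch, not part of the original setup; the helper name `cuda_report` is hypothetical, and it only uses standard `torch` attributes.

```python
import importlib.util


def cuda_report():
    """Return a small dict describing the local PyTorch/CUDA setup."""
    if importlib.util.find_spec("torch") is None:
        # PyTorch is not installed in this environment
        return {"torch_installed": False}
    import torch

    return {
        "torch_installed": True,
        "torch_version": torch.__version__,
        "built_for_cuda": torch.version.cuda,  # None for CPU-only builds
        "cuda_available": torch.cuda.is_available(),
        "gpu_count": torch.cuda.device_count(),
    }


if __name__ == "__main__":
    for key, value in cuda_report().items():
        print(f"{key}: {value}")
```

If `cuda_available` is false while a GPU is present, the installed PyTorch build most likely does not match the driver or CUDA version.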

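3. Data Preparation and Processing

To train agent capabilities, ModelScope provides the open-source ms_agent dataset. However, a model fine-tuned directly on this general-purpose dataset tends to perform poorly in AgentFabric. The main problems are that it does not call tools, calls them with wrong parameters, or summarizes tool results incorrectly.

3.1 Problem analysis

The MS-Agent dataset's prompt format follows the standard ReAct pattern, whereas AgentFabric organizes its prompts around role play and application scenarios. The two differ significantly:

MS-Agent format example: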

Answer the following questions as best you can. You have access to the following APIs:
1. fire_recognition: Call this tool...
Use the following format:
Thought: ...


AgentFabric format example (the prompt is written in Chinese and is reproduced verbatim):

# 工具
## 你拥有如下工具：
amap_weather: amap_weather API。获取对应城市的天气数据...
## 当你需要调用工具时，请在你的回复中穿插如下的工具调用命令...
Action: 工具的名称...
Action Input: 工具的输入...
# 指令
你扮演一个天气预报助手...
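To make the `Action` / `Action Input` convention concrete, here is a minimal sketch of how a reply in this format could be parsed. It is an illustration only, not AgentFabric's own parser: the function name `parse_tool_call` is hypothetical, and the sketch assumes the arguments are a single-line JSON object.

```python
import json
import re

# Assumed reply shape, based on the prompt format: a line "Action: <tool name>"
# immediately followed by a line "Action Input: <JSON arguments>".
TOOL_CALL = re.compile(r"Action:\s*(?P<name>[^\n]+)\nAction Input:\s*(?P<args>[^\n]+)")


def parse_tool_call(reply, allowed_tools):
    """Return (tool_name, arguments) or None if no valid tool call is found."""
    match = TOOL_CALL.search(reply)
    if match is None:
        return None
    name = match.group("name").strip()
    if name not in allowed_tools:
        # The prompt requires the tool name to be one of the listed tools
        return None
    try:
        args = json.loads(match.group("args"))
    except json.JSONDecodeError:
        return None
    return name, args


if __name__ == "__main__":
    sample = 'Action: amap_weather\nAction Input: {"location": "朝阳区"}\n'
    print(parse_tool_call(sample, ["amap_weather"]))
```

The failure modes listed earlier (no tool call, wrong tool name, malformed arguments) each map to one of the `None` branches, which makes a parser like this a convenient check when evaluating a fine-tuned model.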

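3.2 Dataset conversion script

The following script rewrites the system prompt of each MS-Agent record into the AgentFabric format.

import json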
import re

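# Header of the AgentFabric tool section. The prompt text stays in Chinese
# so that it matches the AgentFabric prompt format.
sys_prefix = "\n# 工具\n\n## 你拥有如下工具：\n\n"

def _process_system(text):
    """Convert an MS-Agent (ReAct) system prompt into the AgentFabric format."""
    # Each API entry runs from "<n>." to the "}" that closes its parameter list.
    # The lookbehind only handles single-digit numbering (1 to 9).
    api_pattern = r"(?<=\n\d\.)(.*?})(?=])"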
    apis = re.findall(api_pattern, text, re.DOTALL)
    sys_prompt = sys_prefix
    func_names = []
    
    for api in apis:
        func_name = re.search(r'(.*?):', api).group(1).strip()
        func_names.append(func_name)
        api_name = re.search(r'(\S+)\sAPI', api).group(1)
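        # "\??" matches the literal question mark in "... useful for? <description>."
        # (the unescaped "?" only made the "r" optional and could not match it)
        api_desc = re.search(r'useful for\??\s(.*?)\.', api).group(1)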
        
        sys_prompt += f"{func_name}: {api_name} API。{api_desc}" + "输入参数：{\"type\": \"object\", \"properties\": {"
        paras = re.findall(r"Parameters: \[({.*})", api, re.DOTALL)
        required_paras = []
        
        for para in paras:
            para_name = re.search(r'"name": \"(.*?)"', para).group(1)
            desc = re.search(r'"description": \"(.*?)"', para).group(1)
            if re.search(r'"required": \"(.*)"', para).group(1).strip().lower() == "true":
                required_paras.append(para_name)
            sys_prompt += f'"{para_name}": {{"type": "string", "description": "{desc}"}}'
        
        sys_prompt += "},\"required\": " + json.dumps(required_paras) + "} Format the arguments as a JSON object." + "\n\n"
    
    func_names = json.dumps(func_names)
    sys_prompt += f"## 当你需要调用工具时，请在你的回复中穿插如下的工具调用命令，可以根据需求调用零次或多次：\n\n工具调用\nAction: 工具的名称，必须是{func_names}之一\nAction Input: 工具的输入\nObservation: <result>工具返回的结果</result>\nAnswer: 根据 Observation 总结本次工具调用返回的结果，如果结果中出现 url，请使用如下格式展示出来：![图片](url)\n\n\n# 指令\n\n你扮演 AI-Agent，\n你具有下列具体功能：\n下面你将开始扮演\n\n请注意：你具有图像和视频的展示能力，也具有运行代码的能力，不要在回复中说你做不到。\n"
    return sys_prompt

# Read the original data
jsonl_file_path = 'ms_agent/train_agent_react.jsonl'
target_file_path = 'new_ms_agent.jsonl'

modified_data = []
with open(jsonl_file_path, 'r', encoding='utf-8') as file:
    for line in file:
        json_obj = json.loads(line)
        system_prompt = json_obj["conversations"][0]["value"]
        json_obj["conversations"][0]["value"] = _process_system(system_prompt)
        modified_data.append(json_obj)

# Write the converted records to a new file
with open(target_file_path, 'w', encoding='utf-8') as file:
    for json_obj in modified_data:
        file.write(json.dumps(json_obj, ensure_ascii=False) + '\n')
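After conversion it is worth confirming that every record parses and that the system prompt was actually rewritten. The sketch below is one way to do that; the helper name `check_converted` is hypothetical, and the check assumes a converted system prompt contains the `# 工具` header used by `sys_prefix`.

```python
import json


def check_converted(path, marker="# 工具"):
    """Return (total records, records whose system prompt contains the marker)."""
    total = 0
    with_marker = 0
    with open(path, "r", encoding="utf-8") as handle:
        for line in handle:
            if not line.strip():
                continue  # tolerate blank lines
            record = json.loads(line)
            total += 1
            if marker in record["conversations"][0]["value"]:
                with_marker += 1
    return total, with_marker


# Example usage (the path is the output file written by the script):
# total, converted = check_converted("new_ms_agent.jsonl")
# print(f"{converted}/{total} records carry the AgentFabric tool header")
```

If the two numbers differ, some records kept their original prompt and should be inspected before training.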

4. Fine-tuning Workflow

4.1 Training configuration

# Enter the example directory
cd examples/pytorch/llm

# Set environment variables
export PYTHONPATH=../../..
nproc_per_node=8

# "expr 1 / $nproc_per_node" is integer division and yields 0 for more than
# one process, so clamp the gradient accumulation steps to at least 1
grad_accum=$(( 1 / nproc_per_node ))
[ "$grad_accum" -ge 1 ] || grad_accum=1

# Start training (runs in the background via nohup)
nohup torchrun \
    --nproc_per_node=$nproc_per_node \
    --master_port 29500 \
    llm_sft.py \
    --model_id_or_path qwen/Qwen-7B-Chat \
    --model_revision master \
    --sft_type lora \
    --tuner_backend swift \
    --dtype AUTO \
    --output_dir output \
    --custom_train_dataset_path ms_agent_for_agentfabric/new_ms_agent.jsonl ms_agent_for_agentfabric/addition.jsonl \
    --train_dataset_mix_ratio 2.0 \
    --train_dataset_sample -1 \
    --num_train_epochs 2 \
    --max_length 2048 \
    --check_dataset_strategy warning \
    --lora_rank 8 \
    --lora_alpha 32 \
    --lora_dropout_p 0.05 \
    --lora_target_modules ALL \
    --self_cognition_sample 3000 \
    --model_name 卡卡罗特 \
    --model_author 陶白白 \
    --gradient_checkpointing true \
    --batch_size 2 \
    --weight_decay 0.01 \
    --learning_rate 5e-5 \
    --gradient_accumulation_steps $grad_accum \
    --max_grad_norm 0.5 \
    --warmup_ratio 0.03 \
    --eval_steps 100 \
    --save_steps 100 \
    --save_total_limit 2 \
    --logging_steps 10 &
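Two numbers implied by this command are easy to work out by hand: the effective batch size per optimizer step and the LoRA scaling factor. The sketch below computes both, assuming that `--batch_size` is the per-process batch size and that the standard LoRA scaling of alpha divided by rank applies; the function names are hypothetical.

```python
def effective_batch_size(per_device_batch, num_processes, grad_accum_steps):
    """Samples that contribute to one optimizer step across all processes."""
    values = {
        "per_device_batch": per_device_batch,
        "num_processes": num_processes,
        "grad_accum_steps": grad_accum_steps,
    }
    for name, value in values.items():
        if value < 1:
            raise ValueError(f"{name} must be at least 1, got {value}")
    return per_device_batch * num_processes * grad_accum_steps


def lora_scale(lora_alpha, lora_rank):
    """Standard LoRA scaling factor applied to the low-rank update."""
    return lora_alpha / lora_rank


if __name__ == "__main__":
    # batch_size 2, 8 processes, 1 accumulation step (assumed)
    print(effective_batch_size(2, 8, 1))
    # lora_alpha 32, lora_rank 8
    print(lora_scale(32, 8))
```

With 8 processes, a batch size of 2, and one accumulation step, each optimizer step sees 16 samples, and the LoRA update is scaled by 4. An integer division such as 1 / 8 yields 0 accumulation steps, which the helper rejects.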

When training finishes, the log reports the best checkpoint:

[INFO:swift] best_model_checkpoint: /home/workspace/swift/examples/pytorch/llm/output/qwen-7b-chat/v0-20240314-211944/checkpoint-2828
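Because training runs in the background, the checkpoint path usually has to be fished out of the log file. A minimal sketch, assuming the log contains a line of the form shown here; the helper name `find_best_checkpoint` and the log file name in the usage comment are assumptions.

```python
import re

# Matches the path that follows "best_model_checkpoint:" in a log line
CKPT_LINE = re.compile(r"best_model_checkpoint:\s*(\S+)")


def find_best_checkpoint(log_text):
    """Return the last reported best checkpoint path, or None if absent."""
    matches = CKPT_LINE.findall(log_text)
    return matches[-1] if matches else None


# Example usage (nohup writes to nohup.out by default when output is not redirected):
# with open("nohup.out", encoding="utf-8") as log:
#     print(find_best_checkpoint(log.read()))
```

The returned directory is the LoRA checkpoint to pass on to the weight-merging step.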

5. Model Deployment

5.1 Merging the LoRA weights

python tools/merge_lora_weights_to_model.py \
    --model_id_or_path /dir/to/your/base/model \
    --model_revision master \
    --ckpt_dir /dir/to/your/lora/model

5.2 Starting the service

nohup python -m vllm.entrypoints.openai.api_server \
    --model /dir/to/your/model-merged \
    --trust-remote-code &

5.3 API test

curl http://localhost:8000/v1/completions \
    -H "Content-Type: application/json" \
    -d '{"model": "/dir/to/your/model-merged", "prompt": "San Francisco is a", "max_tokens": 7, "temperature": 0}'
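The same request can be sent from Python using only the standard library, which is handy for scripted smoke tests. This is a sketch under the assumption that the server follows the OpenAI completions response shape (`choices[0].text`); the function names are hypothetical, and the model path is the placeholder used in the curl call.

```python
import json
import urllib.request


def build_completion_request(api_base, model, prompt, max_tokens=7, temperature=0):
    """Build a POST request for the OpenAI-compatible /completions endpoint."""
    payload = {
        "model": model,
        "prompt": prompt,
        "max_tokens": max_tokens,
        "temperature": temperature,
    }
    return urllib.request.Request(
        url=f"{api_base.rstrip('/')}/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def complete(api_base, model, prompt, timeout=60):
    """Send the request and return the generated text of the first choice."""
    request = build_completion_request(api_base, model, prompt)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.load(response)
    return body["choices"][0]["text"]


# Example usage (requires the server to be running):
# print(complete("http://localhost:8000/v1", "/dir/to/your/model-merged", "San Francisco is a"))
```

Note that the `model` field must match the value passed to `--model` when the server was started.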

6. Integration and Application

6.1 Quick test

from modelscope_agent.agents.role_play import RolePlay

def test_weather_role():
    role_template = '你扮演一个天气预报助手，你需要查询相应地区的天气，并调用给你的画图工具绘制一张城市的图。'
    
    llm_config = {
        "model_server": "openai",
        "model": "/dir/to/your/model-merged",
        "api_base": "http://localhost:8000/v1",
        "is_chat": True,
        "is_function_call": False,
        "support_stream": False
    }
    
    function_list = ['amap_weather']
    bot = RolePlay(function_list=function_list, llm=llm_config, instruction=role_template)
    response = bot.run('朝阳区天气怎样？')
    
    text = ''
    for chunk in response:
        text += chunk
    print(text)
    assert isinstance(text, str)

test_weather_role()

6.2 Using the model in AgentFabric

Enter the AgentFabric directory:

cd modelscope-agent/apps/agentfabric

Edit config/model_config.json and add a configuration entry for the local model:

"my-qwen-7b-chat": {
    "type": "openai",
    "model": "/dir/to/your/model-merged",
    "api_base": "http://localhost:8000/v1",
    "is_chat": true,
    "is_function_call": false,
    "support_stream": false
}
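Editing the JSON by hand works, but a small script avoids stray commas and keeps the existing entries intact. The sketch below assumes that `config/model_config.json` is a single JSON object keyed by model name, as the entry shown here suggests; the helper name `register_model` is hypothetical.

```python
import json
from pathlib import Path

# Same values as the hand-written entry for the local model
LOCAL_MODEL = {
    "type": "openai",
    "model": "/dir/to/your/model-merged",
    "api_base": "http://localhost:8000/v1",
    "is_chat": True,
    "is_function_call": False,
    "support_stream": False,
}


def register_model(config_path, name, entry):
    """Add or overwrite one model entry, preserving all other entries."""
    path = Path(config_path)
    config = json.loads(path.read_text(encoding="utf-8")) if path.exists() else {}
    config[name] = entry
    path.write_text(
        json.dumps(config, ensure_ascii=False, indent=4) + "\n", encoding="utf-8"
    )
    return config


# Example usage, run from the agentfabric directory:
# register_model("config/model_config.json", "my-qwen-7b-chat", LOCAL_MODEL)
```

Back up the original file first if it contains entries you rely on.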

Start the Gradio interface:

GRADIO_SERVER_NAME=0.0.0.0 PYTHONPATH=../../ python app.py

Then open http://<server IP>:7860 in a browser to use the application.
