基于 Qwen2.5 与 LLaMA-Factory 的 LoRA 微调实战

本文将详细介绍如何在 Windows 环境下（单卡 8G 显存），利用 LLaMA-Factory 框架对 Qwen2.5-1.5B 模型进行 LoRA 微调，并通过 Ollama 实现本地部署。

⚠️ 实验环境警告 本教程涉及 PyTorch、ModelScope 及 LLaMA-Factory 等多个深度学习框架，依赖关系较为复杂。为了避免污染您的系统 Python 环境或引发版本冲突，强烈建议在 Anaconda / Miniconda 虚拟环境中进行本实验。

1. 准备数据集 (Data Preparation)

微调的第一步是获取原始指令数据。本项目通过两种方式获取数据，并利用 Python 脚本进行人设注入，将通用数据转化为 Elaine 的专属训练语料。

1.1 下载原始数据集（两种方式）

方式 A：手动下载 (Manual Download)

在文件列表中找到 alpaca_zh.json，手动点击下载按钮。
将下载的文件保存至 D:\Code\LoRA\yuki_identity_sft\ 目录下。

方式 B：自动化下载（推荐）

使用 modelscope 库自动获取数据集，适合自动化工作流：

def download_dataset():
    # 获取当前工作目录
    current_dir = os.getcwd()
    # 建议下载到一个专门的子目录中，例如 'dataset'
    target_dir = os.path.join(current_dir, 'yuki_identity_sft')
    if not os.path.exists(target_dir):
        os.makedirs(target_dir)
    print(f"正在下载数据集到：{target_dir}")
    # 使用 subprocess 调用 modelscope 命令，并指定 --local_dir 为目标子目录
    result = subprocess.run(['modelscope', 'download', '--dataset', 'DanKe123abc/yuki_identity_sft', '--local_dir', target_dir], capture_output=True, text=True)

1.2 预处理与人物替换 (Preprocessing & Identity Swap)

下载完成后，必须运行预处理脚本。该脚本会遍历所有对话条目，将原有的助手名称（如'通义千问'、'机器人'）及开发商（如'阿里巴巴'）替换为 Elaine 和 DanKe。

核心预处理脚本 (preprocess.py):

 ():
    
    old_jsonl = os.path.join(target_dir, )
    new_jsonl = os.path.join(target_dir, )
    info_file = os.path.join(target_dir, )
    
    
     os.path.exists(old_jsonl):
        ()
         (old_jsonl, , encoding=)  f_in, \
             (new_jsonl, , encoding=)  f_out:
             line  f_in:
                
                updated_line = line.replace(old_name.capitalize(), new_name.capitalize())
                updated_line = updated_line.replace(old_name.lower(), new_name.lower())
                f_out.write(updated_line)
        os.remove(old_jsonl) 
        ()

基于 Qwen2.5 与 LLaMA-Factory 的 LoRA 微调实战

本文将详细介绍如何在 Windows 环境下（单卡 8G 显存），利用 LLaMA-Factory 框架对 Qwen2.5-1.5B 模型进行 LoRA 微调，并通过 Ollama 实现本地部署。

⚠️ 实验环境警告 本教程涉及 PyTorch、ModelScope 及 LLaMA-Factory 等多个深度学习框架，依赖关系较为复杂。为了避免污染您的系统 Python 环境或引发版本冲突，强烈建议在 Anaconda / Miniconda 虚拟环境中进行本实验。

1. 准备数据集 (Data Preparation)

微调的第一步是获取原始指令数据。本项目通过两种方式获取数据，并利用 Python 脚本进行人设注入，将通用数据转化为 Elaine 的专属训练语料。

1.1 下载原始数据集（两种方式）

方式 A：手动下载 (Manual Download)

在文件列表中找到 alpaca_zh.json，手动点击下载按钮。
将下载的文件保存至 D:\Code\LoRA\yuki_identity_sft\ 目录下。

方式 B：自动化下载（推荐）

使用 modelscope 库自动获取数据集，适合自动化工作流：

def download_dataset():
    # 获取当前工作目录
    current_dir = os.getcwd()
    # 建议下载到一个专门的子目录中，例如 'dataset'
    target_dir = os.path.join(current_dir, 'yuki_identity_sft')
    if not os.path.exists(target_dir):
        os.makedirs(target_dir)
    print(f"正在下载数据集到：{target_dir}")
    # 使用 subprocess 调用 modelscope 命令，并指定 --local_dir 为目标子目录
    result = subprocess.run(['modelscope', 'download', '--dataset', 'DanKe123abc/yuki_identity_sft', '--local_dir', target_dir], capture_output=True, text=True)

1.2 预处理与人物替换 (Preprocessing & Identity Swap)

核心预处理脚本 (preprocess.py):

 ():
    
    old_jsonl = os.path.join(target_dir, )
    new_jsonl = os.path.join(target_dir, )
    info_file = os.path.join(target_dir, )
    
    
     os.path.exists(old_jsonl):
        ()
         (old_jsonl, , encoding=)  f_in, \
             (new_jsonl, , encoding=)  f_out:
             line  f_in:
                
                updated_line = line.replace(old_name.capitalize(), new_name.capitalize())
                updated_line = updated_line.replace(old_name.lower(), new_name.lower())
                f_out.write(updated_line)
        os.remove(old_jsonl) 
        ()

基于 Qwen2.5 与 LLaMA-Factory 的 LoRA 微调实战

基于 Qwen2.5 与 LLaMA-Factory 的 LoRA 微调实战

1. 准备数据集 (Data Preparation)

1.1 下载原始数据集（两种方式）

方式 A：手动下载 (Manual Download)

方式 B：自动化下载（推荐）

1.2 预处理与人物替换 (Preprocessing & Identity Swap)

基于 Qwen2.5 与 LLaMA-Factory 的 LoRA 微调实战

基于 Qwen2.5 与 LLaMA-Factory 的 LoRA 微调实战

1. 准备数据集 (Data Preparation)

1.1 下载原始数据集（两种方式）

方式 A：手动下载 (Manual Download)

方式 B：自动化下载（推荐）

1.2 预处理与人物替换 (Preprocessing & Identity Swap)

微信扫一扫，关注极客日志

更多推荐文章

相关免费在线工具

1.3 数据集注册 (Registration)

2. 下载基座模型 (Base Model Download)

方式 A：代码自动下载（推荐方式）

方式 B：手动下载（备选方式）

3. 下载工具 LLaMA-Factory (Tools Setup)

3.1 工具简介

3.2 下载与安装步骤

步骤 1：克隆源代码

步骤 2：安装核心依赖

步骤 3：验证安装

4. 修改配置文件 (Configuration)

4.1 添加数据集定义文件 (Add Dataset Info)

4.2 修改训练参数配置文件 (Modify Training Config)

4.3 关键点解释

5. 开始微调训练 (Start Training)

5.1 执行训练命令

5.2 训练过程关键指标

5.3 产出物检查

5.4 验证与对话测试 (Validation)

方式 A：官方 WebUI 验证（标准路径）

方式 B：Python 脚本流式调用（稳定路径 / 本项目采用）

验证标准 (Checklist)

6. 打包与 Ollama 部署测试 (Export & Deployment)

6.1 模型权重合并 (Export & Merge)

6.2 注册至 Ollama

6.3 最终成果验证

微信扫一扫，关注极客日志

更多推荐文章

相关免费在线工具