LLaMA-Factory DeepSeek-R1 Model Fine-Tuning: A Basic Tutorial

LLaMA-Factory Model Fine-Tuning: A Basic Tutorial

LLaMA-Factory Overview

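Fine-tuning with LLaMA-Factory has several advantages. First, it simplifies the fine-tuning workflow for large models, so users without a deep technical background can still tune and improve a model. It also supports several training methods, such as full-parameter fine-tuning and LoRA, as well as alignment approaches such as DPO and PPO, so you can pick the strategy that fits your needs.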

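LLaMA-Factory also covers the whole pipeline in one place, from fine-tuning to quantization to running the model, so there is no need to switch between tools and workflows. It supports many popular language models, including LLaMA, BLOOM, Mistral, and Baichuan, which covers a wide range of use cases.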

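For quantization, LLaMA-Factory can compress the model, reducing the compute and storage needed to run it, so the model can run smoothly even on less powerful hardware. This makes the model more accessible and lowers running costs.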

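The training process is also logged fairly thoroughly. Besides plotting the loss curve as training runs, LLaMA-Factory ships with evaluation metrics such as BLEU, which helps you monitor and evaluate model performance.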

Downloading LLaMA-Factory

GitHub: LLaMA-Factory

On the LLaMA-Factory repository page, click Code to download it; the zip archive is recommended.
After extracting the archive, note the extraction path.

Creating the Anaconda Environment

Software and Hardware Requirements

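Create a virtual environment: the official requirement is Python 3.9 or later, with 3.10 recommended.
Open a terminal.
Navigate to the path where you extracted the archive.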
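
Once the environment is activated, it can help to confirm from Python itself that the interpreter meets the version requirement stated above. The following is only a minimal sketch; the helper name python_version_ok is hypothetical and not part of LLaMA-Factory.

```python
import sys

MIN_VERSION = (3, 9)           # minimum version stated above
RECOMMENDED_VERSION = (3, 10)  # recommended version stated above


def python_version_ok(version=None):
    """Return True if a (major, minor) version meets the 3.9 minimum."""
    if version is None:
        version = sys.version_info[:2]
    return tuple(version) >= MIN_VERSION


current = sys.version_info[:2]
print("Python", ".".join(map(str, current)),
      "meets the minimum" if python_version_ok(current) else "is too old")
if python_version_ok(current) and current < RECOMMENDED_VERSION:
    print("Consider using Python 3.10, the recommended version")
```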

Installing LLaMA-Factory Dependencies

Install the dependencies: pip install -r requirements.txt
It is best to run this command as well: pip install -e ".[torch,metrics]"
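
After both commands finish, one way to see what actually landed in the environment is to query the installed package metadata. This is a rough sketch rather than an official check; the distribution names in the list are assumptions and may differ from what your install reports.

```python
from importlib import metadata


def installed_versions(packages):
    """Map each distribution name to its installed version, or None if absent."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


# Assumed distribution names; adjust them to match your environment.
for name, version in installed_versions(["llamafactory", "torch", "transformers"]).items():
    print(f"{name}: {version or 'not installed'}")
```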

CUDA Installation

conda install pytorch torchvision torchaudio pytorch-cuda=11.8 -c pytorch -c nvidia
Remember to enter y when prompted to continue the installation.
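
To check whether the installation worked, you can ask PyTorch whether it sees a CUDA device. The sketch below uses only standard PyTorch calls (torch.cuda.is_available, torch.cuda.get_device_name, torch.version.cuda); the wrapper function cuda_status is a hypothetical name chosen for this example, and it also reports cleanly when PyTorch is missing.

```python
import importlib.util


def cuda_status():
    """Describe whether PyTorch is installed and whether it can see a CUDA device."""
    if importlib.util.find_spec("torch") is None:
        return "PyTorch is not installed in this environment"
    import torch  # imported lazily so the check also runs without PyTorch

    if torch.cuda.is_available():
        device = torch.cuda.get_device_name(0)
        return f"CUDA is available: {device} (built for CUDA {torch.version.cuda})"
    return "PyTorch is installed, but no CUDA device is visible"


print(cuda_status())
```

If the last message appears on a machine with an NVIDIA GPU, a CPU-only PyTorch build or a driver mismatch is a likely cause.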

Installing BitsAndBytes for Quantization

To enable quantized LoRA (QLoRA) on Windows, you need to install a prebuilt bitsandbytes wheel. It supports CUDA 11.1 through 12.2; choose the release that matches your CUDA version.
pip install https://github.com/jllllll/bitsandbytes-windows-webui/releases/download/wheels/bitsandbytes-0.41.2.post2-py3-none-win_amd64.whl
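
The supported range quoted above can be turned into a small pre-install check. This is an illustrative sketch only: the function name wheel_supports_cuda is hypothetical, and it simply compares a major.minor version string against the 11.1 to 12.2 range given in the text.

```python
SUPPORTED_MIN = (11, 1)  # lower bound quoted above
SUPPORTED_MAX = (12, 2)  # upper bound quoted above


def wheel_supports_cuda(version):
    """Return True if a CUDA version string such as '11.8' lies within 11.1 to 12.2."""
    major, minor = (int(part) for part in version.split(".")[:2])
    return SUPPORTED_MIN <= (major, minor) <= SUPPORTED_MAX


for candidate in ("11.0", "11.8", "12.2", "12.4"):
    verdict = "supported" if wheel_supports_cuda(candidate) else "outside the stated range"
    print(f"CUDA {candidate}: {verdict}")
```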

Launching the Fine-Tuning Web UI

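Launch command: llamafactory-cli webui
If you get an error saying that localhost is not accessible, Gradio's share option is set to false, and interface.py needs to be changed.
Locate interface.py; it is usually under LLaMA-Factory-main\src\llamafactory\webui.
Find the run_web_ui() and run_web_demo() functions and change share=gradio_share to share=True.
Run the launch command again and the Web UI should start.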
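
As an alternative to editing the source, the gradio_share value in interface.py appears, in the LLaMA-Factory versions I have looked at, to be read from a GRADIO_SHARE environment variable. Treat that as an assumption and confirm it in your own copy of interface.py before relying on it. The sketch below sets the variable for a single launch; webui_env and launch_webui are hypothetical helper names. Keep in mind that sharing makes Gradio create a public link to the UI.

```python
import os
import subprocess


def webui_env(share=True, base_env=None):
    """Copy an environment mapping and set GRADIO_SHARE to "1" or "0"."""
    env = dict(os.environ if base_env is None else base_env)
    env["GRADIO_SHARE"] = "1" if share else "0"  # assumed variable name, see above
    return env


def launch_webui(share=True):
    """Run the launch command with the adjusted environment; blocks until it exits."""
    return subprocess.run(["llamafactory-cli", "webui"], env=webui_env(share), check=False)


# Usage (not executed here): launch_webui(share=True)
```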

Preparing the Dataset

Downloading the Required Tools

Export your chat history with a data extraction tool, for example a WeChat stylization tool or a similar tool such as finetune_dataset_maker.
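
The conversion script further down reads items that each carry a messages list of role/content pairs and writes records in a from/value layout. To make those two shapes concrete, here is a small in-memory sketch of one record going through the same role mapping. The sample text is invented for illustration, and to_conversation is a hypothetical helper, not part of any export tool.

```python
ROLE_MAP = {"user": "human", "assistant": "gpt"}  # same mapping the script uses

# Invented sample in the role/content shape the script expects as input.
sample_item = {
    "messages": [
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Are you free tonight?"},
        {"role": "assistant", "content": "Yes, let's have dinner."},
    ]
}


def to_conversation(item):
    """Convert one item to the from/value layout; roles outside ROLE_MAP are dropped."""
    turns = [
        {"from": ROLE_MAP[message["role"]], "value": message["content"]}
        for message in item["messages"]
        if message["role"] in ROLE_MAP
    ]
    return {"conversations": turns}


print(to_conversation(sample_item))
```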

Reference paper:

import json
import re

# Read merged_data.json
with open('merged_data.json', 'r', encoding='utf-8') as file:
    data = json.load(file)

# Records in the converted format
converted_data = []


# Data cleaning: drop empty messages, remove stray whitespace, normalize the format.
# Note: this helper is defined for optional use; the main loop below does not call it.
def clean_data(dataset):
    cleaned_data = []
    for example in dataset:
        messages = example['messages']
        cleaned_messages = []
        for message in messages:
            # Skip messages whose content is empty
            if not message['content'].strip():
                continue
            # Replace newlines with spaces and trim surrounding whitespace
            message['content'] = message['content'].replace("\n", " ").strip()
            cleaned_messages.append(message)
        if cleaned_messages:
            cleaned_data.append({'messages': cleaned_messages})
    return cleaned_data


# Masking: replace sensitive information such as phone numbers and emails
def replace_sensitive_info(text):
    text = re.sub(r'\d{3}[-]?\d{4}[-]?\d{4}', '[PHONE_NUMBER]', text)  # phone numbers
    text = re.sub(r'\S+@\S+', '[EMAIL]', text)  # email addresses
    text = re.sub(r'\d{4}-\d{2}-\d{2}', '[DATE]', text)  # dates
    return text


# Anonymization: rewrite the user label and mask sensitive information.
# Note: like clean_data, this helper is not called by the main loop below.
def anonymize_data(dataset):
    anonymized_data = []
    for example in dataset:
        messages = example['messages']
        anonymized_messages = []
        for message in messages:
            # Anonymize the user label (the literal "用户" means "user")
            if message['role'] == 'user':
                message['content'] = message['content'].replace("用户", "用户 X")
            # Mask sensitive information
            message['content'] = replace_sensitive_info(message['content'])
            anonymized_messages.append(message)
        anonymized_data.append({'messages': anonymized_messages})
    return anonymized_data


# Process every conversation
for item_list in data:
    for item in item_list:
        # Skip items that have no 'messages' field
        if 'messages' not in item:
            print("Skipped: no 'messages' field found")
            continue

        print(f"Processing item: {item}")

        conversation = {"conversations": []}

        for message in item['messages']:
            role = message['role']
            content = message['content']
            print(f"Processing message: role={role}, content={content}")

            # Mask sensitive information
            content = replace_sensitive_info(content)

            # Map role to the "from" field
            if role == "system":
                continue  # ignore system messages
            elif role == "user":
                from_role = "human"
            elif role == "assistant":
                from_role = "gpt"
            else:
                continue  # skip unknown roles so from_role is never stale or undefined

            # Append the converted message
            conversation['conversations'].append({"from": from_role, "value": content})

        # Add the converted conversation to the final result
        converted_data.append(conversation)

# Save the converted data to a new file
with open('converted_data.json', 'w', encoding='utf-8') as file:
    json.dump(converted_data, file, ensure_ascii=False, indent=2)

print("Conversion finished; the result was saved as converted_data.json")
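
Exported chat logs often contain several messages in a row from the same person, so the converted records may not strictly alternate between human and gpt. ShareGPT-style loaders, including the one in LLaMA-Factory as far as I am aware, generally expect that alternation; treat this as an assumption to confirm against the version you use. The sketch below flags records that break it. The function name find_problems and the sample records are hypothetical.

```python
def find_problems(record):
    """Return a list of human-readable problems found in one converted record."""
    problems = []
    turns = record.get("conversations", [])
    if not turns:
        problems.append("no turns")
    for index, turn in enumerate(turns):
        # Even positions are expected to be human, odd positions gpt.
        expected = "human" if index % 2 == 0 else "gpt"
        if turn.get("from") != expected:
            problems.append(f"turn {index}: expected {expected}, got {turn.get('from')}")
        if not str(turn.get("value", "")).strip():
            problems.append(f"turn {index}: empty value")
    if len(turns) % 2 != 0:
        problems.append("odd number of turns")
    return problems


good = {"conversations": [{"from": "human", "value": "hi"},
                          {"from": "gpt", "value": "hello"}]}
bad = {"conversations": [{"from": "human", "value": "hi"},
                         {"from": "human", "value": "are you there?"}]}

for name, record in (("good", good), ("bad", bad)):
    print(name, find_problems(record))
```

To check a real file, load converted_data.json with json.load and call find_problems on each record; merging consecutive same-speaker turns before training is one possible fix.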
