使用 LLaMA-Factory 训练和微调 LLaMA3 模型指南

使用 LLaMA-Factory 训练和微调 LLaMA3 模型指南 | 极客日志

git clone https://github.com/hiyouga/LLaMA-Factory.git
cd LLaMA-Factory

# 安装 unsloth
pip install "unsloth[colab-new] @ git+https://github.com/unslothai/unsloth.git"
# 安装 xformers
pip install --no-deps xformers==0.0.25
# 安装 bitsandbytes
pip install .[bitsandbytes]
# 修复 urllib3 版本冲突
pip install 'urllib3<2'

!nvidia-smi

import torch
try:
    assert torch.cuda.is_available() is True
except AssertionError:
    print("Your GPU is not setup!")

import json
%cd /notebooks/LLaMA-Factory
MODEL_NAME = "Llama-3"

with open("/notebooks/LLaMA-Factory/data/identity.json", "r", encoding="utf-8") as f:
    dataset = json.load(f)

for sample in dataset:
    sample["output"] = sample["output"].replace("MODEL_NAME", MODEL_NAME).replace("AUTHOR", "LLaMA Factory")

with open("/notebooks/LLaMA-Factory/data/identity.json", "w", encoding="utf-8") as f:
    json.dump(dataset, f, indent=2, ensure_ascii=False)

GRADIO_SHARE=1 llamafactory-cli webui

{
  "stage": "sft",
  "do_train": true,
  "model_name_or_path": "unsloth/llama-3-8b-Instruct-bnb-4bit",
  "dataset": "identity,alpaca_gpt4_en",
  "template": "llama3",
  "finetuning_type": "lora",
  "lora_target": "all",
  "output_dir": "llama3_lora",
  "per_device_train_batch_size": 2,
  "gradient_accumulation_steps": 4,
  "lr_scheduler_type": "cosine",
  "logging_steps": 10,
  "warmup_ratio": 0.1,
  "save_steps": 1000,
  "learning_rate": 5e-5,
  "num_train_epochs": 3.0,
  "max_samples": 500,
  "max_grad_norm": 1.0,
  "quantization_bit": 4,
  "loraplus_lr_ratio": 16.0,
  "use_unsloth": true,
  "fp16": true
}

llamafactory-cli train train_llama3.json

{
  "model_name_or_path": "unsloth/llama-3-8b-Instruct-bnb-4bit",
  "adapter_name_or_path": "llama3_lora",
  "finetuning_type": "lora",
  "template": "llama3",
  "quantization_bit": 4,
  "use_unsloth": true
}

llamafactory-cli chat infer_llama3.json

使用 LLaMA-Factory 训练和微调 LLaMA3 模型指南

使用 LLaMA-Factory 训练和微调 LLaMA3 模型指南

什么是模型的微调？

为什么要用 LLaMA-Factory？

LLaMA Board：统一用户界面

环境准备与安装

1. 克隆仓库

2. 安装依赖包

3. 检查 GPU 规格

4. 验证 CUDA 环境

数据集准备

启动 WebUI 进行微调

配置说明

命令行训练配置

关键参数解析

模型推理测试

最佳实践与注意事项

小结

更多推荐文章

相关免费在线工具

使用 LLaMA-Factory 训练和微调 LLaMA3 模型指南

使用 LLaMA-Factory 训练和微调 LLaMA3 模型指南

什么是模型的微调？

为什么要用 LLaMA-Factory？

LLaMA Board：统一用户界面

环境准备与安装

1. 克隆仓库

2. 安装依赖包

3. 检查 GPU 规格

4. 验证 CUDA 环境

数据集准备

启动 WebUI 进行微调

配置说明

命令行训练配置

关键参数解析

模型推理测试

最佳实践与注意事项

小结

微信扫一扫，关注极客日志

更多推荐文章

相关免费在线工具