Core Algorithms for Large-Model Training: Loss Functions Explained | 极客日志

import torch
import torch.nn as nn

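# Define the cross-entropy loss function
criterion = nn.CrossEntropyLoss()

# Simulate model outputs and ground-truth labels
outputs = torch.randn(5, 3)  # 5 samples, 3 classes (raw logits, not probabilities)
labels = torch.tensor([0, 1, 2, 0, 1])  # ground-truth class indices

# Compute the loss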
loss = criterion(outputs, labels)
print(f"Loss: {loss.item()}")
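
To make it clearer what the cross-entropy call above computes, here is a minimal sketch that rebuilds the same value by hand: log-softmax over the class dimension, selection of the true-class log-probability, then the negated batch mean. The names `logits`, `log_probs`, `picked`, and `manual_loss` are illustrative and not from the original, and the fixed seed is only there to make the run repeatable.

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)  # assumption: fixed seed, only for a repeatable run

logits = torch.randn(5, 3)              # 5 samples, 3 classes (raw scores)
labels = torch.tensor([0, 1, 2, 0, 1])  # ground-truth class indices

# Step 1: log-softmax turns raw scores into per-sample log-probabilities
log_probs = F.log_softmax(logits, dim=1)

# Step 2: pick the log-probability assigned to each sample's true class
picked = log_probs[torch.arange(labels.shape[0]), labels]

# Step 3: negate and average over the batch (the default "mean" reduction)
manual_loss = -picked.mean()

# Compare against the built-in functional form of the same loss
builtin_loss = F.cross_entropy(logits, labels)
matches = torch.allclose(manual_loss, builtin_loss)
print(f"manual={manual_loss.item():.6f} builtin={builtin_loss.item():.6f} match={matches}")
```

Because the softmax is applied inside the loss, the model should pass raw logits rather than already-normalized probabilities.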