YOLO11 基于 DroneVehicle 数据集的无人机车辆检测实战 | 极客日志

PythonAI算法

YOLO11 基于 DroneVehicle 数据集的无人机车辆检测实战

YOLO11 无人机车辆检测实战流程涵盖 DroneVehicle 数据集预处理、标签格式转换及模型训练。主要步骤包括去除原始图片白边并调整尺寸至 640x512，将 COCO 标签经 VOC 转为 YOLO 格式，重点解决边缘框坐标越界问题。通过合并验证集与测试集构建训练集，使用 YOLO11s 权重训练 100 个 epoch。实测表明垂直视角下检测效果良好，但斜视及红外融合场景仍需进一步验证。

热情发布于 2026/4/9更新于 2026/7/426 浏览

1. 数据集简介

DroneVehicle 是由天津大学收集并标注的大型无人机航拍车辆数据集。该数据集包含 56,878 幅图像，其中一半为 RGB 图像，另一半为红外图像。我们对五个类别进行了带有方向性边界框的丰富标注：

Car (汽车): RGB 389,779 个标注，红外 428,086 个
Truck (卡车): RGB 22,123 个标注，红外 25,960 个
Bus (公交车): RGB 15,333 个标注，红外 16,590 个
Van (面包车): RGB 11,935 个标注，红外 12,708 个
Freight Car (货车): RGB 13,400 个标注，红外 17,173 个

2. 数据预处理

原始图片四周有宽度为 100 像素的白色边框，导致下载的图片尺寸为 840 x 712。为了适配 YOLO 训练要求，我们需要去除白边并将图像尺寸调整为 640 x 512。

文章配图

处理前后的对比效果如上所示。下面是去除白边的 Python 脚本，逻辑很简单，直接裁剪坐标区域即可：

import numpy as np
import cv2
import os
from tqdm import tqdm

def create_file(output_dir_vi, output_dir_ir):
    if not os.path.exists(output_dir_vi):
        os.makedirs(output_dir_vi)
    if not os.path.exists(output_dir_ir):
        os.makedirs(output_dir_ir)
    print(f'Created folder: ({output_dir_vi}); ({output_dir_ir})')

def update(input_img_path, output_img_path):
    image = cv2.imread(input_img_path)
    # 裁剪坐标为 [y0:y1, x0:x1]
    cropped = image[100:612, 100:740]
    cv2.imwrite(output_img_path, cropped)

dataset_dir_vi = r'valimg'
output_dir_vi = 
dataset_dir_ir = 
output_dir_ir = 

create_file(output_dir_vi, output_dir_ir)

image_filenames_vi = [(os.path.join(dataset_dir_vi, x), os.path.join(output_dir_vi, x))  x  os.listdir(dataset_dir_vi)]
image_filenames_ir = [(os.path.join(dataset_dir_ir, x), os.path.join(output_dir_ir, x))  x  os.listdir(dataset_dir_ir)]

()
 path  tqdm(image_filenames_vi):
    update(path[], path[])

()
 path  tqdm(image_filenames_ir):
    update(path[], path[])

相关免费在线工具

加密/解密文本
使用加密算法（如AES、TripleDES、Rabbit或RC4）加密和解密文本明文。在线工具，加密/解密文本在线工具，online
RSA密钥对生成器
生成新的随机RSA私钥和公钥pem证书。在线工具，RSA密钥对生成器在线工具，online
Mermaid 预览与可视化编辑
基于 Mermaid.js 实时预览流程图、时序图等图表，支持源码编辑与即时渲染。在线工具，Mermaid 预览与可视化编辑在线工具，online
随机西班牙地址生成器
随机生成西班牙地址（支持马德里、加泰罗尼亚、安达卢西亚、瓦伦西亚筛选），支持数量快捷选择、显示全部与下载。在线工具，随机西班牙地址生成器在线工具，online
Gemini 图片去水印
基于开源反向 Alpha 混合算法去除 Gemini/Nano Banana 图片水印，支持批量处理与下载。在线工具，Gemini 图片去水印在线工具，online
curl 转代码
解析常见 curl 参数并生成 fetch、axios、PHP curl 或 Python requests 示例代码。在线工具，curl 转代码在线工具，online

import xml.etree.ElementTree as ET
import shutil
import os
import imagesize

object = 'datasets'

if os.path.exists("./%s/labels/" % object):
    shutil.rmtree("./%s/labels/" % object)
os.makedirs("./%s/labels/" % object)

sets = ['train', 'val']
classes = ["car", "truck", "bus", "van", "freight_car"]

def convert(size, box):
    dw = 1. / size[0]
    dh = 1. / size[1]
    x = (box[0] + box[1]) / 2.0
    y = (box[2] + box[3]) / 2.0
    w = box[1] - box[0]
    h = box[3] - box[2]
    x = x * dw
    w = w * dw
    y = y * dh
    h = h * dh
    return (x, y, w, h)

def convert_annotation(image_id):
    in_file = open('./%s/xml/%s.xml' % (object, image_id))
    out_file = open('./%s/labels/%s.txt' % (object, image_id), 'w')
    image_file = open('./%s/images/%s.jpg' % (object, image_id))
    
    tree = ET.parse(in_file)
    root = tree.getroot()
    size = root.find('size')
    
    # 这里的 width 和 height 在 Autolabelimg 下自动标注可能会被修改，需替换成图片的真实宽高
    w, h = imagesize.get(image_file.name)
    
    for obj in root.iter('object'):
        difficult = obj.find('difficult').text
        cls = obj.find('name').text
        if cls not in classes or int(difficult) == 1:
            continue
        cls_id = classes.index(cls)
        xmlbox = obj.find('bndbox')
        xmin = float(xmlbox.find('xmin').text)
        xmax = float(xmlbox.find('xmax').text)
        ymin = float(xmlbox.find('ymin').text)
        ymax = float(xmlbox.find('ymax').text)
        
        # 处理越界坐标
        xmin = xmin if xmin >= 0 else 0.0
        xmax = xmax if xmax <= w else float(w)
        ymin = ymin if ymin >= 0 else 0.0
        ymax = ymax if ymax <= h else float(h)
        
        b = (xmin, xmax, ymin, ymax)
        bb = convert((w, h), b)
        out_file.write(str(cls_id) + " " + " ".join([str(a) for a in bb]) + '\n')

for image_set in sets:
    if not os.path.exists('./%s/labels/' % object):
        os.makedirs('./%s/labels/' % object)
    image_ids = open('./%s/ImageSets/%s.txt' % (object, image_set)).read().strip().split()
    list_file = open('./%s/%s.txt' % (object, image_set), 'w')
    for image_id in image_ids:
        list_file.write('./images/%s.jpg\n' % (image_id))
        convert_annotation(image_id)
    list_file.close()

YOLO11 基于 DroneVehicle 数据集的无人机车辆检测实战

1. 数据集简介

2. 数据预处理

更多推荐文章

相关免费在线工具

3. 标签格式转换

3.1 获取 COCO 格式标签

3.2 转为 VOC 格式

3.3 转为 YOLO 格式

3.4 数据集划分

4. 模型训练

5. 推理预测

6. 总结与注意事项

更多推荐文章

相关免费在线工具

YOLO11 基于 DroneVehicle 数据集的无人机车辆检测实战

1. 数据集简介

2. 数据预处理

微信扫一扫，关注极客日志

更多推荐文章

相关免费在线工具

3. 标签格式转换

3.1 获取 COCO 格式标签

3.2 转为 VOC 格式

3.3 转为 YOLO 格式

3.4 数据集划分

4. 模型训练

5. 推理预测

6. 总结与注意事项

微信扫一扫，关注极客日志

更多推荐文章

相关免费在线工具