基于 DroneVehicle 数据集的 YOLOv11 无人机车辆检测实战 | 极客日志

PythonAI算法

基于 DroneVehicle 数据集的 YOLOv11 无人机车辆检测实战

基于 DroneVehicle 数据集，详细演示了使用 YOLOv11 进行无人机视角下车辆目标检测的全流程。内容涵盖数据集介绍、图像白边预处理、COCO/VOC/YOLO 标签格式转换（含边界框越界修正）、训练集与验证集划分策略、模型训练配置及推理测试。重点解决了航拍图像预处理及标签坐标归一化的常见问题，提供了可复用的 Python 脚本与实操建议。

moshang发布于 2026/4/9更新于 2026/7/2839 浏览

关于 DroneVehicle 数据集

DroneVehicle 是由天津大学收集并标注的大型无人机航拍车辆数据集。该数据集包含 56,878 幅图像，其中一半为 RGB 可见光图像，另一半为红外图像。数据集中对五个类别进行了带有方向性边界框的丰富标注：

汽车 (car): RGB 389,779 个标注，红外 428,086 个标注
卡车 (truck): RGB 22,123 个标注，红外 25,960 个标注
公交车 (bus): RGB 15,333 个标注，红外 16,590 个标注
面包车 (van): RGB 11,935 个标注，红外 12,708 个标注
货车 (freight car): RGB 13,400 个标注，红外 17,173 个标注

数据集下载与预处理

官方仓库地址：VisDrone/DroneVehicle

在 DroneVehicle 中，为了标注图片边界上的物体，作者在每张图片的上下左右四边设置了宽度为 100 像素的白色边框，导致原始图片尺寸为 840 x 712。在训练检测网络前，建议进行预处理，去除周围的白色边框，并将图像尺寸调整为 640 x 512。

文章配图

处理前后对比效果如上所示。

去除白边脚本

下面这段 Python 代码用于批量裁剪可见光和红外图像，去除边缘白边：

import numpy as np
import cv2
import os
from tqdm import tqdm

def create_file(output_dir_vi, output_dir_ir):
    if not os.path.exists(output_dir_vi):
        os.makedirs(output_dir_vi)
    if not os.path.exists(output_dir_ir):
        os.makedirs(output_dir_ir)
    print(f'Created folder: ({output_dir_vi}); ({output_dir_ir})')

def update(input_img_path, output_img_path):
    image = cv2.imread(input_img_path)
    # 裁剪坐标为 [y0:y1, x0:x1]，去除四周 100px 白边
    cropped = image[:, :]
    cv2.imwrite(output_img_path, cropped)

dataset_dir_vi = 
output_dir_vi = 
dataset_dir_ir = 
output_dir_ir = 

create_file(output_dir_vi, output_dir_ir)

image_filenames_vi = [(os.path.join(dataset_dir_vi, x), os.path.join(output_dir_vi, x))  x  os.listdir(dataset_dir_vi)]
image_filenames_ir = [(os.path.join(dataset_dir_ir, x), os.path.join(output_dir_ir, x))  x  os.listdir(dataset_dir_ir)]

()
 path  tqdm(image_filenames_vi):
    update(path[], path[])

()
 path  tqdm(image_filenames_ir):
    update(path[], path[])

相关免费在线工具

加密/解密文本
使用加密算法（如AES、TripleDES、Rabbit或RC4）加密和解密文本明文。在线工具，加密/解密文本在线工具，online
RSA密钥对生成器
生成新的随机RSA私钥和公钥pem证书。在线工具，RSA密钥对生成器在线工具，online
Mermaid 预览与可视化编辑
基于 Mermaid.js 实时预览流程图、时序图等图表，支持源码编辑与即时渲染。在线工具，Mermaid 预览与可视化编辑在线工具，online
随机西班牙地址生成器
随机生成西班牙地址（支持马德里、加泰罗尼亚、安达卢西亚、瓦伦西亚筛选），支持数量快捷选择、显示全部与下载。在线工具，随机西班牙地址生成器在线工具，online
Gemini 图片去水印
基于开源反向 Alpha 混合算法去除 Gemini/Nano Banana 图片水印，支持批量处理与下载。在线工具，Gemini 图片去水印在线工具，online
curl 转代码
解析常见 curl 参数并生成 fetch、axios、PHP curl 或 Python requests 示例代码。在线工具，curl 转代码在线工具，online

import xml.etree.ElementTree as ET
import shutil
import os
import imagesize

object = 'datasets'
if os.path.exists("./%s/labels/" % object):
    shutil.rmtree("./%s/labels/" % object)
os.makedirs("./%s/labels/" % object)

sets = ['train', 'val']
classes = ["car", "truck", "bus", "van", "freight_car"]

def convert(size, box):
    dw = 1. / size[0]
    dh = 1. / size[1]
    x = (box[0] + box[1]) / 2.0
    y = (box[2] + box[3]) / 2.0
    w = box[1] - box[0]
    h = box[3] - box[2]
    x = x * dw
    w = w * dw
    y = y * dh
    h = h * dh
    return (x, y, w, h)

def convert_annotation(image_id):
    in_file = open('./%s/xml/%s.xml' % (object, image_id))
    out_file = open('./%s/labels/%s.txt' % (object, image_id), 'w')
    image_file = open('./%s/images/%s.jpg' % (object, image_id))
    
    tree = ET.parse(in_file)
    root = tree.getroot()
    size = root.find('size')
    
    # 使用真实图片宽高替换 XML 中的可能错误信息
    w, h = imagesize.get(image_file.name)
    
    for obj in root.iter('object'):
        difficult = obj.find('difficult').text
        cls = obj.find('name').text
        if cls not in classes or int(difficult) == 1:
            continue
        cls_id = classes.index(cls)
        xmlbox = obj.find('bndbox')
        xmin = float(xmlbox.find('xmin').text)
        xmax = float(xmlbox.find('xmax').text)
        ymin = float(xmlbox.find('ymin').text)
        ymax = float(xmlbox.find('ymax').text)
        
        # 边界框越界修正
        xmin = xmin if xmin >= 0 else 0.0
        xmax = xmax if xmax <= w else float(w)
        ymin = ymin if ymin >= 0 else 0.0
        ymax = ymax if ymax <= h else float(h)
        
        b = (xmin, xmax, ymin, ymax)
        bb = convert((w, h), b)
        out_file.write(str(cls_id) + " " + " ".join([str(a) for a in bb]) + '\n')

for image_set in sets:
    if not os.path.exists('./%s/labels/' % object):
        os.makedirs('./%s/labels/' % object)
    image_ids = open('./%s/ImageSets/%s.txt' % (object, image_set)).read().strip().split()
    list_file = open('./%s/%s.txt' % (object, image_set), 'w')
    for image_id in image_ids:
        list_file.write('./images/%s.jpg\n' % (image_id))
        convert_annotation(image_id)
    list_file.close()

基于 DroneVehicle 数据集的 YOLOv11 无人机车辆检测实战

关于 DroneVehicle 数据集

数据集下载与预处理

去除白边脚本

更多推荐文章

相关免费在线工具

制作 YOLO 目标检测所需的数据集文件

标签格式转换

数据集划分策略

模型训练

模型预测

结语与注意事项

更多推荐文章

相关免费在线工具

基于 DroneVehicle 数据集的 YOLOv11 无人机车辆检测实战

关于 DroneVehicle 数据集

数据集下载与预处理

去除白边脚本

微信扫一扫，关注极客日志

更多推荐文章

相关免费在线工具

制作 YOLO 目标检测所需的数据集文件

标签格式转换

数据集划分策略

模型训练

模型预测

结语与注意事项

微信扫一扫，关注极客日志

更多推荐文章

相关免费在线工具