统计xml文件中标记框的特性

全栈程序员-站长 • 2020年11月8日下午9:33 • 未分类 • 阅读 196

统计xml文件中标记框的特性

统计xml文件中标记框的特性

使用labelimg对图片进行标记之后保存为xml文件，运行脚本统计xml文件中的标记框的特征。

代码如下：

 import os
    import sys
    filedir = os.path.dirname(sys.argv[0])      #获取脚本所在目录
    os.chdir(filedir)       #将脚本所在的目录设置为工作目录
    wdir = os.getcwd()
    print('当前工作目录：{}\n'.format(wdir))      #打印当前工作目录
    
    from xml.dom.minidom import parse
    
    def xml_parser( xml_file ):
        '''
        Parse an xml file and return the annotation info in the file
    
        :param xml_file: the xml file name to be parsed
        :return: file_name, width, height, objects.
            file_name, filename of the xml file (without extension)
            width, width of the annotated image
            height, height of the annotated image
            objects, annotated objects in the image
            object, (object_name, xmin, ymin, xmax, ymax)
                object_name, name of the annotated object
                xmin, ymin, xmax, ymax, coordinate of the bounding box of the object
    
        '''
        DOMTree = parse( xml_file )
        collection = DOMTree.documentElement #得到xml文件的根节点
        file_name_xml = collection.getElementsByTagName( 'filename' )[0]
        objects_xml = collection.getElementsByTagName( 'object' )
        size_xml = collection.getElementsByTagName( 'size' )
    
        file_name = file_name_xml.childNodes[0].data
    
        for size in size_xml:
            width = size.getElementsByTagName( 'width' )[0]
            height = size.getElementsByTagName( 'height' )[0]
    
            width = width.childNodes[0].data
            height = height.childNodes[0].data
    
        objects = []
        for object_xml in objects_xml:
            object_name = object_xml.getElementsByTagName( 'name' )[0]
            bdbox = object_xml.getElementsByTagName( 'bndbox' )[0]
            xmin = bdbox.getElementsByTagName( 'xmin' )[0]
            ymin = bdbox.getElementsByTagName( 'ymin' )[0]
            xmax = bdbox.getElementsByTagName( 'xmax' )[0]
            ymax = bdbox.getElementsByTagName( 'ymax' )[0]
    
            object = [ object_name.childNodes[0].data,
                       float(xmin.childNodes[0].data),
                       float(ymin.childNodes[0].data),
                       float(xmax.childNodes[0].data),
                       float(ymax.childNodes[0].data) ]
    
            objects.append( object )
    
        return file_name, int(width), int(height), objects
        
    image_dir = 'images'    
    xml_dir = 'labels'
    xml_files = os.listdir(xml_dir)
    image_files = os.listdir(image_dir)
    image_ext = image_files[0].split('.')[-1] #图片文件的扩展名
    print(image_ext)
    if len(image_files) == len(xml_files):
        print('共有{:d}个xml文件。'.format(len(xml_files)))
    else:
        print('图片数量和xml文件数量不一致。')
    obj_dict = {}
    for xml_file in xml_files:
        annotation = xml_parser(os.path.join(xml_dir, xml_file))
        name_1 = xml_file.split('.')[0] + '.' + image_ext.lower()
        name_2 = xml_file.split('.')[0] + '.' + image_ext.upper()
        if  name_1 not in image_files and name_2 not in image_files:
            print('{:s}没有对应的图片。'.format(xml_file))
        for obj in annotation[-1]:
            key = obj[0]
            x = (obj[1] + obj[3])/2
            y = (obj[2] + obj[4])/2
            width = obj[3] - obj[1]
            height = obj[4] - obj[2]
            box = [x,y,width,height]
            if key in obj_dict:
                obj_dict[key][0] += 1
                n = obj_dict[key][0]
                obj_dict[key][1:5] = [ (i*(n-1)+j)/(n) for i,j in zip(obj_dict[key][1:5] , box)]
                #obj_dict[key][5:9] = [ i if i>=j else j for i,j in zip(obj_dict[key][5:9] , box)]
                #obj_dict[key][9:] = [ i if i>=j else j for i,j in zip(obj_dict[key][9:] , box)]
            else:
                obj_dict[key] = []
                obj_dict[key].append(1)  # 0,个数
                obj_dict[key] += box     # 1-4， 平均坐标
                #obj_dict[key] += box     # 5-8， 最大值
                #obj_dict[key] += box     # 9-12，最小值
    for key,value in obj_dict.items():
        print('一共有 {:4d} 个 {:20s}，其边框平均位置为{:4.0f} *{:4.0f}；平均尺寸为{:3.0f} *{:3.0f}。'.format(value[0],key,*value[1:]))

版权声明：本文内容由互联网用户自发贡献，该文观点仅代表作者本人。本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如发现本站有涉嫌侵权/违法违规的内容，请联系我们举报，一经查实，本站将立刻删除。

发布者：全栈程序员-站长，转载请注明出处：https://javaforall.net/2173.html原文链接：https://javaforall.net

赞 (0)

0 0

关于作者

全栈程序员-站长

133.5K 文章

3 粉丝

本网站汇聚当前互联网主流语音，持续更新，欢迎关注公众号“全栈程序员社区”

vispy 显示 kitti 点云数据

上一篇 2020年11月8日下午9:33

串口数据读取和动态显示Tkinter+matplotlib+pyqtgraph（详细教程）

下一篇 2020年11月8日下午9:33

LoRa节点开发：5、代码详解LoRaWAN中的几种数据包（发送与接收数据）

LoRa节点开发：5、代码详解LoRaWAN中的几种数据包（发送与接收数据）本文来源微信公众号物联网思考本文主要结合 LoRaNodeSDKv 4 2 和 LoRaWAN 规范 1 0 3 来展开 1 数据包类型 LoRaWAN 规范中有不同的数据包通过 MType 字段区分 MType 是 3 位的总共可以表示 8 种不同类型的数据其中前六种是不同的数据包分别是入网请求入网回复不需要确认上行数据包需要确认上行数据包不需要确认下行数据包需要确

全栈程序员-站长
2026年3月19日
2
数据库导入sql文件_mysql导入sql文件命令

数据库导入sql文件_mysql导入sql文件命令Sql文件导入数据库-保姆级教程，铁打的保姆，希望对大家能有所帮助，不要忘了点赞+收藏哦

全栈程序员-站长
2022年10月2日
4
Android碎片化之屏幕适配

Android碎片化之屏幕适配Android 碎片化之屏幕适配现如今因 Android 系统的开放性市场上出现了不同厂商出厂的各种 android 版本分辨率型号等设备那对我们开发来说碎片化绝对是一个让人头脑炸裂的问题 Android 系统碎片化 Android 机型屏幕尺寸碎片化 Android 屏幕分辨率碎片化市面上安卓手机的主流屏幕尺寸种类繁多就算搞定了屏幕尺寸问题各种分辨率又让人眼花缭乱面对测试同学抛过来的适配问

全栈程序员-站长
2026年1月27日
3
qq邮箱日发5万邮件群发技术(qq邮箱怎样定时发送邮件)

将代码部署服务器，每日早上定时获取到天气数据，并发送到邮箱。也可以说是一个小人工智障。思路可以运用在不同地方，主要介绍的是思路。

全栈程序员-站长
2022年4月18日
703
yuv444 yuv420_硬盘转速和缓存哪个重要

yuv444 yuv420_硬盘转速和缓存哪个重要YUV420与YUV444互转，YUV420与YUV444读取和保存，YUV的显示和播放功能【尊重原创，转载请注明出处】：https://blog.csdn.net/guyuealian/article/details/82454945 OpenCV提供了RGB与YUV420/YUV444互转的接口：cvtColor()，但根尴尬OpenCV就是没有提供YUV444与YUV420互转…

全栈程序员-站长
2025年7月14日
7
基于麦克风阵列的现有声源定位技术有_高斯滤波椒盐噪声

基于麦克风阵列的现有声源定位技术有_高斯滤波椒盐噪声目前基于麦克风阵列的声源定位方法大致可以分为三类：基于最大输出功率的可控波束形成技术、基于高分辨率谱图估计技术和基于声音时间差（time-delayestimation，TDE）的声源定位技术。基

全栈程序员-站长
2022年8月3日
10

发表回复

关注全栈程序员社区公众号