信息通信技术与政策

信息通信技术与政策

信息通信技术与政策 ›› 2026, Vol. 52 ›› Issue (5): 50-57.doi: 10.12267/j.issn.2096-5931.2026.05.007

专题:高质量数据集 上一篇    下一篇

基于标准数据集的深度学习目标检测算法溯源技术研究*

Research on traceability techniques for deep learning algorithms based on standard datasets

胡天洋, 孙小强, 陈龙泉, 张大元   

  1. 中国信息通信研究院泰尔系统实验室, 北京 100191
  • 收稿日期:2026-03-31 出版日期:2026-05-25 发布日期:2026-05-28
  • 通讯作者: 孙小强
  • 作者简介:
    胡天洋,中国信息通信研究院泰尔系统实验室助理工程师,主要从事人工智能计量、具身智能、脑机接口等方面的研究工作
    陈龙泉,中国信息通信研究院泰尔系统实验室高级工程师,主要从事人工智能与通信计量测试等方面的研究工作
    张大元,中国信息通信研究院泰尔系统实验室正高级工程师,主要从事通信计量测试和测量仪器开发等方面的研究工作
  • 基金资助:
    * 国家重点研发计划项目(2022YFFO605903)

HU Tianyang, SUN Xiaoqiang, CHEN Longquan, ZHANG Dayuan   

  1. CTTL System Laboratory, China Academy of Information and Communications Technology, Beijing 100191, China
  • Received:2026-03-31 Online:2026-05-25 Published:2026-05-28
  • Contact: SUN Xiaoqiang

摘要:

针对深度学习目标检测算法可解释性不足导致的输出量值缺乏有效计量评价方法的难题,从计量学视角构建算法溯源技术体系。以合成孔径雷达(Synthetic Aperture Radar,SAR)图像舰船检测为典型应用场景,明确算法溯源的定义与不确定度来源,提出基于标准数据集的溯源技术路径及连续比较链;面向算法溯源的计量要求,建立标准数据集的质量评估指标体系与标准化测试方法。研究实现了SAR图像舰船检测算法输出的定量评价与量值溯源,可为人工智能算法可信评测提供基准支撑,对推动标准化评测体系建设具有重要意义。

关键词: 算法溯源, 标准数据集, 深度学习, 质量评估

Abstract:

To address the challenge of lacking effective metrological evaluation methods for the output values of deep learning object detection algorithms due to their lack of interpretability, this study constructs a technical framework for algorithm traceability from a metrological perspective. Using ship detection in Synthetic Aperture Radar (SAR) images as a typical application scenario, the definition and uncertainty sources of algorithm traceability are clarified, and a traceability technical path based on standard datasets together with a continuous comparison chain is proposed. To meet the metrological requirements of algorithm traceability, a quality evaluation indicator system and standardized testing methods for standard datasets are established. This study has achieved quantitative evaluation and traceability of the performance metrics generated by SAR image-based ship detection algorithms. It provides a benchmark for the reliable evaluation of artificial intelligence algorithms and is of great significance for advancing the development of a standardized evaluation system.

Key words: algorithm traceability, standard datasets, deep learning, quality assessment

中图分类号: