您当前的位置:
首页 >
文章列表页 >
Audio object detection network with multimodal cross level feature knowledge transfer
Information Sciences | 更新时间:2024-02-01
    • Audio object detection network with multimodal cross level feature knowledge transfer

    • The application of sound in the field of object detection has made new progress. To address the issue of low robustness of current target localization methods that rely solely on monitoring environmental sounds, experts have proposed a multimodal self supervised object detection network under cross level feature knowledge transfer. This network not only designs a multi teacher cross level feature knowledge transfer loss based on attention fusion to improve the learning ability of the network, but also solves the problem of missing localization information through localization distillation loss. The experimental results in the multimodal audiovisual detection MAVD dataset show that the mAP values of the network have improved by 6.71%, 14.36%, and 10.32% compared to the baseline network at IOU values of 0.5, 0.75, and average, respectively, demonstrating the superiority of the detection network. This research achievement provides a new solution for the application of sound in the field of object detection, and also opens up new directions for research in related fields.
    • Optics and Precision Engineering   Vol. 32, Issue 2, Pages: 237-251(2024)
    • DOI:10.37188/OPE.20243202.0237    

      CLC: TP394.1;TH691.9
    • Received:08 June 2023

      Revised:12 July 2023

      Published:25 January 2024

    移动端阅览

  • LIU Shibei,CHEN Ying.Audio object detection network with multimodal cross level feature knowledge transfer[J].Optics and Precision Engineering,2024,32(02):237-251. DOI: 10.37188/OPE.20243202.0237.

  •  
  •  

0

Views

701

下载量

0

CSCD

Alert me when the article has been cited
提交
Tools
Download
Export Citation
Share
Add to favorites
Add to my album

Related Articles

Photovoltaic hot spot detection method incorporating knowledge distillation and attention mechanisms
Relocation non-maximum suppression algorithm
Survey of target detection based on deep convolutional neural networks
Polarized image feature fusion in power inspection
Remote sensing object detection algorithm based on ultra fusion residual marching geometric perception

Related Author

HAO Shuai
WU Yingqi
MA Xu
LI Tong
WANG Haiying
Shuzhi SU
Runbin CHEN
Yanmin ZHU

Related Institution

College of Electrical and Control Engineering, Xi'an University of Science and Technology
School of Computer Science and Engineering, Anhui University of Science & Technology
Institute of Energy, Hefei Comprehensive National Science Center
School of Computer Mechanical Engineering, Anhui University of Science & Technology
College of Computer Science and Technology, Jilin University
0