浏览全部资源
扫码关注微信
1.海军航空工程学院 控制工程系, 山东 烟台 264001
2.中国国防科技信息中心, 北京 100142
3.91206部队, 山东 青岛 266108
[ "刘峰(1988-), 男, 黑龙江哈尔滨人, 博士研究生, 2011年于华中科技大学获得学士学位, 2013年于海军航空工程学院获得硕士学位, 主要从事机器视觉及目标检测识别等研究。E-mail:liufeng_cv@126.com" ]
[ "马新星(1982-), 男, 江苏如牟人, 博士研究生, 2005年, 2013年于海军航空工程学院分别获得学士学位、硕士学位, 主要从事红外图像处理、场景仿真、目标检测等研究。E-mail:xinxing_ma@tom.com" ]
收稿日期:2017-04-11,
录用日期:2017-6-30,
纸质出版日期:2017-11-25
移动端阅览
刘峰, 沈同圣, 马新星, 等. 基于多波段深度神经网络的舰船目标识别[J]. 光学 精密工程, 2017,25(11):2939-2946.
Feng LIU, Tong-sheng SHEN, Xin-xing MA, et al. Ship recognition based on multi-band deep neural network[J]. Optics and precision engineering, 2017, 25(11): 2939-2946.
刘峰, 沈同圣, 马新星, 等. 基于多波段深度神经网络的舰船目标识别[J]. 光学 精密工程, 2017,25(11):2939-2946. DOI: 10.3788/OPE.20172511.2939.
Feng LIU, Tong-sheng SHEN, Xin-xing MA, et al. Ship recognition based on multi-band deep neural network[J]. Optics and precision engineering, 2017, 25(11): 2939-2946. DOI: 10.3788/OPE.20172511.2939.
考虑多波段图像的融合识别可以扩展识别系统的应用范围,本文探索并设计了一种基于卷积神经网络的融合识别方法。该方法以AlexNet网络模型为基础,同时对可见光、中波红外和长波红外三波段图像进行特征提取;然后,利用互信息的方法对串联的三波段特征向量进行特征选择,依据重要性排序的方式选定固定长度的特征向量;最后,依据特征提取层级的不同,分别以早期融合、中期融合和后期融合3种融合方式来验证算法的有效性。采用自建的三波段舰船图像数据库进行了模型的训练和测试,共包含6类目标,5 000余张图像。实验结果显示,采用的3种融合识别方法中,中间层融合的识别准确率最高,达到84.5%,比早期融合和后期融合分别高5%和7%左右。另外,在本文的应用场景下,无论何种融合方式,其融合识别的准确率均明显高于其他单波段识别的准确率。
The fusion recognition of multi-band images can extend the application range of recognition systems. A fusion method based on convolutional neural networks (CNN) was explored and designed in this paper. Based on the AlexNet network model
it was extracted that the ship target features of three wave band images concurrently in visible light
Middle Wave Infrared (MWIR) and Long Wave Infrared (LWIR) bands. Then
it performed the feature selection for concatenated three-band eigenvectors by using the mutual information method and determines the dimensions of fusion eigenvectors according to sorting the importance of concatenated feature eigenvectors. Finally
three fusion methods named as Early fusion
Middle fusion and Late fusion were used to verify respectively the effectiveness of the proposed algorithm according to the features extracted from different levels. An available ship target dataset in three bands containing 6 categories of targets and more than 5 000 images was established for our experimental verification. The results show that the recognition rate from Middle fusion reaches 84.5%. Compared with Early Fusion and Late Fusion
it increases by 8% and 12%. Moreover
the recognition rates of all three fusion methods have been improve significantly as compared to that of the single band recognitions at the same application scene.
WAGNER J, FISCHER V, HERMAN M, et al .. Multispectral pedestrian detection using deep fusion convolutional neural networks[C]. 24 th European Symposium on Artificial Neural Networks , Computational Intelligence and Machine Learning ( ESANN ), 2016:509-514.
NGIAM J, KHOSLA A, KIM M, et al .. Multimodal deep learning[C]. Proceedings of the 28 th international conference on machine learning ( ICML -11), 2011:689-696.
SRIVASTAVA N, SALAKHUTDINOV R R. Multimodal learning with deep boltzmann machines[C]. Proceedings of the 25 th International Conference on Neural Information Processing Systems , NIPS , 2012:2222-2230.
KARPATHY A, TODERICI G, SHETTY S, et al .. Large-scale video classification with convolutional neural networks[C]. Proceedings of the IEEE conference on Computer Vision and Pattern Recognition , 2014:1725-1732.
WANG LW, LI Y, LAZEBNIK S. Learning deep structure-preserving image-text embeddings[C]. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , 2016:5005-5013.
SOCHER R, HUVAL B, BATH B, et al .. Convolutional-recursive deep learning for 3d object classification[C]. Proceedings of the 25th International Conference on Neural Information Processing Systems , 2012:656-664.
SIMONYAN K, ZISSERMAN A. Two-stream convolutional networks for action recognition in videos[C]. Proceedings of the 27 th International Conference on Neural Information Processing Systems , MIT Press , 2014:568-576.
GUNDOGDU E, SOLMAZ B, YVCESOY V, et al .. MARVEL:A large-scale image dataset for maritime vessels[C]. Asian Conference on Computer Vision , Springer , 2016:165-180.
BOUSETOUANE F, MORRIS B. Fast CNN surveillance pipeline for fine-grained vessel classification and detection in maritime scenarios[C]. 201613 th IEEE International Conference on Advanced Video and Signal Based Surveillance ( AVSS ), 2016:242-248.
REN S Q, HE K M, GIRSHICK R, et al .. Faster R-CNN:Towards real-time object detection with region proposal networks[C]. AIPS Proceedings of the 28 th International Conference on Neural Information Processing Systems , 2015:91-99.
ZHANG M M, CHOI J, DANⅡLIAIS K, et al .. Vais:A dataset for recognizing maritime imagery in the visible and infrared spectrums[C]. IEEE Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops , 2015:10-16.
周飞燕, 金林鹏, 董军.卷积神经网络研究综述[J].计算机学报, 2017, 40(6):1229-1251.
ZHOU F Y, JIN L P, DONG J. Review of convolutional neural network[J]. Chinese Journal of Computers, 2017, 40(6):1229-1251.(in Chinese)
KRIZHEVSKY A, SUTSKEVER I, HINTON G E. ImageNet classification with deep convolutional neural networks[J]. Communications of the ACM, 60(6):84-90.
NAIR V, HINTON G E. Rectified linear units improve restricted boltzmann machines[C] Proceedings of the 27 th International Conference on Machine Learning ( ICML -10), Omnipress, 2010:807-814.
ZHANG Y, WU J X, CAI J F. Compact representation for image classification:To choose or to compress?[C] Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , 2014:907-914.
YE P, KUMAR J, DOERMANN D. Beyond human opinion scores:blind image quality assessment based on synthetic scores[C] IEEE Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , 2014:4241-4248.
XUE W F, ZHANG L, MOU X Q, et al.. Gradient magnitude similarity deviation:a highly efficient perceptual image quality index[J]. IEEE Transactions on Image Processing, 2014, 23(2):684-695.
SHEIKH H R, BOVIK A C, DE VECIANA G. An information fidelity criterion for image quality assessment using natural scene statistics[J]. IEEE Transactions on image Processing, 2005, 14(12):2117-2128.
ZHANG L, ZHANG L, MOU X, et al.. FSIM:a feature similarity index for image quality assessment[J]. IEEE transactions on Image Processing, 2011, 20(8):2378-2386.
WANG Z, BOVIK A C, SHEIKH H R, et al.. Image quality assessment:from error visibility to structural similarity[J]. IEEE Transactions on Image Processing, 2004, 13(4):600-612.
0
浏览量
461
下载量
18
CSCD
关联资源
相关文章
相关作者
相关机构