Feng LIU, Tong-sheng SHEN, Xin-xing MA, et al. Ship recognition based on multi-band deep neural network[J]. Optics and precision engineering, 2017, 25(11): 2939-2946.
DOI:
Feng LIU, Tong-sheng SHEN, Xin-xing MA, et al. Ship recognition based on multi-band deep neural network[J]. Optics and precision engineering, 2017, 25(11): 2939-2946. DOI: 10.3788/OPE.20172511.2939.
Ship recognition based on multi-band deep neural network
The fusion recognition of multi-band images can extend the application range of recognition systems. A fusion method based on convolutional neural networks (CNN) was explored and designed in this paper. Based on the AlexNet network model
it was extracted that the ship target features of three wave band images concurrently in visible light
Middle Wave Infrared (MWIR) and Long Wave Infrared (LWIR) bands. Then
it performed the feature selection for concatenated three-band eigenvectors by using the mutual information method and determines the dimensions of fusion eigenvectors according to sorting the importance of concatenated feature eigenvectors. Finally
three fusion methods named as Early fusion
Middle fusion and Late fusion were used to verify respectively the effectiveness of the proposed algorithm according to the features extracted from different levels. An available ship target dataset in three bands containing 6 categories of targets and more than 5 000 images was established for our experimental verification. The results show that the recognition rate from Middle fusion reaches 84.5%. Compared with Early Fusion and Late Fusion
it increases by 8% and 12%. Moreover
the recognition rates of all three fusion methods have been improve significantly as compared to that of the single band recognitions at the same application scene.
关键词
Keywords
references
WAGNER J, FISCHER V, HERMAN M, et al .. Multispectral pedestrian detection using deep fusion convolutional neural networks[C]. 24 th European Symposium on Artificial Neural Networks , Computational Intelligence and Machine Learning ( ESANN ), 2016:509-514.
NGIAM J, KHOSLA A, KIM M, et al .. Multimodal deep learning[C]. Proceedings of the 28 th international conference on machine learning ( ICML -11), 2011:689-696.
SRIVASTAVA N, SALAKHUTDINOV R R. Multimodal learning with deep boltzmann machines[C]. Proceedings of the 25 th International Conference on Neural Information Processing Systems , NIPS , 2012:2222-2230.
KARPATHY A, TODERICI G, SHETTY S, et al .. Large-scale video classification with convolutional neural networks[C]. Proceedings of the IEEE conference on Computer Vision and Pattern Recognition , 2014:1725-1732.
WANG LW, LI Y, LAZEBNIK S. Learning deep structure-preserving image-text embeddings[C]. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , 2016:5005-5013.
SOCHER R, HUVAL B, BATH B, et al .. Convolutional-recursive deep learning for 3d object classification[C]. Proceedings of the 25th International Conference on Neural Information Processing Systems , 2012:656-664.
SIMONYAN K, ZISSERMAN A. Two-stream convolutional networks for action recognition in videos[C]. Proceedings of the 27 th International Conference on Neural Information Processing Systems , MIT Press , 2014:568-576.
GUNDOGDU E, SOLMAZ B, YVCESOY V, et al .. MARVEL:A large-scale image dataset for maritime vessels[C]. Asian Conference on Computer Vision , Springer , 2016:165-180.
BOUSETOUANE F, MORRIS B. Fast CNN surveillance pipeline for fine-grained vessel classification and detection in maritime scenarios[C]. 201613 th IEEE International Conference on Advanced Video and Signal Based Surveillance ( AVSS ), 2016:242-248.
REN S Q, HE K M, GIRSHICK R, et al .. Faster R-CNN:Towards real-time object detection with region proposal networks[C]. AIPS Proceedings of the 28 th International Conference on Neural Information Processing Systems , 2015:91-99.
ZHANG M M, CHOI J, DANⅡLIAIS K, et al .. Vais:A dataset for recognizing maritime imagery in the visible and infrared spectrums[C]. IEEE Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops , 2015:10-16.
ZHOU F Y, JIN L P, DONG J. Review of convolutional neural network[J]. Chinese Journal of Computers, 2017, 40(6):1229-1251.(in Chinese)
KRIZHEVSKY A, SUTSKEVER I, HINTON G E. ImageNet classification with deep convolutional neural networks[J]. Communications of the ACM, 60(6):84-90.
NAIR V, HINTON G E. Rectified linear units improve restricted boltzmann machines[C] Proceedings of the 27 th International Conference on Machine Learning ( ICML -10), Omnipress, 2010:807-814.
ZHANG Y, WU J X, CAI J F. Compact representation for image classification:To choose or to compress?[C] Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , 2014:907-914.
YE P, KUMAR J, DOERMANN D. Beyond human opinion scores:blind image quality assessment based on synthetic scores[C] IEEE Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , 2014:4241-4248.
XUE W F, ZHANG L, MOU X Q, et al.. Gradient magnitude similarity deviation:a highly efficient perceptual image quality index[J]. IEEE Transactions on Image Processing, 2014, 23(2):684-695.
SHEIKH H R, BOVIK A C, DE VECIANA G. An information fidelity criterion for image quality assessment using natural scene statistics[J]. IEEE Transactions on image Processing, 2005, 14(12):2117-2128.
ZHANG L, ZHANG L, MOU X, et al.. FSIM:a feature similarity index for image quality assessment[J]. IEEE transactions on Image Processing, 2011, 20(8):2378-2386.
WANG Z, BOVIK A C, SHEIKH H R, et al.. Image quality assessment:from error visibility to structural similarity[J]. IEEE Transactions on Image Processing, 2004, 13(4):600-612.