PAN Rong, SUN Wei,. Deep learning target detection based on pre-segmentation and regression[J]. Editorial Office of Optics and Precision Engineering, 2017,25(10s): 221-227
PAN Rong, SUN Wei,. Deep learning target detection based on pre-segmentation and regression[J]. Editorial Office of Optics and Precision Engineering, 2017,25(10s): 221-227 DOI: 10.3788/OPE.20172513.0221.
Deep learning target detection based on pre-segmentation and regression
Aiming at the problem of difficult small target detection in high resolution images
combined with region-of-interest (ROI) extraction strategy in target detection method based on candidate region and regression strategy in target detection algorithm based on regression
deep learning target detection algorithm based on pre-segmentation and regression (Quad-ssd) was proposed. As fast-RCNN series implement image location and classification separately
small targets could be detected but detection time was too long. YOLO series method used regression method to implement classification and location for targets in images at the same time. As only high-level features were used
detection accuracy for small target was not enough. Therefore
quad tree was used to extract interest target of original images
and target detection method based on regression was used to implement detailed relocation and classification for targets in interested region. Compared with traditional Fast-RCNN method and deep learning method based on regression of YOLO series
target detection algorithm of deep learning based on quad tree has obvious advantages in accuracy and speed. The experimental results show that compared with Fast-RCNN
accuracy of Quad-ssd algorithm is improved by 6.5% and reaches 74.9% at the time of target detection. The detection speed is improved greatly; reaching 45 f/s
TAIGMAN Y, YANG M, RANZATO M, et al.. DeepFace:Closing the Gap to Human-Level Performance in Face Verification[C]. IEEE Conference on Computer Vision and Pattern Recognition. IEEE Computer Society, 2014:1701-1708.
MA X, GRIMSON W E L. Edge-based rich representation for vehicle classification[C]. Tenth IEEE International Conference on Computer Vision. IEEE, 2005:1185-1192 Vol. 2.
KAZEMI F M, SAMADI S, POORREZA H R, et al.. Vehicle Recognition Using Curvelet Transform and SVM[C]. International Conference on Information Technology. IEEE, 2007:516-521.
FREUND Y, SCHAPIRE R E. A desicion-theoretic generalization of on-line learning and an application to boosting[C]. European Conference on Computational Learning Theory. Springer Berlin Heidelberg, 1995:23-37.
FELZENSZWALB P F, GIRSHICK R B, MCALLESTER D, et al.. Object detection with discriminatively trained part-based models.[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2010, 32(9):1627-45.
SZEGEDY C, TOSHEV A, ERHAN D. Deep Neural Networks for Object Detection.[C]. Advances in Neural Information Processing Systems[S. l.]:NIPS Press, 2013:1673-1675.
SERMANET P, EIGEN D, ZHANG X, et al.. OverFeat:Integrated Recognition, Localization and Detection using Convolutional Networks[J]. Eprint Arxiv, 2013.
GIRSHICK R, DONAHUE J, DARRELL T, et al.. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation[J]. 2013:580-587.
UIJLINGS J R, SANDE K E, GEVERS T, et al.. Selective Search for Object Recognition[J]. International Journal of Computer Vision, 2013, 104(2):154-171.
HE K, ZHANG X, REN S, et al.. Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2015, 37(9):1904.
GIRSHICK R. Fast R-CNN[J]. Computer Science, 2015.
REN S, HE K, GIRSHICK R, et al.. Faster R-CNN:Towards Real-Time Object Detection with Region Proposal Networks[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2016, 39(6):1137.
REDMON J, DIVVALA S, GIRSHICK R, et al.. You only look once:unified, real-time object detection[J]. IEEE Computer Society, 2015:779-788.
LIU W, ANGUELOV D, ERHAN D, et al.. SSD:Single Shot MultiBox Detector[C]. European Conference on Computer Vision. Springer International Publishing, 2016:21-37.
LONG J, SHELHAMER E, DARRELL T. Fully convolutional networks for semantic segmentation[C]. Computer Vision and Pattern Recognition. IEEE, 2015:3431-3440.
FELZENSZWALB P F, GIRSHICK R B, MCALLESTER D, et al.. Object detection with discriminatively trained part-based models.[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2010, 32(9):1627-1645.
RUSSAKOVSKY O, DENG J, SU H, et al.. ImageNet Large Scale Visual Recognition Challenge[J]. International Journal of Computer Vision, 2015, 115(3):211-252.