The action recognition framework based on local features was established to improve the recognition ability of human behavior in video sequences. The algorithms related to the framework were researched through spatial temporal features extracting and encoding and parameters optimization of SVM classifier. Firstly
the feature descriptors composed of Histograms of Oriented Gradients (HOG) and Histograms of Optical Flow (HOF) were used to describe Space Time Interest Points (STIP) achieved by the Harris 3D detector and then encoded by Fisher Vector (FV). Due to the generalization ability of Support Vector Machine (SVM) model for action classification under fixed parameters was insufficient
the particle swarm optimization algorithm was applied to the parameter optimization of each action classifier. According to the characteristics of population diversity changed from generation to generation
the constructed particles aggregation degree model was used to adjust mutation probability for each generation of particles dynamically. Finally
the proposed method was verified by KTH and HMDB51 data sets. The results show that the Adaptive Mutation Particle Swarm Optimization (AMPSO) algorithm can avoid the local optimum and has strong global optimization capability. The recognition accuracies on KTH and HMDB51 data sets are 87.50% and 26.41%
respectively
which are better than two recognition methods. Experimental results indicate that the AMPSO algorithm has good convergence performance and the overall recognition framework has high practicability and accuracy.
WANG S G, LU F J, ZHAO W T, et al.. Action recognition based on-line random forest voting [J]. Opt. Precision Eng., 2016, 24(08): 2010-2017. (in Chinese)
PENG X J, WANG L M, WANG X X, et al.. Bag of visual words and fusion methods for action recognition: Comprehensive study and good practice [J]. Computer Vision and Image Understanding, 2016, 150: 109-125.
SCHULDT C, LAPTEV I, CAPUTO B. Recognizing human actions: a local SVM approach [C]. 2004 IEEE Proceedings of the 17th International Conference on Pattern Recognition, Cambridge, UK, ICPR, 2004: 32-36.
LAPTEV I. On space-time interest points [J]. International Journal of Computer Vision, 2005, 64(2-3): 107-123.
WANG H, SCHMID C. Action recognition with improved trajectories [C]. 2013 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, Portland, USA, CVPR, 2013: 3551-3558.
LIU L, WANG L, LIU X. In defense of soft-assignment coding [C]. 2011 International Conference on Computer Vision, Barcelona, Spain, ICCV, 2011: 2486-2493.
PERRONNIN F, SANCHEZ J, MENSINK T. Improving the fisher kernel for large-scale image classification [C]. 2010 European conference on computer vision, Crete, Greece, ECCV, 2010: 143-156.
SONG Y, LIN Z X, YAO J M. Luminance uniformity evaluation for backlight based on ε-support vector regression [J]. Chinese Journal of Liquid Crystals and Displays, 2015, 30(05): 857-863. (in Chinese)
SHEN L, CHEN H, YU Z, et al.. Evolving support vector machines using fruit fly optimization for medical data classification [J]. Knowledge-Based Systems, 2016, 96: 61-75.
HUANG C L, WANG C J. A GA-based feature selection and parameters optimizationfor support vector machines [J]. Expert Systems with Applications, 2006, 31(2): 231-240.
WANG G P, CHEN S Y, LIU J.. A Short-Term prediction model based on support vector regression optimized by artificial Fish-Swarm algorithm [J]. International Journal of Control and Automation, 2015, 8(7): 237-250.
段海滨, 张祥银, 徐春芳.仿生智能计算[M].北京:科学出版社, 2011.
DUAN H B, ZHANG X Y, XU C F. Bio-inspired Computing [M]. Beijing: Science Press, 2011. (in Chinese)
LIU Z Q, YIN J Q, ZHANG L, et al.. Human action recognition based on Kinect data principal component analysis [J]. Opt. Precision Eng., 2015, 23(10): 702-711. (in Chinese)
YU H Q, LIU Z. Spectral characterization model of liquid crystal display based on GA-BP network and PCA [J]. Chinese Journal of Liquid Crystals and Displays, 2016, 31(02): 201-207. (in Chinese)
SANCHEZ J, PERRONNIN F, MENSINK T, et al.. Image classification with the fisher vector: Theory and practice [J]. International Journal of Computer Vision, 2013, 105(3): 222-245.
ABOUHAWWASH M, SEADA H, DEB K. Towards faster convergence of evolutionary multi-criterion optimization algorithms using karush kuhn tucker optimality based local search [J]. Computers & Operations Research, 2017, 79(1):331-346.
WANG L, QIAO Y, TANG X. Action recognition with trajectory-pooled deep-convolutional descriptors [C]. 2015 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, Boston, USA, CVPR, 2015: 4305-4314.
LAN Z, LIN M, LI X, et al.. Beyond gaussian pyramid: Multi-skip feature stacking for action recognition [C]. 2015 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, Boston, USA, CVPR, 2015: 204-212.
KUEHNE H, JHUANG H, STIEFELHAGEN R. HMDB51: A large video database for human motion recognition [J]. High Performance Computing in Science and Engineering, 2013, 12(1): 571-582.
FERNANDO B, GAVVES S, MOGROVEJO O, et al.. Rank pooling for action recognition [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(4):773-787.
WANG S G, SUN A M. Single and interactive human behavior recognition algorithm based on spatio-temporal interest point [J]. Journal of Jilin University (Engineering and Technology Edition), 2015, 45(01): 304-308. (in Chinese)