Improving Chinese lip-reading recognizing rate by unsymmetrical lip contour model
Updated: 2020-08-12
Optics and Precision Engineering, Vol. 14, Issue 3, Pages: 473-476 (2006)
Author affiliation:
School of Precision Instrument and Opto-Electronics Engineering, Tianjin University, Tianjin 300072
CLC: TP391.43
Received: 20 December 2005
Revised: 14 February 2006
Published Online: 30 June 2006
Published: 30 June 2006
LI Gang, WANG Meng-jun, LIN Ling. Improving Chinese lip-reading recognizing rate by unsymmetrical lip contour model[J]. Optics and Precision Engineering, 2006, 14(3): 473-476.
Based on an analysis of side-face and full-face images, a new model is presented to extract the degree of pouting from the lip contour. In addition, the derivatives of several parameters are calculated to describe the dynamic characteristics of the lip contour. Experimental results on a small database of Chinese words show that the parameters from the unsymmetrical lip contour model improve the recognition rate by more than 25%, outperforming the traditional symmetrical lip contour model.
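
The abstract does not state how the derivatives of the contour parameters are computed. As a minimal sketch only, assuming each video frame yields a fixed-length vector of contour parameters, a central finite difference over time is one common way to obtain such dynamic features. The parameter names (width, height, pout), the sample values, and both function names are hypothetical and are not taken from the paper.

```python
from typing import List, Sequence


def temporal_derivative(values: Sequence[float], frame_dt: float = 1.0) -> List[float]:
    """Finite-difference estimate of the time derivative of one parameter track.

    Uses a central difference for interior frames and a one-sided
    difference at the first and last frame.
    """
    n = len(values)
    if n < 2:
        return [0.0] * n
    deriv = []
    for i in range(n):
        lo = max(i - 1, 0)
        hi = min(i + 1, n - 1)
        deriv.append((values[hi] - values[lo]) / ((hi - lo) * frame_dt))
    return deriv


def add_dynamic_features(frames: Sequence[Sequence[float]],
                         frame_dt: float = 1.0) -> List[List[float]]:
    """Append the per-parameter derivatives to each frame's static parameters."""
    if not frames:
        return []
    tracks = list(zip(*frames))  # one sequence per parameter
    derivs = [temporal_derivative(track, frame_dt) for track in tracks]
    return [list(frame) + [d[i] for d in derivs]
            for i, frame in enumerate(frames)]


# Hypothetical per-frame parameters: [width, height, pout]
frames = [
    [40.0, 10.0, 2.0],
    [42.0, 14.0, 3.0],
    [46.0, 16.0, 5.0],
]
features = add_dynamic_features(frames)
print(features[1])  # [42.0, 14.0, 3.0, 3.0, 3.0, 1.5]
```

Each output row holds the static parameters followed by their estimated rates of change, so a classifier sees both the lip shape and how quickly it is changing.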