Lip contour description based on orthogonal transform in visual driven speech synthesis system
Updated: 2020-08-12
Optics and Precision Engineering, Vol. 15, Issue 7, Pages 1117-1123 (2007)
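Author affiliations:
1. School of Precision Instrument and Opto-Electronics Engineering, Tianjin University, Tianjin 300072, China
2. Academy of Military Transportation, Tianjin 300161, China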
CLC:TN912.34
Received: 18 October 2006
Revised: 25 December 2006
Published Online: 30 July 2007
Published: 30 July 2007
LI Gang, WANG Meng-jun, LIN Ling, et al. Lip contour description based on orthogonal transform in visual driven speech synthesis system[J]. Optics and precision engineering, 2007, 15(7): 1117-1123.
To describe lip contours in a lip-reading system automatically and quickly, an orthogonal compression transform was applied to lip contour feature extraction. The Discrete Fourier Transform (DFT) and the Discrete Cosine Transform (DCT) were used to obtain descriptors of the lip contours in an asymmetrical lip contour model. A Hidden Markov Model (HMM) was then trained using the two kinds of descriptors as feature vectors of the lip contours. Experiments on isolated Chinese words show that, at the same recognition rate of 40%, 15 DCT descriptors are needed compared with 20 DFT descriptors. The experiments also show that, at the same recognition rate, the DCT markedly reduces the amount of computation and the processing time.
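As an illustration of the descriptor idea summarized above, the following Python sketch computes truncated DCT and DFT descriptors of a sampled contour. It is not the authors' implementation: the abstract does not state how the contour is parameterized, so the radius-versus-angle signal, the synthetic two-half-ellipse shape, and all function names here are assumptions made for the example. Only the descriptor counts (15 for the DCT, 20 for the DFT) come from the abstract.

```python
import numpy as np


def dct2_orthonormal(x):
    """Orthonormal DCT-II of a 1-D real signal, written out with NumPy only."""
    x = np.asarray(x, dtype=float)
    n = x.size
    k = np.arange(n)[:, None]  # output (frequency) index
    m = np.arange(n)[None, :]  # input (sample) index
    basis = np.cos(np.pi * (2 * m + 1) * k / (2 * n))
    scale = np.full(n, np.sqrt(2.0 / n))
    scale[0] = np.sqrt(1.0 / n)
    return scale * (basis @ x)


def dct_descriptors(radii, count):
    """Keep the first `count` DCT coefficients as a feature vector."""
    return dct2_orthonormal(radii)[:count]


def dft_descriptors(radii, count):
    """Keep the magnitudes of the first `count` DFT coefficients."""
    spectrum = np.fft.fft(np.asarray(radii, dtype=float)) / len(radii)
    return np.abs(spectrum[:count])


def synthetic_lip_contour(samples=64):
    """Hypothetical asymmetric contour: two half-ellipses of different height.

    Returns the distance from the centre to the contour at evenly spaced
    angles. This stands in for a real, tracked lip contour.
    """
    theta = np.linspace(0.0, 2.0 * np.pi, samples, endpoint=False)
    half_width = 2.0
    # Upper half is flatter than the lower half (assumed values).
    half_height = np.where(np.sin(theta) >= 0, 0.8, 1.2)
    return half_width * half_height / np.sqrt(
        (half_height * np.cos(theta)) ** 2 + (half_width * np.sin(theta)) ** 2
    )


radii = synthetic_lip_contour()
dct_feat = dct_descriptors(radii, 15)  # 15 DCT descriptors, as in the abstract
dft_feat = dft_descriptors(radii, 20)  # 20 DFT descriptors, as in the abstract
```

In a recognition pipeline of the kind described, one such vector per video frame would form the observation sequence for an HMM. Because the DCT of a real signal is itself real, the shorter DCT vector also avoids complex arithmetic, which is consistent with the reduced computation reported in the abstract.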