Improved speech modeling and recognition using multi-dimensional articulatory states as primitive speech units

L. Deng, J. Wu, H. Sameti

Research output: Contribution to journal, Conference article, peer-reviewed

3 Citations (Scopus)

Abstract

In this paper we provide a formal description of a speech recognizer designed on the basis of elaborate articulatory timing that is asynchronous across the multiple articulatory-feature dimensions. Three recently improved critical components of the recognizer are described in detail. Evaluation results, obtained from a standard TIMIT phonetic recognition task confined within the N-best rescoring scenario, are reported on comparative performances between the new feature-based recognizer and a recognizer using the conventional context-dependent triphone units. The results demonstrate an overall superior quality of the rescored N-best list from the feature-based recognizer over that from the triphone-based recognizer. Greater performance improvements are observed as the top number of candidate sentences increases.
Original language: English
Pages (from-to): 385-388
Number of pages: 4
Journal: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume: 1
Publication status: Published - Jan 1 1995
Externally published
Event: Proceedings of the 1995 20th International Conference on Acoustics, Speech, and Signal Processing. Part 1 (of 5) - Detroit, MI, USA
Duration: May 9 1995 to May 12 1995

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Electrical and Electronic Engineering
