摘要
In this paper we provide a formal description of a speech recognizer designed on the basis of elaborate articulatory timing that is asynchronous across the multiple articulatory-feature dimensions. Three recently improved critical components of the recognizer are described in detail. Evaluation results, obtained from a standard TIMIT phonetic recognition task confined within the N-best rescoring scenario, are reported on comparative performances between the new feature-based recognizer and a recognizer using the conventional context-dependent triphone units. The results demonstrate an overall superior quality of the rescored N-best list from the feature-based recognizer over that from the triphone-based recognizer. Greater performance improvements are observed as the top number of candidate sentences increases.
| 原文 | 英語 |
|---|---|
| 頁(從 - 到) | 385-388 |
| 頁數 | 4 |
| 期刊 | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings |
| 卷 | 1 |
| 出版狀態 | 已發佈 - 1月 1 1995 |
| 對外發佈 | 是 |
| 事件 | Proceedings of the 1995 20th International Conference on Acoustics, Speech, and Signal Processing. Part 1 (of 5) - Detroit, MI, USA 持續時間: 5月 9 1995 → 5月 12 1995 |
ASJC Scopus subject areas
- 軟體
- 訊號處理
- 電氣與電子工程
指紋
深入研究「Improved speech modeling and recognition using multi-dimensional articulatory states as primitive speech units」主題。共同形成了獨特的指紋。引用此
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS