摘要
We propose a semantic template-based distributed representation for the convolutional neural network called Semantic Template-based Convolutional Neural Network (STCNN) for text categorization that imitates the perceptual behavior of human comprehension. STCNN is a highly automatic approach that learns semantic templates that characterize a domain from raw text and recognizes categories of documents using a semantic-infused convolutional neural network that allows a template to be partially matched through a statistical scoring system. Our experiment results show that STCNN effectively classifies documents in about 140,000 Chinese news articles into predefined categories by capturing the most prominent and expressive patterns and achieves the best performance among all compared methods for Chinese topic classification. Finally, the same knowledge can be directly used to perform a semantic analysis task.
原文 | 英語 |
---|---|
文章編號 | 249 |
期刊 | ACM Transactions on Asian and Low-Resource Language Information Processing |
卷 | 22 |
發行號 | 11 |
DOIs | |
出版狀態 | 已發佈 - 11月 20 2023 |
ASJC Scopus subject areas
- 一般電腦科學