Sentiment analysis on Chinese movie review with distributed keyword vector representation

Chun Han Chu, Chen Ann Wang, Yung Chun Chang, Ying Wei Wu, Yu Lun Hsieh, Wen Lian Hsu

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

6 Citations (Scopus)

Abstract

In the area of natural language processing, applying machine learning techniques to customer or movie reviews for sentiment analysis has been frequently attempted. While methods such as support vector machines (SVM) were much favored in the 2000s, a steadily rising share of recent implementations use vector representations and artificial neural networks. In this article we present an approach that uses word embeddings to conduct sentiment analysis on movie reviews from a renowned bulletin board system forum in Taiwan. After computing the log-likelihood ratio (LLR) on the corpus and selecting the top 10,000 most related keywords as representative vectors for different sentiments, we use these vectors as the sentiment classifier for the testing set. We achieved results that are not only comparable to traditional methods like Naïve Bayes and SVM, but also outperform Latent Dirichlet Allocation, TF-IDF and its variant. Our approach also tops the original LLR by a substantial margin.
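
The keyword-selection step mentioned in the abstract can be illustrated with a minimal sketch. It assumes Dunning's G² form of the log-likelihood ratio computed over document counts in a 2x2 contingency table, which may differ from the exact formulation used in the paper. The function names `llr` and `top_keywords`, the pre-tokenized input, and the toy data are hypothetical and chosen only for illustration; the word-embedding step is not shown.

```python
import math
from collections import Counter


def llr(k11, k12, k21, k22):
    """Dunning's log-likelihood ratio (G^2) for a 2x2 contingency table.

    k11: target-class docs containing the word
    k12: target-class docs without the word
    k21: other docs containing the word
    k22: other docs without the word
    """
    n = k11 + k12 + k21 + k22
    rows = (k11 + k12, k21 + k22)
    cols = (k11 + k21, k12 + k22)
    total = 0.0
    for i, j, k in ((0, 0, k11), (0, 1, k12), (1, 0, k21), (1, 1, k22)):
        if k > 0:  # a zero cell contributes nothing to the sum
            expected = rows[i] * cols[j] / n
            total += k * math.log(k / expected)
    return 2.0 * total


def top_keywords(docs, labels, target, top_n):
    """Rank words over-represented in `target` documents by LLR.

    docs: list of token lists (assumed already word-segmented)
    labels: sentiment labels, parallel to docs
    """
    in_target, in_other = Counter(), Counter()
    n_target = n_other = 0
    for tokens, label in zip(docs, labels):
        if label == target:
            n_target += 1
            in_target.update(set(tokens))  # document frequency, not term frequency
        else:
            n_other += 1
            in_other.update(set(tokens))
    scores = {}
    for word in set(in_target) | set(in_other):
        k11, k21 = in_target[word], in_other[word]
        # Keep only words relatively more frequent in the target class.
        if k11 * n_other > k21 * n_target:
            scores[word] = llr(k11, n_target - k11, k21, n_other - k21)
    return sorted(scores, key=scores.get, reverse=True)[:top_n]


# Toy, made-up data purely for illustration.
docs = [["good", "movie"], ["good", "plot"], ["bad", "movie"], ["bad", "plot"]]
labels = ["pos", "pos", "neg", "neg"]
print(top_keywords(docs, labels, "pos", 1))
```

In the setting the abstract describes, `top_n` would correspond to the 10,000 keywords selected, and the selected keywords would then be mapped to embedding vectors before classification.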

Original language: English
Title of host publication: TAAI 2016 - 2016 Conference on Technologies and Applications of Artificial Intelligence, Proceedings
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 84-89
Number of pages: 6
ISBN (Electronic): 9781509057320
Publication status: Published - Mar 16 2017
Externally published: Yes
Event: 2016 Conference on Technologies and Applications of Artificial Intelligence, TAAI 2016 - Hsinchu, Taiwan
Duration: Nov 25 2016 to Nov 27 2016

Publication series

Name: TAAI 2016 - 2016 Conference on Technologies and Applications of Artificial Intelligence, Proceedings

Conference

Conference: 2016 Conference on Technologies and Applications of Artificial Intelligence, TAAI 2016
Country/Territory: Taiwan
City: Hsinchu
Period: 11/25/16 to 11/27/16

Keywords

  • LLR
  • machine learning
  • sentiment analysis
  • TF-IDF
  • word embedding

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Computer Science Applications
  • Control and Optimization
  • Information Systems
