Logo image
A Novel Hybrid HDP-LDA Model for Sentiment Analysis
Conference proceeding

A Novel Hybrid HDP-LDA Model for Sentiment Analysis

Wanying Ding, Xiaoli Song, Lifan Guo, Zunyan Xiong and Xiaohua Hu
2013 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 1, v 1, pp 329-336
01 Jan 2013

Abstract

Computer Science, Theory & Methods Engineering, Electrical & Electronic Science & Technology Computer Science Engineering Technology
Sentiment analysis studies the public opinions towards an entity, and it is an important research area in data mining. Recently, a lot of sentiment analysis models have been proposed, including supervised and unsupervised approaches. However, the role of supervised models has been undermined by the phenomenon of big data, and the unsupervised ones are drawing more and more attention. But, most current unsupervised methods are based on Latent Dirichlet Allocation (LDA), and they need to specify the number of aspects in advance, making them subjective. In addition, these methods treat factual words and opinioned words the same, and assume that one sentence contains only one aspect, all of which make the existing unsupervised methods unsatisfactory. To solve these problems, this paper proposes a novel hybrid Hierarchical Dirichlet Process-Latent Dirichlet Allocation (HDP-LDA) model. This model can automatically determine the number of aspects, distinguish factual words from opinioned words, and further effectively extracts the aspect specific sentiment words. Experiment result shows that our model can clearly capture the aspects people mentioned and the specific sentiment words they use in each aspect, improving the performance of sentiment analysis efficiently. At last, we compared our model with the influential topic models, namely, JST, AUSM and MaxEnt-LDA, on the online restaurant review, and found our model performs very well.

Metrics

14 Record Views
15 citations in Scopus

Details

InCites Highlights

Data related to this publication, from InCites Benchmarking & Analytics tool:

Web of Science research areas
Computer Science, Theory & Methods
Engineering, Electrical & Electronic
Logo image