Conference proceeding
TUT: a statistical model for detecting trends, topics and user interests in social media
Proceedings of the 21st ACM international conference on information and knowledge management, pp 972-981
29 Oct 2012
Abstract
The rapid development of online social media sites has been accompanied by the generation of a tremendous amount of web content. Web users are shifting from data consumers to data producers. As a result, topic detection and tracking that does not take users' interests into account is no longer sufficient. This paper presents a statistical model that can detect interpretable trends and topics from document streams, where each trend (short for trending story) corresponds to a series of continuing events or a storyline. A topic is represented by a cluster of frequently co-occurring words. A trend can contain multiple topics, and a topic can be shared by different trends. In addition, by leveraging a Recurrent Chinese Restaurant Process (RCRP), the number of trends in our model can be determined automatically without human intervention, so that our model can better generalize to unseen data. Furthermore, our proposed model incorporates user interest to fully simulate the generation process of web content, which offers the opportunity for personalized recommendation in online social media. Experiments on three different datasets indicated that our proposed model can capture meaningful topics and trends, monitor the rise and fall of detected trends, outperform the baseline approach in terms of perplexity on a held-out dataset, and improve user participation prediction by leveraging users' interests in different trends.
Metrics
26 citations in Scopus
Details
- Title
- TUT
- Creators
- Xuning Tang - Drexel University
- Christopher Yang - Drexel University
- Publication Details
- Proceedings of the 21st ACM international conference on information and knowledge management, pp 972-981
- Conference
- 21st ACM international conference on information and knowledge management (Maui, Hawaii, United States, 2012)
- Series
- CIKM '12
- Publisher
- Association for Computing Machinery (ACM)
- Number of pages
- 10
- Resource Type
- Conference proceeding
- Language
- English
- Academic Unit
- Information Science
- Scopus ID
- 2-s2.0-84871035454
- Other Identifier
- 991019173768404721