Conference proceeding
Semantic Representation in Text Classification Using Topic Signature Mapping
2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, v 10, pp 1034-1040
01 Jan 2008
Abstract
Document representation is one of the crucial components that determine the effectiveness of text classification tasks. Traditional document representation approaches typically adopt a popular bag-of-word method as the underlying document representation. Although it's a simple and efficient method, the major shortcoming of bag-of-word representation is in the independent of word feature assumption. Many researchers have attempted to address this issue by incorporating semantic information into document representation. In this paper, we study the effect of semantic representation on the effectiveness of text classification systems. We employed a novel semantic smoothing technique to derive semantic information in a form of mapping probability between topic signatures and single-word features. Two classifiers, Naive Bayes and Support Vector Machine, were selected to carry out the classification experiments. Overall, our topic-signature semantic representation approaches significantly outperformed traditional bag-of-word representation in most datasets.
Metrics
Details
- Title
- Semantic Representation in Text Classification Using Topic Signature Mapping
- Creators
- Palakorn Achananuparp - Drexel UniversityXiaohua Zhou - Drexel UniversityXiaohua Hu - Drexel UniversityXiaodan Zhang - Drexel UniversityIEEE
- Publication Details
- 2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, v 10, pp 1034-1040
- Series
- IEEE International Joint Conference on Neural Networks (IJCNN)
- Publisher
- IEEE
- Number of pages
- 7
- Resource Type
- Conference proceeding
- Language
- English
- Academic Unit
- Information Science
- Web of Science ID
- WOS:000263827200169
- Scopus ID
- 2-s2.0-56349141680
- Other Identifier
- 991019167442204721
InCites Highlights
Data related to this publication, from InCites Benchmarking & Analytics tool:
- Web of Science research areas
- Computer Science, Artificial Intelligence
- Computer Science, Cybernetics
- Engineering, Electrical & Electronic