Logo image
Document clustering by semantic smoothing and dynamic growing cell structure (DynGCS) for biomedical literature
Conference proceeding   Peer reviewed

Document clustering by semantic smoothing and dynamic growing cell structure (DynGCS) for biomedical literature

Min Song, Xiaohua Hu, Illhoi Yoo and Eric Koppel
DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, v 5182, pp 217-226
01 Jan 2008

Abstract

Computer Science Computer Science, Artificial Intelligence Computer Science, Information Systems Computer Science, Theory & Methods Science & Technology Technology
The general goal of clustering is to group data elements such that the intra-group similarities are high and the inter-group similarities are low. In this paper, we propose a novel hybrid clustering technique that incorporates semantic smoothing of document models into a neural network framework. Recently it has been reported that the semantic smoothing model enhances the retrieval quality in Information Retrieval (IR). Inspired by that, we apply the context-sensitive semantic smoothing model to boost accuracy of clustering that is generated by a dynamic growing cell structure algorithm, a variation of the neural network technique. We evaluated the proposed technique on article sets from MEDLINE, the largest biomedical digital library in Biomedicine. Our experimental evaluations show that the proposed algorithm significantly improves the clustering quality over the traditional clustering techniques.

Metrics

5 Record Views

Details

InCites Highlights

Data related to this publication, from InCites Benchmarking & Analytics tool:

Collaboration types
Domestic collaboration
Web of Science research areas
Computer Science, Artificial Intelligence
Computer Science, Information Systems
Computer Science, Theory & Methods
Logo image