Logo image
Biomedical ontology improves biomedical literature clustering performance: a comparison study
Journal article   Peer reviewed

Biomedical ontology improves biomedical literature clustering performance: a comparison study

Illhoi Yoo, Xiaohua Hu and Il-Yeol Song
International journal of bioinformatics research and applications, v 3(3), pp 414-428
2007
PMID: 18048199

Abstract

MEDLINE Publications Medical Subject Headings Computational Biology Information Storage and Retrieval Cluster Analysis
Document clustering has been used for better document retrieval and text mining. In this paper, we investigate if a biomedical ontology improves biomedical literature clustering performance in terms of the effectiveness and the scalability. For this investigation, we perform a comprehensive comparison study of various document clustering approaches such as hierarchical clustering methods, Bisecting K-means, K-means and Suffix Tree Clustering (STC). According to our experiment results, a biomedical ontology significantly enhances clustering quality on biomedical documents. In addition, our results show that decent document clustering approaches, such as Bisecting K-means, K-means and STC, gains some benefit from the ontology while hierarchical algorithms showing the poorest clustering quality do not reap the benefit of the biomedical ontology.

Metrics

14 Record Views
9 citations in Scopus

Details

Logo image