Logo image
Recursive fuzzy granulation for gene subsets extraction and cancer classification
Journal article

Recursive fuzzy granulation for gene subsets extraction and cancer classification

Yuchun Tang, Yan-Qing Zhang, Zhen Huang, Xiaohua Hu and Yichuan Zhao
IEEE transactions on information technology in biomedicine, v 12(6), pp 723-730
Nov 2008
PMID: 19000951

Abstract

Computational Biology - methods Oligonucleotide Array Sequence Analysis - methods Artificial Intelligence Humans Databases, Genetic Gene Expression Profiling - methods Male Neoplasms - classification Algorithms Prostatic Neoplasms - genetics Neoplasms - genetics Prostatic Neoplasms - classification Fuzzy Logic
A typical microarray gene expression dataset is usually both extremely sparse and imbalanced. To select multiple highly informative gene subsets for cancer classification and diagnosis, a new Fuzzy Granular Support Vector Machine---Recursive Feature Elimination algorithm (FGSVM-RFE) is designed in this paper. As a hybrid algorithm of statistical learning, fuzzy clustering, and granular computing, the FGSVM-RFE separately eliminates irrelevant, redundant, or noisy genes in different granules at different stages and selects highly informative genes with potentially different biological functions in balance. Empirical studies on three public datasets demonstrate that the FGSVM-RFE outperforms state-of-the-art approaches. Moreover, the FGSVM-RFE can extract multiple gene subsets on each of which a classifier can be modeled with 100% accuracy. Specifically, the independent testing accuracy for the prostate cancer dataset is significantly improved. The previous best result is 86% with 16 genes and our best result is 100% with only eight genes. The identified genes are annotated by Onto-Express to be biologically meaningful.

Metrics

10 Record Views
47 citations in Scopus

Details

UN Sustainable Development Goals (SDGs)

This publication has contributed to the advancement of the following goals:

#3 Good Health and Well-Being

InCites Highlights

Data related to this publication, from InCites Benchmarking & Analytics tool:

Collaboration types
Domestic collaboration
International collaboration
Web of Science research areas
Computer Science, Information Systems
Computer Science, Interdisciplinary Applications
Mathematical & Computational Biology
Medical Informatics
Logo image