Logo image
Predicting gene function with positive and unlabeled examples
Conference proceeding

Predicting gene function with positive and unlabeled examples

Yiming Chen, Zhoujun Li, Xiaohua Hu, Hongxiang Diao and Junwan Liu
2009 IEEE International Conference on Granular Computing, pp 89-94
Aug 2009

Abstract

Clustering algorithms Educational institutions Information science Large-scale systems Prediction methods Proteins Agricultural Engineering Bioinformatics Computer Science Genomics
Predicting gene function is usually formulated as binary classification problem. However, we only know which gene has some function while we are not sure that it doesn't belong to a function class, which means that only positive examples are given. Therefore, selecting a good training example set becomes a key step. In this paper, we cluster the genes on integrated weighted graph by generalizing the cluster coefficient of unweighted graph to weighted one, and identify the reliable negative samples based on distance between a gene and centroid of positive clusters. Then, the tri-training algorithm is used to learn three classifiers from labeled and unlabeled examples to predict the gene function by combining three prediction result. The experiment results show that our approach outperforms several classic prediction methods.

Metrics

9 Record Views

Details

UN Sustainable Development Goals (SDGs)

This publication has contributed to the advancement of the following goals:

#3 Good Health and Well-Being

InCites Highlights

Data related to this publication, from InCites Benchmarking & Analytics tool:

Collaboration types
Domestic collaboration
International collaboration
Web of Science research areas
Computer Science, Artificial Intelligence
Engineering, Electrical & Electronic
Logo image