Logo image
Random Forest in Splice Site Prediction of Human Genome
Book chapter

Random Forest in Splice Site Prediction of Human Genome

Elham Pashaei, Mustafa Ozen, Nizamettin Aydin and Dov Jaron
XIV Mediterranean Conference on Medical and Biological Engineering and Computing 2016, pp 518-523
17 Sep 2016

Abstract

Feature ranking Random forest Splice site prediction
With the rapid growth of huge amounts of DNA sequence, genes identification has become an important task in bioinformatics. To detect genes, it is important to accurately predict splice sites, i.e. exon intron boundaries. Moreover, in biology where structures are described by a large number of features as splice sites, the feature selection is an important step toward the classification task. It provides useful biological knowledge and allows for a faster and better classification. Feature selection techniques can be divided into two groups: feature-ranking and feature-subset selection. This paper investigates the performance of combining support vector machine (SVM) with two different feature ranking methods, namely F-score and Random Forest feature ranking competitively in splice site detection of Human genome. Also a new classification method based on Random Forest for splice site prediction is presented.

Metrics

17 Record Views

Details

UN Sustainable Development Goals (SDGs)

This publication has contributed to the advancement of the following goals:

#3 Good Health and Well-Being

InCites Highlights

Data related to this publication, from InCites Benchmarking & Analytics tool:

Web of Science research areas
Engineering, Biomedical
Materials Science, Biomaterials
Logo image