Journal article
Benchmarking blast accuracy of genus/phyla classification of metagenomic reads
Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing, pp 10-20
2010
PMID: 19908353
Featured in Collection : UN Sustainable Development Goals @ Drexel
Abstract
Metagenomics is the study of environmental samples. Because few tools exist for metagenomic analysis, a natural step has been to utilize the popular homology tool, BLAST, to search for sequence similarity between sample fragments and an administered database. Most biologists use this method today without knowing BLAST's accuracy, especially when a particular taxonomic class is under-represented in the database. The aim of this paper is to benchmark the performance of BLAST for taxonomic classification of metagenomic datasets in a supervised setting; meaning that the database contains microbes of the same class as the 'unknown' query fragments. We examine well- and under-represented genera and phyla in order to study their effect on the accuracy of BLAST. We conclude that on fine-resolution classes, such as genera, the accuracy of BLAST does not degrade very much with under-representation, but in a highly variant class, such as phyla, performance degrades significantly. Our analysis includes five-fold cross validation to substantiate our findings.
Metrics
3 Record Views
Details
- Title
- Benchmarking blast accuracy of genus/phyla classification of metagenomic reads
- Creators
- Steven D Essinger - Electrical & Computer Engineering, Drexel University, 3141 Chestnut Street, Philadelphia, PA 19141, USAGail L Rosen
- Publication Details
- Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing, pp 10-20
- Publisher
- World Scientific Publishing; United States
- Resource Type
- Journal article
- Language
- English
- Academic Unit
- Electrical and Computer Engineering
- Web of Science ID
- WOS:000411479100002
- Scopus ID
- 2-s2.0-84873038229
- Other Identifier
- 991014878447604721
UN Sustainable Development Goals (SDGs)
This publication has contributed to the advancement of the following goals:
InCites Highlights
Data related to this publication, from InCites Benchmarking & Analytics tool:
- Web of Science research areas
- Computer Science, Theory & Methods
- Mathematical & Computational Biology