Logo image
Manifold learning reveals nonlinear structure in metagenomic profiles
Conference proceeding

Manifold learning reveals nonlinear structure in metagenomic profiles

Xingpeng Jiang, Xiaohua Hu, Huiyu Shen and Tingting He
2012 IEEE International Conference on Bioinformatics and Biomedicine
Oct 2012

Abstract

Communities Correlation Covariance matrix Environmental factors Isomap Matrix decomposition metagenomic profile non-negative matrix factorization Nonlinear dimension reduction Oceans Principal component analysis principle component analysis
Using metagenomics to detect the global structure of microbial community remains a significant challenge. The structure of a microbial community and its functions are complicated not only because of the complex interactions among microbes but also their complicate interacting with confounding environmental factors. Recently dimension reduction methods such as Principle component analysis, Non-negative matrix factorization and Canonical correlation analysis have been employed extensively to investigate the complex structure embedded in metagenomic profiles which summarize the abundance of functional or taxonomic categorizations in metagenomic studies. However, metagenomic profiles are not necessary to meet the "Assumption of Linearity" behind these methods. Therefore it is worth to investigate how nonlinear methods can be utilized in metagenomic studies. In this paper, a nonlinear manifold learning method- Isomap is used to visualize and analyze large-scale metagenomic profiles. Isomap was applied on a large-scale Pfam profile which are derived from 45 metagenomes in Global Ocean Sampling expedition. In our result, a novel nonlinear structure of protein families is identified and the relationships among the identified nonlinear components and environmental factors of global ocean are explored. The results indicate the strength of nonlinear methods in learning the complex microbial structure. With the coming of the huge number of new sequenced metagenomes, nonlinear methods like Isomap could be necessary complementary tools to current widely used methods.

Metrics

19 Record Views
6 citations in Scopus

Details

Logo image