Journal article
LM-GVP: an extensible sequence and structure informed deep learning framework for protein property prediction
Scientific reports, v 12(1), pp 6832-6832
27 Apr 2022
PMID: 35477726
Featured in Collection : UN Sustainable Development Goals @ Drexel
Abstract
Proteins perform many essential functions in biological systems and can be successfully developed as bio-therapeutics. It is invaluable to be able to predict their properties based on a proposed sequence and structure. In this study, we developed a novel generalizable deep learning framework, LM-GVP, composed of a protein Language Model (LM) and Graph Neural Network (GNN) to leverage information from both 1D amino acid sequences and 3D structures of proteins. Our approach outperformed the state-of-the-art protein LMs on a variety of property prediction tasks including fluorescence, protease stability, and protein functions from Gene Ontology (GO). We also illustrated insights into how a GNN prediction head can inform the fine-tuning of protein LMs to better leverage structural information. We envision that our deep learning framework will be generalizable to many protein property prediction problems to greatly accelerate protein engineering and drug development.
Metrics
Details
- Title
- LM-GVP: an extensible sequence and structure informed deep learning framework for protein property prediction
- Creators
- Zichen Wang - Amazon (United States)Steven A Combs - Johnson & Johnson (United States)Ryan Brand - Amazon (United States)Miguel Romero Calvo - Amazon (United States)Panpan Xu - Amazon (United States)George Price - Amazon (United States)Nataliya Golovach - Johnson & Johnson (United States)Emmanuel O Salawu - Amazon (United States)Colby J Wise - Amazon (United States)Sri Priya Ponnapalli - Amazon (United States)Peter M Clark - Johnson & Johnson (United States)
- Publication Details
- Scientific reports, v 12(1), pp 6832-6832
- Publisher
- Springer Nature
- Resource Type
- Journal article
- Language
- English
- Academic Unit
- School of Biomedical Engineering, Science, and Health Systems; Drexel University
- Web of Science ID
- WOS:000788639400056
- Scopus ID
- 2-s2.0-85128972187
- Other Identifier
- 991019356496104721
UN Sustainable Development Goals (SDGs)
This publication has contributed to the advancement of the following goals:
InCites Highlights
Data related to this publication, from InCites Benchmarking & Analytics tool:
- Collaboration types
- Industry collaboration
- Domestic collaboration
- Web of Science research areas
- Biochemistry & Molecular Biology