Logo image
AutoMSR: Auto Molecular Structure Representation Learning for Multi-label Metabolic Pathway Prediction
Journal article

AutoMSR: Auto Molecular Structure Representation Learning for Multi-label Metabolic Pathway Prediction

Jiamin Chen, Jianliang Gao, Tengfei Lyu, Babatounde Moctard Oloulade, Xiaohua Hu and Jie Chen
IEEE/ACM transactions on computational biology and bioinformatics, v PP
2022

Abstract

Architecture Computer architecture Drugs Feature extraction Graph Neural Architecture Search Graph Neural Network Graph neural networks Metabolic Pathway Multi-label Prediction Prediction algorithms Task analysis
It is significant to comprehend the relationship between metabolic pathway and molecular pathway for synthesizing new molecules, for instance optimizing drug metabolization. In bioinformatics fields, multi-label prediction of metabolic pathways is a typical manner to understand this relationship. Graph neural networks (GNNs) have become an effective method to extract molecular structure's features for multi-label prediction of metabolic pathways. Though GNNs can effectively capture structural features from molecular structure graphs, building a well-performed GNN model for a given molecular structure data set requires the manual design of the GNN architecture and fine-tuning of the hyperparameters, which are time-consuming and rely on expert experience. To address the above challenge, we design an end-to-end automatic molecular structure representation learning framework named AutoMSR that can design the optimal GNN model based on a given molecular structure data set without manual intervention. We propose a multi-seed age evolution (MSAE) search algorithm to identify the optimal GNN architecture from the GNN architecture subspace. For a given molecular structure data set, AutoMSR first uses MSAE to search the GNN architecture, and then it adopts a tree-structured parzen estimator to obtain the best hyperparameters in the hyperparameters subspace. Finally, AutoMSR automatically constructs the optimal GNN model based on the best GNN architecture and hyperparameters to extract the molecular structure features for multi-label metabolic pathway prediction. We test the performance of AutoMSR on the real data set KEGG. The experiment results show that AutoMSR outperforms baseline methods on different multi-label classification evaluation metrics.

Metrics

13 Record Views
7 citations in Scopus

Details

Logo image