Logo image
Machine Learning Toolkits and Frameworks for Materials Design
Journal article - Review   Open access   Peer reviewed

Machine Learning Toolkits and Frameworks for Materials Design

B. Moses Abraham and Yury Gogotsi
Wiley interdisciplinary reviews Computational molecular science, v 16(2), e70067
Mar 2026
Featured in Collection :   Research Supported by Drexel Libraries' OA Programs
url
https://doi.org/10.1002/wcms.70067View
Published, Version of Record (VoR) Open Access via Drexel Libraries Read and Publish Program 2026 Open CC BY V4.0

Abstract

automation frameworks descriptor generation machine learning toolkits materials design
The rapid evolution of machine learning (ML) has advanced materials discovery, providing tools to explore, predict, and design materials with tailored properties. Here we present an overview of emerging ML tools for data-driven materials innovation, including data curation, feature engineering, model development, interpretability, and inverse design. We highlight high-throughput material databases in providing large-scale, DFT-computed datasets, and discuss the importance of descriptor libraries that encode compositional and structural information into machine-readable inputs for model development. Advances in ML architectures, ranging from classical algorithms to graph neural networks, are discussed for their ability to capture complex structure–property relationships. Particular emphasis is given to inverse design frameworks using generative models and optimization strategies to enable property-targeted materials generation. We further explore interpretability and uncertainty quantification techniques that are important for bridging ML predictions with experimental validation. Automation platforms are described as tools for closed-loop, high-throughput discovery pipelines. We outline grand challenges, including data sparsity, model generalizability, and experimental integration. Finally, we summarize future directions that include foundation models pre-trained on broad, multimodal materials data; self-supervised learning strategies to reduce dependence on labeled datasets; ML workflows that embed thermodynamic and symmetry constraints to enhance interpretability; and fully autonomous laboratories that couple ML guidance with robotic synthesis and real-time feedback.

Metrics

Details

UN Sustainable Development Goals (SDGs)

This publication has contributed to the advancement of the following goals:

#7 Affordable and Clean Energy

Source: SDGs in the Output

InCites Highlights

Data related to this publication, from InCites Benchmarking & Analytics tool:

Web of Science research areas
Chemistry, Multidisciplinary
Mathematical & Computational Biology
Logo image