A curated collection of resources on AI-driven polymer materials research, covering design, prediction, characterization, and related technologies and literature across the entire workflow.
Inspired by awesome-python and awesome-python-chemistry.
The retrieval keyword is: (TI=(polymer* OR macromolecul* OR "polymeric material*" OR "high polymer" OR copolymer* OR biopolymer* OR elastomer*) AND PY=(2010-2025)) AND (AB=("machine learning" OR "deep learning" OR "neural network*") AND PY=(2010-2025)).
Access time of Web of Science: July 8, 2025.
-
De Novo Design of Polyimides Leveraging Deep Reinforcement Learning Agent [2025] - Advanced Materials
-
Inverse Design of Block Polymer Materials with Desired Nanoscale Structure and Macroscale Properties [2025] - JACS Au 5, 2810-2824 (2025)
-
AI-guided few-shot inverse design of HDP-mimicking polymers against drug-resistant bacteria [2024]
-
Benchmarking study of deep generative models for inverse polymer design [2025]
-
Glass Transition Temperature Prediction of Polymers via Graph Reinforcement Learning [2024]
-
Predicting glass transition temperatures for structurally diverse polymers [2025]
-
Data-Driven Modeling and Design of Sustainable High Tg Polymers [2025]
-
Interpretable Machine Learning Framework to Predict the Glass Transition Temperature of Polymers [2024]
-
Machine learning discovery of high-temperature polymers [2021]
-
Recent Progress and Future Prospects on All-Organic Polymer Dielectrics for Energy Storage Capacitors [2022] - Review
-
Dielectric Polymer Property Prediction Using Recurrent Neural Networks with Optimizations - RNN, Journal of Chemical Information and Modeling 61, 2175-2186 (2021)
-
AI-assisted discovery of high-temperature dielectrics for energy storage [2024]
-
High-temperature polymer composite capacitors with high energy density designed via machine learning [2025] - Nature Energy (2025), EGNN
-
Machine learning enables interpretable discovery of innovative polymers for gas separation membranes [2022]
-
Machine learning-guided discovery of polymer membranes for CO₂ separation with genetic algorithm [2024]
-
Machine learning for the advancement of membrane science and technology: A critical review [2025]
-
Machine Learning for Polymer Design to Enhance Pervaporation-Based Organic Recovery [2024]
-
Data-driven predictions of complex organic mixture permeation in polymer membranes [2023]
-
De novo design of polymer electrolytes using GPT-based and diffusion-based generative models [2024] - GPT & Diffusion model
-
A materials discovery framework based on conditional generative models applied to the design of polymer electrolytes [2025] - minGPT
-
Directed Message Passing Neural Networks for Accurate Prediction of Polymer-Solvent Interaction Parameters [2025] - DMPNN
-
Polymer Solubility Prediction Using Large Language Models [2025] - LLM, GPT3.5
-
Data-Driven Prediction of Flory−Huggins Parameter for Quantifying Polymer−Solvent Interaction [2025]
-
Predicting homopolymer and copolymer solubility through machine learning [2025]
-
Cloud point prediction model for polyvinyl alcohol production plants considering process dynamics [2024]
-
Integrating theory with machine learning for predicting polymer solution phase behavior [2023]
-
Predicting Multi‐Component Phase Equilibria of Polymers using Approximations to Flory–Huggins Theory [2023]
-
A Machine Learning Study of Polymer-Solvent Interactions [2022]
-
Solvent selection for polymers enabled by generalized chemical fingerprinting and machine learning [2022]
-
A review of polymer dissolution [2003] - Review
-
Characterization of Polymer-Solvent Interactions and Their Temperature Dependence Using Inverse Gas Chromatography [1994] - Dataset
-
Solubility parameters [1975] - Review
-
Application of machine learning in polyimide structure design and property regulation [2025]
-
Vertical Model for Polyimide Design Assisted by Knowledge-fused Large Language Models [2025]
-
Heat-Resistant Polymer Discovery by Utilizing Interpretable Graph Neural Network with Small Data [2024]
-
High-temperature energy storage polyimide dielectric materials: polymer multiple-structure design [2023]
-
Design of polyimides with targeted glass transition temperature using a graph neural network [2023]
- Radical Reactivity Ratio Predictions for Copolymers with an Interpretable Machine Learning Model - ChemArxiv, 2025
- PVDF-based solid polymer electrolytes for lithium-ion batteries: strategies in composites, blends, dielectric engineering, and machine learning approaches [2025] - RSC Advances 15, 20629-20656 (2025)
-
Refractive index prediction models for polymers using machine learning [2020] - Refractive index
-
Machine-Learning-Enhanced Trial-and-Error for Efficient Optimization of Rubber Composites [2025] - Rubber composite optimization
-
Machine Learning Aided Design of Polymer with Targeted Band Gap Based on DFT Computation [2021] - Band gap
-
Accelerated Scheme to Predict Ring-Opening Polymerization Enthalpy: Simulation-Experimental Data Fusion and Multitask Machine Learning [2023] - Ring-opening polymerization enthalpy
-
Unsupervised learning of sequence-specific aggregation behavior for a model copolymer [2021] - Copolymer aggregation behavior
-
Machine learning for analyzing atomic force microscopy (AFM) images generated from polymer blends [2024]
-
Machine learning for analyses and automation of structural characterization of polymer materials [2024]
-
Design of Tough 3D Printable Elastomers with Human-in-the-Loop Reinforcement Learning [2025]
-
De Novo Design of Polymers with Specified Properties Using Reinforcement Learning [2025]
-
Glass Transition Temperature Prediction of Polymers via Graph Reinforcement Learning [2024]
-
QC-Augmented GNNs for Few-Shot Prediction of Amorphous Polymer Properties [2025]
-
A graph representation of molecular ensembles for polymer property prediction
-
Polymer graph neural networks for multitask property learning [2023]
-
Rationalizing Graph Neural Networks with Data Augmentation - Environmental augmentation GNN, ACM Trans. Knowl. Discov. Data 18, Article 86 (2024)
-
Representing Polymers as Periodic Graphs with Learned Descriptors for Accurate Polymer Property Predictions - Represents polymer as a circular graph by linking monomer repeating unit head and tail
-
Graph neural networks for materials science and chemistry [2022] - Review
-
TransPolymer [2023] - RoBERTa-based polymer characterization model
-
PolyBERT [2023] - Multitask property prediction model
-
PolyNC [2024] - T5-based natural and chemical language model
-
Unified Multimodal Multidomain Polymer Representation for Property Prediction - LLM-based multimodal framework
-
Transferring a Molecular Foundation Model for Polymer Property Predictions [2024] - Transformer-based
-
Harnessing large language models for data-scarce learning of polymer properties [2025] - Supervised pretraining with synthetic data
-
Polymer Solubility Prediction Using Large Language Models [2025] - LLM, GPT3.5
-
Property-guided generation of complex polymer topologies using variational autoencoders [2024] - Variational autoencoder
-
De novo design of polymer electrolytes using GPT-based and diffusion-based generative models [2024] - GPT & Diffusion model
-
Graph-to-String Variational Autoencoder for Synthetic Polymer Design [2023] - NeurIPS 2023, VAE
-
High-temperature polymer composite capacitors with high energy density designed via machine learning [2025] - Nature Energy (2025), EGNN
-
PolyConf: Unlocking Polymer Conformation Generation through Hierarchical Generative Models [2025] - Masked generative model, SE(3)/SO(3) diffusion model
-
A materials discovery framework based on conditional generative models applied to the design of polymer electrolytes [2025] - minGPT
-
A Large Encoder-Decoder Polymer-Based Foundation Model [2024] - Encoder-Decoder
-
Polymer Informatics at Scale with Multitask Graph Neural Networks [2023]
-
Polymer graph neural networks for multitask property learning [2023]
-
Multitask Machine Learning to Predict Polymer–Solvent Miscibility Using Flory–Huggins Interaction Parameters [2023] - Extended Flory-Huggins dataset
-
PolyCL: contrastive learning for polymer representation learning via explicit and implicit augmentations [2025] - Contrastive learning
-
Harnessing large language models for data-scarce learning of polymer properties [2025] - Supervised pretraining with synthetic data
-
Machine Learning Models and Dimensionality Reduction for Prediction of Polymer Properties [2024] - Dimensionality reduction
-
Unsupervised learning of sequence-specific aggregation behavior for a model copolymer [2021]
-
Understanding Polymers Through Transfer Learning and Explainable AI [2024]
-
Transferring a Molecular Foundation Model for Polymer Property Predictions [2024] - Transformer-based
-
[SAScore] - Synthesizability scoring tool
-
Functional monomer design for synthetically accessible polymers [2025]
-
Open Macromolecular Genome: Generative Design of Synthetically Accessible Polymers [2023] - Synthetically accessible polymers with DFT-calculated properties
-
SMiPoly: Generation of a Synthesizable Polymer Virtual Library Using Rule-Based Polymerization Reactions [2023] - Rule-based polymerization reaction-driven synthesizable polymer virtual library
-
Introducing PolySea: An LLM-Based Polymer Smart Evolution Agent [2025] - ChemArxiv
-
Automated Retrosynthesis Planning of Macromolecules Using Large Language Models and Knowledge Graphs [2025]
-
Toward Automated Simulation Research Workflow through LLM Prompt Engineering Design [2025]
-
Agentic Mixture-of-Workflows for Multi-Modal Chemical Search [2025]
-
A physics-enforced neural network to predict polymer melt viscosity [2025]
-
Physics-Informed Neural Networks in Polymers: A Review [2025]
-
Polymer Data Challenges in the AI Era: Bridging Gaps for Next-Generation Energy Materials [2025] - Review of energy materials datasets
-
Estimation of the Flory‐Huggins interaction parameter of polymer‐solvent mixtures using machine learning [2022] - Flory-Huggins parameter dataset
-
Multitask Machine Learning to Predict Polymer–Solvent Miscibility Using Flory–Huggins Interaction Parameters [2023] - Extended Flory-Huggins dataset
-
Polyinfo - Japanese database collecting polymer experimental data (names, structures, preparation methods, properties)
-
PI1M (PI1M: A Benchmark Database for Polymer Informatics) [2020] - RNN-derived dataset from PolyInfo
-
Open Macromolecular Genome: Generative Design of Synthetically Accessible Polymers [2023] - Synthetically accessible polymers with DFT-calculated properties
-
POINT2: A Polymer Informatics Training and Testing Database [2025]
-
SMiPoly: Generation of a Synthesizable Polymer Virtual Library Using Rule-Based Polymerization Reactions [2023] - Rule-based polymerization reaction-driven synthesizable polymer virtual library
-
Polyuniverse: generation of a large-scale polymer library using rule-based polymerization reactions for polymer informatics [2024] - Large-scale polymer library (rule-based polymerization-driven)
-
NIMS polymer database PoLyInfo (I): an overarching view of half a million data points [2024] - NIMS polymer database (>500k data points)
-
NIMS polymer database PoLyInfo (II): machine-readable standardization of polymer knowledge expression [2024] - Machine-readable polymer knowledge standardization
-
Characterization of Polymer-Solvent Interactions and Their Temperature Dependence Using Inverse Gas Chromatography [1994] - Polymer-solvent interaction dataset
-
Creation of Polymer Datasets with Targeted Backbones for Screening of High-Performance Membranes for Gas Separation [2024] - Polymer membrane dataset
-
Polymer genome (Machine-learning predictions of polymer properties with Polymer Genome) [2020] - Hierarchical fingerprints
-
[Uni-mol]
-
Mmpolymer: A multimodal multitask pretraining framework for polymer property prediction - Integrates SMILES and 3D geometry
-
Multimodal machine learning with large language embedding model for polymer property prediction [2025] - Integrates PSMILES, 3D geometry (Uni-Mol), and LLM embeddings
-
PolyMetriX: An Ecosystem for Digital Polymer Chemistry [2025] - Hierarchical featurization
-
Extended-Connectivity Fingerprints (ECFPs) - Topological fingerprints
-
Sizing up feature descriptors for macromolecular machine learning with polymeric biomaterials [2023]
-
Featurization strategies for polymer sequence or composition design by machine learning [2022]
-
Mordred (Mordred: a molecular descriptor calculator) [2018] - Open-source tool for calculating constitutional, topological, geometrical, electronic, and physicochemical descriptors
-
Polymer genome [2020] - Hierarchical fingerprints
-
Multimodal Transformer for Property Prediction in Polymers [2024] - Fuses PSMILES and 2D graphs
-
Multimodal machine learning with large language embedding model for polymer property prediction [2025] - Integrates PSMILES, 3D geometry (Uni-Mol), and LLM embeddings
-
Mmpolymer: A multimodal multitask pretraining framework for polymer property prediction - SMILES, 3D geometry
-
Agentic Mixture-of-Workflows for Multi-Modal Chemical Search [2025]
-
Uni-Poly Unified Multimodal Multidomain Polymer Representation for Property Prediction - SMILES, 2D graphs, 3D geometries, Morgan fingerprints, and polymer domain-specific textual descriptions
-
Representing Polymers as Periodic Graphs with Learned Descriptors for Accurate Polymer Property Predictions [2022] - Circular graph representation of polymers
-
Evaluating Polymer Representations via Quantifying Structure–Property Relationships
-
Sizing up feature descriptors for macromolecular machine learning with polymeric biomaterials [2023]
-
Artificial Intelligence and Multiscale Modeling for Sustainable Biopolymers and Bioinspired Materials [2025] - Advanced Matarials
-
Machine learning in polymer science: A new lens for physical and chemical exploration [2026] - Progress in Materials Science 156, 101544 (2026)
-
Recent Advances in Machine Learning-Assisted Design and Development of Polymer Materials [2025] - Chemistry – A European Journal 31, e202500718 (2025)
-
Basic concepts and tools of artificial intelligence in polymer science [2025] - Polymer Chemistry 16, 2457-2470 (2025)
-
Machine-Learning-Assisted Molecular Design of Innovative Polymers [2025] - Accounts of Materials Research
-
Machine learning in constructing structure–property relationships of polymers [2025]
-
Machine Learning Approaches in Polymer Science: Progress and Fundamental for a New Paradigm [2025]
-
A review on machine learning-guided design of energy materials [2024]
-
Application of Digital Methods in Polymer Science and Engineering [2024]
-
Machine learning applied to the design and optimization of polymeric materials: A review [2025]
-
Data-driven algorithms for inverse design of polymers [2021] - Review
-
Graph neural networks for materials science and chemistry [2022] - Review
-
A review of polymer dissolution [2003] - Review
-
Solubility parameters [1975] - Review
-
Recent Progress and Future Prospects on All-Organic Polymer Dielectrics for Energy Storage Capacitors [2022] - Review
-
Machine learning for the advancement of membrane science and technology: A critical review [2025] - Review
-
A systematic review of recent advances in the application of machine learning in membrane-based gas separation technologies [2025] - Review
-
Physics-Informed Neural Networks in Polymers: A Review [2025] - Review
-
Polymer informatics: Current status and critical next steps [2021]
-
Polymer Genome: A Data-Powered Polymer Informatics Platform for Property Predictions [2018]
-
Emerging Trends in Machine Learning: A Polymer Perspective [2023]
-
Machine learning for polymeric materials: an introduction [2022]
-
Deep Learning for Polymer Discovery [2026] - Springer
-
AI Application Potential and Prospects in Materials Science: A Focus on Polymers [2025]
- DeepChem Polymer Science - Introductory tutorials on polymer informatics
Benchmarking studies play a critical role in evaluating the performance of AI models, polymer representation methods, and property prediction frameworks. They provide standardized datasets, evaluation metrics, and comparative analyses to guide the advancement of AI-driven polymer research.
-
Benchmarking Machine Learning Models for Polymer Informatics: An Example of Glass Transition Temperature [2021] - A foundational benchmark that compares 14 machine learning models (including linear regression, random forest, and neural networks) on predicting Tg using a dataset of 1,168 polymers. Establishes metrics like root-mean-square error (RMSE) and coefficient of determination (R²) for model evaluation.
-
Evaluating Polymer Representations via Quantifying Structure–Property Relationships [2020] - Benchmarks 8 polymer representation methods (e.g., ECFP, Mordred descriptors, polymer genome fingerprints) across 5 property prediction tasks (Tg, density, solubility parameter). Provides guidelines for selecting representations based on data scarcity and property type.
-
Polymer Informatics at Scale with Multitask Graph Neural Networks [2023] - Benchmarks multitask GNNs against single-task models on a large-scale dataset (100k+ polymers) for 8 properties (e.g., dielectric constant, gas permeability). Demonstrates the advantage of multitask learning in leveraging cross-property correlations.
-
POINT2: A Polymer Informatics Training and Testing Database [2025] - Introduces a benchmark database with 25,000+ curated polymer entries and 12 key properties. Includes split strategies for training/validation/testing (temporal split, scaffold split) to avoid data leakage and ensure model generalizability.
-
Benchmarking study of deep generative models for inverse polymer design [2025] - Compares 6 generative models (VAEs, GANs, diffusion models, GPT-based models) on 3 inverse design tasks (Tg targeting, dielectric constant optimization, membrane selectivity). Evaluates models using synthetic accessibility, property accuracy, and chemical diversity metrics.
-
A Large Encoder-Decoder Polymer-Based Foundation Model [2024] - Benchmarks a transformer-based encoder-decoder model against 4 state-of-the-art generative models on polymer sequence generation and property matching. Uses BLEU score (for sequence similarity) and property RMSE as evaluation criteria.
-
Sizing up feature descriptors for macromolecular machine learning with polymeric biomaterials [2023] - Benchmarks 12 descriptor sets (including graph-based, fingerprint-based, and domain-specific descriptors) for predicting 7 biomaterial-related properties (e.g., cell adhesion, biodegradation rate). Identifies descriptor sets optimized for biopolymer applications.
-
MacroSimGNN: Efficient and Accurate Calculation of Macromolecule Pairwise Similarity via Graph Neural Network [2024] - Benchmarks GNN-based similarity metrics against traditional methods (Tanimoto coefficient, Earth Mover’s Distance) for polymer clustering and property prediction. Shows improved performance in capturing structural nuances of complex polymers (e.g., copolymers, branched polymers).
Research groups dedicated to AI-driven polymer science serve as hubs for innovation, collaboration, and knowledge dissemination. Below is a curated list of leading groups (international and domestic) with a focus on advancing AI-polymer integration.
-
Juan J. de Pablo Group (New York University, USA)
Group Website | Google Scholar
Focus: Molecular modeling of polymers, AI-driven design of block copolymers, and multiscale simulation of polymer self-assembly. Key contributions include developing machine learning frameworks for predicting polymer nanoscale structure.
-
Zhen-Gang Wang Group (California Institute of Technology, USA)
Group Website | Google Scholar
Focus: Polymer physics, AI-aided prediction of polymer dynamics, and design of high-performance polymer composites. Known for work on coarse-grained modeling and machine learning for polymer rheology.
-
Rui Wang Group (University of California, Berkeley, USA)
Group Website | Google Scholar
Focus: Polymer simulation, AI-driven optimization of polymer membranes, and data-driven discovery of sustainable polymers. Leads research on integrating molecular dynamics with machine learning for property prediction.
-
Michael A. Webb Group (Princeton University, USA)
Group Website | Google Scholar
Focus: Polymer simulation, machine learning for polymer crystallization, and high-throughput screening of polymeric materials. Contributes to open-source tools for polymer informatics.
-
Nicholas E. Jackson Group (University of Illinois Urbana-Champaign, USA)
Group Website | Google Scholar
Focus: AI for materials discovery, inverse design of polymer dielectrics, and machine learning for polymer synthesis planning. Known for work on physics-informed neural networks in polymer science.
-
Rampi Ramprasad Group (Georgia Institute of Technology, USA)
Group Website | Google Scholar
Focus: Polymer informatics, machine learning for polymer property prediction, and design of energy-efficient polymers. Developed the Polymer Genome database and related AI tools.
-
Dagmar R. D'hooge Group (Ghent University, Belgium)
Group Website | Google Scholar
Focus: Polymer simulation and modeling, AI-driven optimization of polymer processing, and design of biodegradable polymers. Collaborates with industry to translate AI research to practical applications.
-
Tengfei Luo Group (University of Notre Dame, USA)
Group Website | Google Scholar
Focus: AI for polymer design, high-throughput experimentation of polymers, and development of sustainable polymer materials. Leads work on human-in-the-loop reinforcement learning for polymer optimization.
-
Jana M. Weber Group (Delft University of Technology, Netherlands)
Group Website | Google Scholar
Focus: AI for bioscience, machine learning for biopolymer design, and polymer sequence-function relationships. Applies large language models to peptide and protein-polymer hybrid design.
-
Grace X. Gu (UCB, USA)
Group Website | Google Scholar
Focus: AI for biomaterials and 3D printing.
-
Ying Li Group (University of Wisconsin-Madison, USA)
Group Website | Google Scholar
Focus: Integrate machine learning & data science with multiscale/multiphysics modeling to computationally design advanced polymeric materials (for advanced manufacturing, extreme environments, sustainable energy); accelerate materials discovery via AI-driven tools.
-
Shiping Zhu Group (The Chinese University of Hong Kong, Shenzhen)
Group Website | Google Scholar
Focus: Polymer reaction engineering and theory, AI-driven optimization of polymerization processes, and design of functional polymers. Develops machine learning models for predicting polymer molecular weight distribution.
-
Hanyu Gao Group (Hong Kong University of Science and Technology)
Group Website | Google Scholar
Focus: Polymerization kinetic modeling, AI-aided prediction of polymer reaction rates, and design of controlled polymerization systems. Collaborates with chemical companies to optimize industrial polymer synthesis.
-
Yin-Ning Zhou Group (Shanghai Jiao Tong University)
Group Website | Google Scholar
Focus: Polymer simulation and modeling, AI-driven design of polymer composites, and high-temperature polymer materials. Contributes to physics-informed machine learning for polymer thermal properties.
-
Jianfeng Li Group (Fudan University)
Group Website | Google Scholar
Focus: AI for polymer science, inverse design of conjugated polymers, and machine learning for polymer photoelectric properties. Develops generative models for organic solar cell polymer donors.
-
Zhao-Yan Sun Group (Chinese Academy of Sciences, Institute of Chemistry)
Focus: AI for polymer

