New trend on chemical structure representation learning in toxicology: In reviews of machine learning model methodology

Jiabin Zhang, Lei Zhao, Wei Wang*, De-Feng Xing, Zhen-Xing Wang, Jun Ma, Aijie Wang, Nan-Qi Ren, Duu-Jong Lee, Chuan Chen*

*Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

2 Citations (Scopus)

Abstract

Computer-assisted virtual screening using structure-activity relationship (QSAR) models is a surrogate method for reducing the need for costly animal experiments. However, traditional QSAR models face significant challenges, such as the ‘activity cliff’ phenomenon and small datasets, which limit their ability to generalize and predict toxicity. This review examines transistion of digital encodings form in molecules and its corresponding models, introducing from molecule descriptors to three advanced types of molecular representations based on deep learning techniques. We highlight the importance of deep learning models that can not only capture molecular similarity in chemical space to address the ‘activity cliff’ problem but also improve model performance through feature fusion. As alternative solutions to reduce reliance on feature engineering potentially, graph neural network, convolutional neural network and large lanuage model and their related training paradigm such as transfer learning could give another opportunity for toxicity model setting in terms of data insuffient dealing etc. This work could help potential deep learning modelers to build robust model, setting the stage for groundbreaking advancements in further development and application of toxicity prediction models. © 2025 Taylor & Francis Group, LLC.
Original languageEnglish
JournalCritical Reviews in Environmental Science and Technology
DOIs
Publication statusOnline published - 10 Mar 2025

Research Keywords

  • chemical structure
  • Deep representative learning
  • Peng Gao
  • QSAR
  • Risk management
  • toxicity prediction

Fingerprint

Dive into the research topics of 'New trend on chemical structure representation learning in toxicology: In reviews of machine learning model methodology'. Together they form a unique fingerprint.

Cite this