Unraveling the Linkages between Molecular Abundance and Stable Carbon Isotope Ratio in Dissolved Organic Matter Using Machine Learning

Yuanbi Yi, Tongcun Liu*, Julian Merder, Chen He, Hongyan Bao, Penghui Li, Siliang Li, Quan Shi, Ding He*

*Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

25 Citations (Scopus)

Abstract

Dissolved organic matter (DOM) is a complex mixture of molecules that constitutes one of the largest reservoirs of organic matter on Earth. While stable carbon isotope values (δ13C) provide valuable insights into DOM transformations from land to ocean, it remains unclear how individual molecules respond to changes in DOM properties such as δ13C. To address this, we employed Fourier transform ion cyclotron resonance mass spectrometry (FT-ICR MS) to characterize the molecular composition of DOM in 510 samples from the China Coastal Environments, with 320 samples having δ13C measurements. Utilizing a machine learning model based on 5199 molecular formulas, we predicted δ13C values with a mean absolute error (MAE) of 0.30‰ on the training data set, surpassing traditional linear regression methods (MAE 0.85‰). Our findings suggest that degradation processes, microbial activities, and primary production regulate DOM from rivers to the ocean continuum. Additionally, the machine learning model accurately predicted δ13C values in samples without known δ13C values and in other published data sets, reflecting the δ13C trend along the land to ocean continuum. This study demonstrates the potential of machine learning to capture the complex relationships between DOM composition and bulk parameters, particularly with larger learning data sets and increasing molecular research in the future. © 2023 American Chemical Society.
Original languageEnglish
Pages (from-to)17900–17909
JournalEnvironmental Science and Technology
Volume57
Issue number46
Online published20 Apr 2023
DOIs
Publication statusPublished - 21 Nov 2023
Externally publishedYes

Funding

This work was supported by the National Science Foundation of China (42188102, 42222061, 42230509), Research Grants Council of Hong Kong (ECS26300822), Guangdong Basic and Applied Basic Research Foundation (2020A1515110194), Science and Technology Program of Guangzhou (202102021142), and funding support from the Center for Ocean Research in Hong Kong and Macau (CORE, a joint research center for ocean research between QNLM and HKUST), and the State Key Laboratory of Heavy Oil Processing, China University of Petroleum, Beijing. We thank three anonymous reviewers whose comments considerably improved the manuscript.

Research Keywords

  • DOM
  • FT-ICR MS
  • machine learning
  • stable carbon isotope
  • the China Coastal Environments

RGC Funding Information

  • RGC-funded

Fingerprint

Dive into the research topics of 'Unraveling the Linkages between Molecular Abundance and Stable Carbon Isotope Ratio in Dissolved Organic Matter Using Machine Learning'. Together they form a unique fingerprint.

Cite this