Skip to main navigation Skip to search Skip to main content

SemaNet: Bridging Words and Numbers for Predicting Missing Environmental Data in Life Cycle Assessment

  • Bin Chen
  • , Hong Chen
  • , Zhishan Quan
  • , Wei He
  • , Visakan Kadirkamanathan
  • , Jose L. Casamayor
  • , Wei W. Xing*
  • *Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

Abstract

Life Cycle Assessment (LCA) is one of the most used methodologies for evaluating environmental impact, but its effective application is severely limited by missing data, an issue that existing methods for Life Cycle Inventory (LCI) data completion cannot address effectively. This paper proposes a paradigm shift: rather than depending exclusively on numerical correlations, we leverage the extensive contextual information inherent in process descriptions via pretrained language models, establishing a semantic bridge between qualitative descriptions and quantitative environmental flows. Our semantic-based neural network framework, SemaNet, achieves superior performance in predicting missing LCI values, surpassing existing state-of-the-art methods in various evaluation metrics. The results are significant: while existing approaches fail completely under high data sparsity, our method achieves high accuracy even with 100% missing numerical data while reducing computational requirements by 99% through the use of semantic filtering. This new method for LCI data completion significantly reduces the data collection efforts and time for LCA practitioners, making reliable and faster environmental impact assessment feasible, even when primary data does not exist, thus facilitating reliable sustainability assessment across industrial sectors. © 2025 American Chemical Society.
Original languageEnglish
Pages (from-to)21131-21146
Number of pages16
JournalEnvironmental Science and Technology
Volume59
Issue number39
Online published26 Sept 2025
DOIs
Publication statusPublished - 7 Oct 2025
Externally publishedYes

Funding

This research was supported by UK Engineering and Physical Sciences Research Council KE-IAA 193664.

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 9 - Industry, Innovation, and Infrastructure
    SDG 9 Industry, Innovation, and Infrastructure
  2. SDG 12 - Responsible Consumption and Production
    SDG 12 Responsible Consumption and Production
  3. SDG 17 - Partnerships for the Goals
    SDG 17 Partnerships for the Goals

Research Keywords

  • data efficiency
  • life cycle assessment
  • life cycle inventory
  • neural network method
  • semantic learning

Fingerprint

Dive into the research topics of 'SemaNet: Bridging Words and Numbers for Predicting Missing Environmental Data in Life Cycle Assessment'. Together they form a unique fingerprint.

Cite this