Fast Exploring Literature by Language Machine Learning for Perovskite Solar Cell Materials Design

Lei Zhang, Yiru Huang, Leiming Yan, Jinghao Ge, Xiaokang Ma, Zhike Liu, Jiaxue You*, Alex K. Y. Jen*, Shengzhong Frank Liu*

*Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

1 Citation (Scopus)
39 Downloads (CityUHK Scholars)

Abstract

Making computers automatically extract latent scientific knowledge from literature is highly desired for future materials and chemical research in the artificial intelligence era. Herein, the natural language processing (NLP)-based machine learning technique to build language models and automatically extract hidden information regarding perovskite solar cell (PSC) materials from 29 060 publications is employed. The concept that there are light-absorbing materials, electron-transporting materials, and hole-transporting materials in PSCs is successfully learned by the NLP-based machine learning model without a time-consuming human expert training process. The NLP model highlights a hole-transporting material that receives insufficient attention in the literature, which is then elaborated via density functional theory calculations to provide an atomistic view of the perovskite/hole-transporting layer heterostructures and their optoelectronic properties. Finally, the above results are confirmed by device experiments. The present study demonstrates the viability of NLP as a universal machine learning tool to extract useful information from existing publications. © 2024 The Authors. Advanced Intelligent Systems published by Wiley-VCH GmbH.
Original languageEnglish
Article number2300678
JournalAdvanced Intelligent Systems
Volume6
Issue number6
Online published12 May 2024
DOIs
Publication statusPublished - Jun 2024

Funding

This work was supported by the Jiangsu Provincial Science and Technology Project Basic Research Program (Natural Science Foundation of Jiangsu Province) (No. BK20211283). The authors acknowledge computational support from NSCCSZ Shenzhen, China. X.M. thanks the National Natural Science Foundation of China (No. 52202116). A.K.Y.J. thanks the sponsorship of the Lee Shau-Kee Chair Professor (Materials Science), and support from the APRC grants (9380086, 9610508) of the City University of Hong Kong, the TCFS grant (GHP/018/20SZ) and MRP grant (MRP/040/21X) from the Innovation and Technology Commission of Hong Kong, the Green Tech Fund (202020164) from the Environment and Ecology Bureau of Hong Kong, the GRF grants (11307621, 11316422) from the Research Grants Council of Hong Kong, Shenzhen Science and Technology Program (SGDX20201103095412040), and Guangdong Major Project of Basic and Applied Basic Research (2019B030302007).

Research Keywords

  • data-driven approach
  • machine learning
  • natural language processing
  • perovskite solar cells

Publisher's Copyright Statement

  • This full text is made available under CC-BY 4.0. https://creativecommons.org/licenses/by/4.0/

Fingerprint

Dive into the research topics of 'Fast Exploring Literature by Language Machine Learning for Perovskite Solar Cell Materials Design'. Together they form a unique fingerprint.

Cite this