Protocol for the automatic extraction of epidemiological information via a pre-trained language model

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

View graph of relations

Author(s)

  • Zhizheng Wang
  • Zhanwei Du
  • Lin Wang
  • Ye Wu
  • Petter Holme
  • Michael Lachmann
  • Hongfei Lin
  • Zhuoyue Wang
  • Yu Cao
  • Zoie S.Y. Wong
  • Xiao-Ke Xu
  • Yuanyuan Sun

Related Research Unit(s)

Detail(s)

Original languageEnglish
Article number102392
Journal / PublicationSTAR Protocols
Volume4
Issue number3
Online published1 Jul 2023
Publication statusPublished - 15 Sept 2023

Link(s)

Abstract

The lack of systems to automatically extract epidemiological fields from open-access COVID-19 cases restricts the timeliness of formulating prevention measures. Here we present a protocol for using CCIE, a COVID-19 Cases Information Extraction system based on the pre-trained language model.1 We describe steps for preparing supervised training data and executing python scripts for named entity recognition and text category classification. We then detail the use of machine evaluation and manual validation to illustrate the effectiveness of CCIE. For complete details on the use and execution of this protocol, please refer to Wang et al.2 © 2023 The Authors.

Research Area(s)

  • Clinical Protocol, Computer Sciences, Health Sciences

Citation Format(s)

Protocol for the automatic extraction of epidemiological information via a pre-trained language model. / Wang, Zhizheng; Liu, Xiao Fan; Du, Zhanwei et al.
In: STAR Protocols, Vol. 4, No. 3, 102392, 15.09.2023.

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

Download Statistics

No data available