Skip to main navigation Skip to search Skip to main content

An end-to-end entity recognition and disambiguation framework for identifying Author Affiliation from literature publications

  • Lianghong Lin
  • , Wenxiu Xie
  • , Zili Chen
  • , Tianyong Hao

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

19 Downloads (CityUHK Scholars)

Abstract

Author affiliation information plays a key role in bibliometric analyses and is essential for evaluating studies. However, as author affiliation information has not been standardized, which leads to difficulties such as synonym ambiguity and incomplete data during automated processing. To address the challenge, this paper proposes an end-to-end entity recognition and disambiguation framework for identifying author affiliation from literature publications. For entity disambiguation, an algorithm combining word embedding and spatial embedding is presented considering that author affiliation texts often contain rich geographic information. The disambiguation algorithm utilizes the semantic information and geographic information, which effectively enhances entity recognition and disambiguation effect. In addition, the proposed framework facilitates the effective utilization of the extensive literature in the PubMed database for comprehensive bibliometric analysis. The experimental results verify the robustness and effectiveness of the algorithm. © 2024 Association for Computational Linguistics.
Original languageEnglish
Title of host publicationProceedings of the Fourth Workshop on Scholarly Document Processing (SDP 2024)
EditorsTirthankar Ghosal, Amanpreet Singh, Anita Waard, Philipp Mayr, Aakanksha Naik, Orion Weller, Yoonjoo Lee, Shannon Shen, Yanxia Qin
PublisherAssociation for Computational Linguistics
Pages120-129
ISBN (Print)979-8-89176-151-3
DOIs
Publication statusPublished - Aug 2024
EventFourth Workshop on Scholarly Document Processing at ACL 2024 - Bangkok, Thailand
Duration: 16 Aug 202416 Aug 2024
https://aclanthology.org/2024.sdp-1
https://sdproc.org/2024/index.html

Publication series

NameSDP - Workshop on Scholarly Document Processing, Proceedings of the Workshop

Conference

ConferenceFourth Workshop on Scholarly Document Processing at ACL 2024
PlaceThailand
CityBangkok
Period16/08/2416/08/24
Internet address

Publisher's Copyright Statement

  • This full text is made available under CC-BY 4.0. https://creativecommons.org/licenses/by/4.0/

Fingerprint

Dive into the research topics of 'An end-to-end entity recognition and disambiguation framework for identifying Author Affiliation from literature publications'. Together they form a unique fingerprint.

Cite this