A dataset of mentorship in bioscience with semantic and demographic estimations

Research output: Journal Publications and Reviews (RGC: 21, 22, 62)21_Publication in refereed journalpeer-review

View graph of relations

Author(s)

  • Qing Ke
  • Lizhen Liang
  • Ying Ding
  • Stephen V. David
  • Daniel E. Acuna

Related Research Unit(s)

Detail(s)

Original languageEnglish
Article number467
Journal / PublicationScientific data
Volume9
Online published2 Aug 2022
Publication statusPublished - 2022

Link(s)

Abstract

Mentorship in science is crucial for topic choice, career decisions, and the success of mentees and mentors. Typically, researchers who study mentorship use article co-authorship and doctoral dissertation datasets. However, available datasets of this type focus on narrow selections of fields and miss out on early career and non-publication-related interactions. Here, we describe Mentorship, a crowdsourced dataset of 743176 mentorship relationships among 738989 scientists primarily in biosciences that avoids these shortcomings. Our dataset enriches the Academic Family Tree project by adding publication data from the Microsoft Academic Graph and “semantic” representations of research using deep learning content analysis. Because gender and race have become critical dimensions when analyzing mentorship and disparities in science, we also provide estimations of these factors. We perform extensive validations of the profile–publication matching, semantic content, and demographic inferences, which mostly cover neuroscience and biomedical sciences. We anticipate this dataset will spur the study of mentorship in science and deepen our understanding of its role in scientists’ career outcomes.

Research Area(s)

Citation Format(s)

A dataset of mentorship in bioscience with semantic and demographic estimations. / Ke, Qing; Liang, Lizhen; Ding, Ying et al.
In: Scientific data, Vol. 9, 467, 2022.

Research output: Journal Publications and Reviews (RGC: 21, 22, 62)21_Publication in refereed journalpeer-review

Download Statistics

No data available