Efficient and Fair Data Valuation for Horizontal Federated Learning

Shuyue Wei, Yongxin Tong*, Zimu Zhou, Tianshu Song

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 12 - Chapter in an edited book (Author)peer-review

59 Citations (Scopus)

Abstract

Availability of big data is crucial for modern machine learning applications and services. Federated learning is an emerging paradigm to unite different data owners for machine learning on massive data sets without worrying about data privacy. Yet data owners may still be reluctant to contribute unless their data sets are fairly valuated and paid. In this work, we adapt Shapley value, a widely used data valuation metric to valuating data providers in federated learning. Prior data valuation schemes for machine learning incur high computation cost because they require training of extra models on all data set combinations. For efficient data valuation, we approximately construct all the models necessary for data valuation using the gradients in training a single model, rather than train an exponential number of models from scratch. On this basis, we devise three methods for efficient contribution index estimation. Evaluations show that our methods accurately approximate the contribution index while notably accelerating its calculation. © 2020, Springer Nature Switzerland AG.
Original languageEnglish
Title of host publicationFederated Learning
Subtitle of host publicationPrivacy and Incentive
EditorsQiang Yang, Lixin Fan, Han Yu
Place of PublicationCham
PublisherSpringer 
Pages139-152
ISBN (Electronic)978-3-030-63076-8
ISBN (Print)978-3-030-63075-1
DOIs
Publication statusPublished - 2020
Externally publishedYes

Publication series

NameLecture Notes in Computer Science
Volume12500
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Research Keywords

  • Data valuation
  • Federated learning
  • Incentive mechanism
  • Shapley value

Fingerprint

Dive into the research topics of 'Efficient and Fair Data Valuation for Horizontal Federated Learning'. Together they form a unique fingerprint.

Cite this