Skip to main navigation Skip to search Skip to main content

SLADE: A Smart Large-Scale Task Decomposer in Crowdsourcing

  • Yongxin Tong*
  • , Lei Chen
  • , Zimu Zhou
  • , H. V. Jagadish
  • , Lidan Shou
  • , Weifeng Lv
  • *Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

Abstract

Crowdsourcing has been shown to be effective in a wide range of applications, and is seeing increasing use. A large-scale crowdsourcing task often consists of thousands or millions of atomic tasks, each of which is usually a simple task such as binary choice or simple voting. To distribute a large-scale crowdsourcing task to limited crowd workers, a common practice is to pack a set of atomic tasks into a task bin and send to a crowd worker in a batch. It is challenging to decompose a large-scale crowdsourcing task and execute batches of atomic tasks, which ensures reliable answers at a minimal total cost. Large batches lead to unreliable answers of atomic tasks, while small batches incur unnecessary cost. In this paper, we investigate a general crowdsourcing task decomposition problem, called the Smart Large-scAle task DE composer (SLADE) problem, which aims to decompose a large-scale crowdsourcing task to achieve the desired reliability at a minimal cost. We prove the NP-hardness of the SLADE problem and propose solutions in both homogeneous and heterogeneous scenarios. For the homogeneous SLADE problem, where all the atomic tasks share the same reliability requirement, we propose a greedy heuristic algorithm and an efficient and effective approximation framework using an optimal priority queue (OPQ) structure with provable approximation ratio. For the heterogeneous SLADE problem, where the atomic tasks can have different reliability requirements, we extend the OPQ-based framework leveraging a partition strategy, and also prove its approximation guarantee. Finally, we verify the effectiveness and efficiency of the proposed solutions through extensive experiments on representative crowdsourcing platforms. © 1989-2012 IEEE.
Original languageEnglish
Pages (from-to)1588-1601
JournalIEEE Transactions on Knowledge and Data Engineering
Volume30
Issue number8
DOIs
Publication statusPublished - 1 Aug 2018
Externally publishedYes

Bibliographical note

Publication details (e.g. title, author(s), publication statuses and dates) are captured on an “AS IS” and “AS AVAILABLE” basis at the time of record harvesting from the data source. Suggestions for further amendments or supplementary information can be sent to [email protected].

Funding

The authors are grateful to anonymous reviewers for their constructive comments on this work. Yongxin Tong and Weifeng Lv are supported in part by the National Grand Fundamental Research 973 Program of China under Grant 2015CB358700, NSFC Grant No. 61502021 and 61532004, and SKLSDE (BUAA) Open Program SKLSDE-2016ZX-13. Lei Chen is supported in part by the Hong Kong RGC Project 16202215, Science and Technology Planning Project of Guangdong Province, China, No. 2015B010110006, NSFC Grant No. 61729201, 61232018, Microsoft Research Asia Collaborative Grant, Huawei Grant, and NSFC Guang Dong Grant No. U1301253. H. V. Jagadish is supported in part by NSF Grant IIS-1250880. Lidan Shou is supported in part by NSFC Grant No. 61672455

Research Keywords

  • Crowdsourcing
  • task decomposition

RGC Funding Information

  • RGC-funded

Fingerprint

Dive into the research topics of 'SLADE: A Smart Large-Scale Task Decomposer in Crowdsourcing'. Together they form a unique fingerprint.

Cite this