Private Data Analytics on Biomedical Sensing Data via Distributed Computation

Research output: Journal Publications and Reviews > RGC 21 - Publication in refereed journal > peer-review

37 Scopus Citations

Author(s)

Detail(s)

Original language: English
Article number: 7374672
Pages (from-to): 431-444
Journal / Publication: IEEE/ACM Transactions on Computational Biology and Bioinformatics
Volume: 13
Issue number: 3
Publication status: Published - 1 May 2016
Externally published: Yes

Abstract

Advances in biomedical sensors and mobile communication technologies have fostered the rapid growth of mobile health (mHealth) applications in recent years. Users generate a high volume of biomedical data during health monitoring, which the mHealth server can use to train predictive models for disease diagnosis and treatment. However, biomedical sensing data raise serious privacy concerns because they reveal sensitive information such as the health status and lifestyles of the sensed subjects. This paper proposes and experimentally studies a scheme that keeps the training samples private while enabling accurate construction of predictive models. We specifically consider logistic regression models, which are widely used for predicting dichotomous outcomes in healthcare, and decompose the logistic regression problem into small subproblems over two types of distributed sensing data, i.e., horizontally partitioned data and vertically partitioned data. Each subproblem is solved using an individual user's private data, so mHealth users can keep their private data locally and upload only (encrypted) intermediate results to the mHealth server for model training. Experimental results based on real datasets show that our scheme is highly efficient and scalable to a large number of mHealth users.
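
To make the horizontally partitioned case concrete, the following is a minimal sketch and not the paper's actual scheme: the abstract does not specify the decomposition or the encryption used, so this example assumes plain gradient averaging and omits encryption entirely. Each simulated user holds its own rows, computes a local gradient, and shares only that gradient with the server. All function names (`local_gradient`, `train`, `make_users`, `predict`) and the synthetic data are hypothetical.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def predict(w, x):
    # Predicted probability that the label is 1 for feature vector x.
    return sigmoid(sum(wj * xj for wj, xj in zip(w, x)))


def local_gradient(w, X, y):
    # Sum of per-sample log-loss gradients over ONE user's private rows.
    # This vector is the only "intermediate result" that leaves the user.
    g = [0.0] * len(w)
    for xi, yi in zip(X, y):
        err = predict(w, xi) - yi
        for j, xj in enumerate(xi):
            g[j] += err * xj
    return g


def train(users, dim, lr=0.5, rounds=300):
    # Server loop (assumed): add up the users' gradients and take a step.
    # A real deployment would protect the uploads (e.g. by encryption).
    w = [0.0] * dim
    n = sum(len(y) for _, y in users)
    for _ in range(rounds):
        total = [0.0] * dim
        for X, y in users:
            g = local_gradient(w, X, y)  # runs on the user's device
            total = [t + gj for t, gj in zip(total, g)]
        w = [wj - lr * t / n for wj, t in zip(w, total)]
    return w


def make_users(num_users=4, rows=50, seed=0):
    # Synthetic, linearly separable data split row-wise across users.
    rng = random.Random(seed)
    users = []
    for _ in range(num_users):
        X, y = [], []
        for _ in range(rows):
            x1, x2 = rng.uniform(-1, 1), rng.uniform(-1, 1)
            X.append([1.0, x1, x2])  # leading 1.0 is the bias feature
            y.append(1 if x1 + x2 > 0 else 0)
        users.append((X, y))
    return users


users = make_users()
w = train(users, dim=3)
```

Because the log-loss gradient is a sum over samples, adding the per-user gradients gives the same update as computing the gradient on the pooled data, which is why the rows never need to leave the users. The vertically partitioned case, where users hold different features of the same samples, needs a different decomposition and is not covered by this sketch.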

Research Area(s)

  • logistic regression, mobile health, predictive model training, private data analytics
