Abstract
The real purpose of collecting big data is to identify causality in the hope that this will facilitate credible predictivity. But the search for causality can trap one into infinite regress, and thus one takes refuge in seeking associations between variables in data sets. Regrettably, the mere knowledge of associations does not enable predictivity. Associations need to be embedded within the framework of the probability calculus to make coherent predictions. This is so because associations are a feature of probability models, and hence they do not exist outside the framework of a model. Measures of association, like correlation, regression, and mutual information merely refute a preconceived model. Estimated measures of associations do not lead to a probability model; a model is the product of pure thought. This paper discusses these and other fundamentals that are germane to seeking associations in particular, and machine learning in general.
| Original language | English |
|---|---|
| Pages (from-to) | 116-121 |
| Journal | International Journal of Management Science and Engineering Management |
| Volume | 11 |
| Issue number | 2 |
| Online published | 16 Feb 2016 |
| DOIs | |
| Publication status | Published - 2016 |
Research Keywords
- Association
- Bayesian
- Big data
- Correlation
- Dependence
Fingerprint
Dive into the research topics of 'Seeking relationships in big data: A bayesian perspective'. Together they form a unique fingerprint.Projects
- 1 Finished
-
GRF: Mathematical Methods in Reliability: Diagnostics, Maintenance, and Survivability
SINGPURWALLA, N. D. (Principal Investigator / Project Coordinator)
1/01/15 → 12/12/18
Project: Research
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver