Relative efficiency of the Fuzzy p-value approach to hypothesis testing

Anthony Y.C. Kuk, Jinfeng Xu

Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review

Abstract

In missing data problems, it is often the case that a natural test statistic would be available for testing a statistical hypothesis had all the data been observed. A fuzzy p-value approach to hypothesis testing has recently been proposed, which is implemented by imputing the missing values in the "complete data" test statistic with values simulated from the conditional null distribution given the observed data. We argue that imputing data in this way will inevitably lead to a loss of power. For the case of a scalar parameter, we show that the asymptotic efficiency of the score test based on the imputed "complete data" relative to the score test based on the observed data is given by the ratio of the observed data information to the complete data information. Three examples involving probit regression, a normal random effects model, and unidentified paired data are used for illustration. For testing linkage disequilibrium based on pooled genotype data, simulation results show that the imputed Neyman-Pearson and Fisher exact tests are less powerful than a Wald-type test based on the observed data maximum likelihood estimator. In conclusion, we caution against the routine use of the fuzzy p-value approach in latent variable or missing data problems and suggest some viable alternatives. © 2009 International Statistical Institute.
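
The efficiency result in the abstract can be illustrated with a toy model that is not taken from the paper and is assumed here purely for illustration: n independent N(theta, 1) values, of which only m are observed, with a one-sided test of theta = 0. Under that assumption the observed data information is m and the complete data information is n, so the stated ratio is m/n. The sketch below computes the noncentrality of the observed data score statistic and of the score statistic whose n - m missing values are imputed from N(0, 1), the null distribution (the values are independent, so conditioning on the observed data changes nothing). It treats the imputed statistic as a single-imputation randomized test, which is a simplification of the fuzzy p-value. All function names are hypothetical.

```python
from statistics import NormalDist

STD_NORMAL = NormalDist()  # standard normal, stdlib only


def noncentrality_observed(theta, m):
    # Observed-data score statistic: sum(x_obs) / sqrt(m) ~ N(sqrt(m) * theta, 1).
    return (m ** 0.5) * theta


def noncentrality_imputed(theta, m, n):
    # Imputed complete-data score statistic: (sum(x_obs) + sum(z)) / sqrt(n),
    # where the n - m imputed z values are N(0, 1) draws made under H0.
    # The imputed values add variance but no signal, so the statistic is
    # N(m * theta / sqrt(n), 1).
    return m * theta / (n ** 0.5)


def power_one_sided(noncentrality, alpha=0.05):
    # Power of a one-sided level-alpha z-test whose statistic is
    # N(noncentrality, 1) under the alternative.
    z_crit = STD_NORMAL.inv_cdf(1 - alpha)
    return 1 - STD_NORMAL.cdf(z_crit - noncentrality)


def relative_efficiency(m, n):
    # Pitman-type efficiency: squared ratio of the two noncentralities.
    # In this toy model it reduces to m / n.
    ratio = noncentrality_imputed(1.0, m, n) / noncentrality_observed(1.0, m)
    return ratio ** 2


if __name__ == "__main__":
    theta, m, n = 0.3, 50, 100
    print("relative efficiency:", relative_efficiency(m, n))
    print("power, observed data test:", power_one_sided(noncentrality_observed(theta, m)))
    print("power, imputed data test:", power_one_sided(noncentrality_imputed(theta, m, n)))
```

In this sketch the imputed test is never more powerful than the observed data test for theta > 0, and the gap widens as the fraction of missing values grows, which is consistent with the loss of power described in the abstract.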
Original language: English
Pages (from-to): 395-404
Journal: International Statistical Review
Volume: 77
Issue number: 3
Publication status: Published - Dec 2009
Externally published: Yes

Research Keywords

  • Complete data
  • Data cloning
  • Imputation
  • Latent variables
  • Locally most powerful test
  • Missing data
  • Pitman efficiency
  • Randomized test
