TY - JOUR
T1 - Likelihood identifiability and parameter estimation with nonignorable missing data
AU - ZHENG, Siming
AU - ZHANG, Juan
AU - ZHOU, Yong
PY - 2022/5/27
Y1 - 2022/5/27
N2 - We identify sufficient conditions to resolve the identification problem under nonignorable missingness, especially the identifiability of the observed likelihood when some of the covariate values are missing not at random, or, simultaneously, the response is also missing not at random. It is more difficult to tackle these cases than the nonignorable nonresponse case, and, to the best of our knowledge, the simultaneously missing case has never been discussed before. Under these conditions, we propose some parameter estimation methods. As an illustration, when some of the covariate values are missing not at random, we adopt a semiparametric logistic model with a tilting parameter to model the missingness mechanism and use an imputed estimating equation based on the generalized method of moments to estimate the parameters of interest and the tilting parameter simultaneously. This approach avoids the requirement for other independent surveys or a validation sample to estimate the unknown tilting parameter. The asymptotic properties of our proposed estimators are derived, and the proofs can be modified to show that our methods of estimation, which are based on inverse probability weighting, augmented inverse probability weighting, and estimating equation projection, have the same asymptotic efficiency when the tilting parameter is either known or unknown but estimated by some other method. In simulation studies, we compare our methods with various alternative approaches and find that our methods are more robust and effective.
AB - We identify sufficient conditions to resolve the identification problem under nonignorable missingness, especially the identifiability of the observed likelihood when some of the covariate values are missing not at random, or, simultaneously, the response is also missing not at random. It is more difficult to tackle these cases than the nonignorable nonresponse case, and, to the best of our knowledge, the simultaneously missing case has never been discussed before. Under these conditions, we propose some parameter estimation methods. As an illustration, when some of the covariate values are missing not at random, we adopt a semiparametric logistic model with a tilting parameter to model the missingness mechanism and use an imputed estimating equation based on the generalized method of moments to estimate the parameters of interest and the tilting parameter simultaneously. This approach avoids the requirement for other independent surveys or a validation sample to estimate the unknown tilting parameter. The asymptotic properties of our proposed estimators are derived, and the proofs can be modified to show that our methods of estimation, which are based on inverse probability weighting, augmented inverse probability weighting, and estimating equation projection, have the same asymptotic efficiency when the tilting parameter is either known or unknown but estimated by some other method. In simulation studies, we compare our methods with various alternative approaches and find that our methods are more robust and effective.
KW - Equivalent asymptotic efficiency
KW - exponentially tilting
KW - generalized method of moments
KW - identifiability
KW - nonignorable missingness
KW - ESTIMATING EQUATIONS INFERENCE
KW - GENERALIZED-METHOD
KW - MODELS
KW - REGRESSION
KW - MOMENTS
UR - http://gateway.isiknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=LinksAMR&SrcApp=PARTNER_APP&DestLinkType=FullRecord&DestApp=WOS&KeyUT=000800633000001
U2 - 10.1002/cjs.11704
DO - 10.1002/cjs.11704
M3 - RGC 21 - Publication in refereed journal
SN - 0319-5724
JO - Canadian Journal of Statistics
JF - Canadian Journal of Statistics
ER -