The sociologist Paul T. von Hippel has written a great article outlining how to deal with missing values when the Y's are also missing. Typically the gold standard for dealing with missing data has been multiple imputation, but he advocates multiple imputation with deletion (MID): that is, you use all cases for multiple imputation, but after imputing you delete those cases with imputed Y values. Somewhat surprisingly (because of the reduced sample size after excluding those cases with imputed Y values), MID usually leads to smaller standard errors; moreover, since the Y's are excluded from the analysis, MID is robust to problems with the imputation model. Check out von Hippel's informative paper here.
Blog Archive
-
▼
2009
(18)
-
▼
December
(18)
- R in the NYT
- Top Ten Must-Have R Packages for Social Scientists
- Multilevel and Longitudinal Modeling in Stata
- Sociology = Hedge Fund?
- A Quantitative Tour of the Social Sciences
- Creating Summated Scales
- Simpson's Paradox Strikes Again
- The Relative Size of Things
- Multiple Imputation with Deletion
- The Paradox of Choice
- Abandoned Sociology
- LaTeX or MS Word?
- Why You Have No Friends
- The Language of Economists
- A Neat Mathematical Trick
- Do Social Networks Affect Health?
- Economists > Political Scientists > Sociologists?
- An Extraordinarily Useful Command
-
▼
December
(18)