Mixture of Truthful–Untruthful Responses in Public Surveys
Date
2019ISSN
1751-5823Source
International Statistical ReviewVolume
87Issue
3Pages
557-579Google Scholar check
Metadata
Show full item recordAbstract
When sensitive issues are surveyed, collecting truthful data and obtaining reliable estimates of population parameters is a persistent problem in many fields of applied research mostly in sociological, economic, demographic, ecological and medical studies. In this context, and moving from the so-called negative survey, we consider the problem of estimating the proportion of population units belonging to the categories of a sensitive variable when collected data are affected by measurement errors produced by untruthful responses. An extension of the negative survey approach is proposed herein in order to allow respondents to release a true response. The proposal rests on modelling the released data with a mixture of truthful and untruthful responses that allows researchers to obtain an estimate of the proportions as well as the probability of receiving the true response by implementing the EM-algorithm. We describe the estimation procedure and carry out a simulation study to assess the performance of the EM estimates vis-à-vis certain benchmark values and the estimates obtained under the traditional data-collection approach based on direct questioning that ignores the presence of misreporting due to untruthful responding. Simulation findings provide evidence on the accuracy of the estimates and permit us to appreciate the improvements that our approach can produce in public surveys, particularly in election opinion polls, when the hidden vote problem is present.