Ecological inference refers to the study of individuals using aggregate data and it is used in an impressive number of studies; it is well known, however, that the study of individuals using group data suffers from an ecological fallacy problem (Robinson in Am Sociol Rev 15:351–357, 1950). This paper evaluates the accuracy of two recent methods, the Rosen et al. (Stat Neerl 55:134–156, 2001) and the Greiner and Quinn (J R Stat Soc Ser A (Statistics in Society) 172:67–81, 2009) and the long-standing Goodman’s (Am Sociol Rev 18:663–664, 1953; Am J Sociol 64:610–625, 1959) method designed to estimate all cells of R × C tables simultaneously by employing exclusively aggregate data. To conduct these tests we leverage on extensive electoral data for which the true quantities of interest are known. In particular, we focus on examining the extent to which the confidence intervals provided by the three methods contain the true values. The paper also provides important guidelines regarding the appropriate contexts for employing these models.

An evaluation of the performance and suitability of R × C methods for ecological inference with known true values / Plescia, Carolina; DE SIO, Lorenzo. - In: QUALITY & QUANTITY. - ISSN 0033-5177. - (2017), pp. 1-15. [10.1007/s11135-017-0481-z]

An evaluation of the performance and suitability of R × C methods for ecological inference with known true values

DE SIO, LORENZO
2017

Abstract

Ecological inference refers to the study of individuals using aggregate data and it is used in an impressive number of studies; it is well known, however, that the study of individuals using group data suffers from an ecological fallacy problem (Robinson in Am Sociol Rev 15:351–357, 1950). This paper evaluates the accuracy of two recent methods, the Rosen et al. (Stat Neerl 55:134–156, 2001) and the Greiner and Quinn (J R Stat Soc Ser A (Statistics in Society) 172:67–81, 2009) and the long-standing Goodman’s (Am Sociol Rev 18:663–664, 1953; Am J Sociol 64:610–625, 1959) method designed to estimate all cells of R × C tables simultaneously by employing exclusively aggregate data. To conduct these tests we leverage on extensive electoral data for which the true quantities of interest are known. In particular, we focus on examining the extent to which the confidence intervals provided by the three methods contain the true values. The paper also provides important guidelines regarding the appropriate contexts for employing these models.
Ecological inference, Split-ticket voting, R × C contingency tables, Aggregate data
An evaluation of the performance and suitability of R × C methods for ecological inference with known true values / Plescia, Carolina; DE SIO, Lorenzo. - In: QUALITY & QUANTITY. - ISSN 0033-5177. - (2017), pp. 1-15. [10.1007/s11135-017-0481-z]
File in questo prodotto:
File Dimensione Formato  
2017 Quality and Quantity.pdf

Open Access

Descrizione: Article published online before print
Tipologia: Versione dell'editore
Licenza: Tutti i diritti riservati
Dimensione 462.32 kB
Formato Adobe PDF
462.32 kB Adobe PDF Visualizza/Apri
manuscript_FINAL_postpub.pdf

Solo gestori archivio

Descrizione: Author's post-print
Tipologia: Documento in Post-print
Licenza: Tutti i diritti riservati
Dimensione 563.01 kB
Formato Adobe PDF
563.01 kB Adobe PDF   Visualizza/Apri
de-sio-oa-quality-quantity.pdf

Solo gestori archivio

Descrizione: Sito editore Accesso aperto
Tipologia: Altro materiale allegato
Licenza: Tutti i diritti riservati
Dimensione 150.93 kB
Formato Adobe PDF
150.93 kB Adobe PDF   Visualizza/Apri
Pubblicazioni consigliate

Caricamento pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: http://hdl.handle.net/11385/172296
Citazioni
  • Scopus 9
  • ???jsp.display-item.citation.isi??? 5
social impact