Unweighted regression models perform better than weighted regression techniques for respondent-driven sampling data: results from a simulation study

dc.contributor.authorAvery, Lisa
dc.contributor.authorRotondi, Nooshin
dc.contributor.authorMcKnight, Constance
dc.contributor.authorFirestone, Michelle
dc.contributor.authorSmylie, Janet
dc.contributor.authorRotondi, Michael
dc.date.accessioned2020-03-09T21:03:17Z
dc.date.available2020-03-09T21:03:17Z
dc.date.issued2019-10-29
dc.description.abstractBackground: It is unclear whether weighted or unweighted regression is preferred in the analysis of data derived from respondent driven sampling. Our objective was to evaluate the validity of various regression models, with and without weights and with various controls for clustering in the estimation of the risk of group membership from data collected using respondent-driven sampling (RDS). Methods: Twelve networked populations, with varying levels of homophily and prevalence, based on a known distribution of a continuous predictor were simulated using 1000 RDS samples from each population. Weighted and unweighted binomial and Poisson general linear models, with and without various clustering controls and standard error adjustments were modelled for each sample and evaluated with respect to validity, bias and coverage rate. Population prevalence was also estimated. Results: In the regression analysis, the unweighted log-link (Poisson) models maintained the nominal type-I error rate across all populations. Bias was substantial and type-I error rates unacceptably high for weighted binomial regression. Coverage rates for the estimation of prevalence were highest using RDS-weighted logistic regression, except at low prevalence (10%) where unweighted models are recommended. Conclusions: Caution is warranted when undertaking regression analysis of RDS data. Even when reported degree is accurate, low reported degree can unduly influence regression estimates. Unweighted Poisson regression is therefore recommended.en_US
dc.description.sponsorshipYork University Librariesen_US
dc.identifier.citationBMC Medical Research Methodology 19 (2019): 202.en_US
dc.identifier.urihttps://doi.org/10.1186/s12874-019-0842-5en_US
dc.identifier.urihttps://hdl.handle.net/10315/37084
dc.language.isoenen_US
dc.publisherBiomed Centralen_US
dc.rightsAttribution 2.5 Canada*
dc.rights.articlehttps://bmcmedresmethodol.biomedcentral.com/articles/10.1186/s12874-019-0842-5en_US
dc.rights.journalhttps://bmcmedresmethodol.biomedcentral.com/en_US
dc.rights.publisherhttps://www.biomedcentral.com/en_US
dc.rights.urihttp://creativecommons.org/licenses/by/2.5/ca/*
dc.titleUnweighted regression models perform better than weighted regression techniques for respondent-driven sampling data: results from a simulation studyen_US
dc.typeArticleen_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
s12874-019-0842-5.pdf
Size:
1.21 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.83 KB
Format:
Item-specific license agreed upon to submission
Description: