Methods for observed-cluster inference when cluster size is informative: a review and clarifications.
dc.contributor.author | Seaman, Shaun | en |
dc.contributor.author | Pavlou, Menelaos | en |
dc.contributor.author | Copas, Andrew J | en |
dc.date.accessioned | 2018-06-14T10:45:11Z | |
dc.date.available | 2018-06-14T10:45:11Z | |
dc.date.issued | 2014-06 | en |
dc.identifier.issn | 0006-341X | |
dc.identifier.uri | https://www.repository.cam.ac.uk/handle/1810/277039 | |
dc.description.abstract | Clustered data commonly arise in epidemiology. We assume each cluster member has an outcome Y and covariates X. When there are missing data in Y, the distribution of Y given X in all cluster members ("complete clusters") may be different from the distribution just in members with observed Y ("observed clusters"). Often the former is of interest, but when data are missing because in a fundamental sense Y does not exist (e.g., quality of life for a person who has died), the latter may be more meaningful (quality of life conditional on being alive). Weighted and doubly weighted generalized estimating equations and shared random-effects models have been proposed for observed-cluster inference when cluster size is informative, that is, the distribution of Y given X in observed clusters depends on observed cluster size. We show these methods can be seen as actually giving inference for complete clusters and may not also give observed-cluster inference. This is true even if observed clusters are complete in themselves rather than being the observed part of larger complete clusters: here methods may describe imaginary complete clusters rather than the observed clusters. We show under which conditions shared random-effects models proposed for observed-cluster inference do actually describe members with observed Y. A psoriatic arthritis dataset is used to illustrate the danger of misinterpreting estimates from shared random-effects models. | |
dc.description.sponsorship | SRS is funded by MRC grants U1052 60558 and MC_US_A030_0015, AJC and MP by MRC grant G0600657. | |
dc.language | eng | en |
dc.publisher | Wiley | |
dc.rights | Attribution 4.0 International | * |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | * |
dc.subject | Bridge distribution | en |
dc.subject | Immortal cohort inference | en |
dc.subject | Informative missingness | en |
dc.subject | Missing not at random | en |
dc.subject | Mortal cohort inference | en |
dc.subject | Semi-continuous data | en |
dc.subject | Arthritis, Psoriatic | en |
dc.subject | Biometry | en |
dc.subject | Cluster Analysis | en |
dc.subject | Epidemiologic Methods | en |
dc.subject | Female | en |
dc.subject | Humans | en |
dc.subject | Male | en |
dc.subject | Models, Statistical | en |
dc.title | Methods for observed-cluster inference when cluster size is informative: a review and clarifications. | en |
dc.type | Article | |
prism.endingPage | 456 | |
prism.issueIdentifier | 2 | en |
prism.publicationDate | 2014 | en |
prism.publicationName | Biometrics | en |
prism.startingPage | 449 | |
prism.volume | 70 | en |
dc.identifier.doi | 10.17863/CAM.24339 | |
dcterms.dateAccepted | 2014-01-01 | en |
rioxxterms.versionofrecord | 10.1111/biom.12151 | en |
rioxxterms.version | VoR | * |
rioxxterms.licenseref.uri | http://creativecommons.org/licenses/by/4.0/ | en |
rioxxterms.licenseref.startdate | 2014-06 | en |
dc.contributor.orcid | Seaman, Shaun [0000-0003-3726-5937] | |
dc.identifier.eissn | 1541-0420 | |
rioxxterms.type | Journal Article/Review | en |
pubs.funder-project-id | MRC (unknown) | |
cam.issuedOnline | 2014-01-30 | en |
cam.orpheus.success | Thu Jan 30 12:58:17 GMT 2020 - Embargo updated | * |
rioxxterms.freetoread.startdate | 2014-06-30 |