Constructing and analyzing a synthetic life course cohort based on pooling two data sources: A case study of early adulthood depression symptomatology and late-life cognition
Background: Synthetic cohorts created by combining two cohorts can be useful when no single data set includes both the exposure and outcome data of interest. We estimate the effects of depression in early adulthood on later-life memory outcome using two nationally representative cohorts separately and in a synthetic sample. Methods: We used the National Longitudinal Study of Youth 1979 (NLSY; N=5,747) and the Health and Retirement Study (HRS; N=6,846) and a synthetic cohort combining exposure data from N=5,680 NLSY participants (born 1957-1965) aged 55-63 in 2020 who completed midlife cognitive assessment between 2006-2020 with outcome data from N=9,726 HRS participants born 1957-1964 who co