Here we provide a detailed description of the genome-wide information available on the National Longitudinal Study of Adolescent to Adult Health (Add Health) sibling pair subsample (Harris et al. in Twin Res Hum Genet 16:391–398, 2013). A total of 2,020 samples were genotyped (including duplicates) arising from 1946 Add Health individuals from the sibling pairs subsample. After various steps for quality control (QC) and quality assurance (QA), we have high quality genome-wide data available on 1,888 individuals. In this report, we first highlight the QC and QA steps that were taken to prune the data of poorly performing samples and genetic markers. We further estimate the pairwise biological relationships using genome-wide data and compare those estimates to the assumed relationships in Add Health. Additionally, using genome-wide data from known regional reference populations from Europe, West Africa, North and South America, Japan and China, we estimate the relative genetic ancestry of the respondents. Finally, rather than conducting a traditional cross-sectional genome-wide association study (GWAS) of body mass index (BMI), we opted to utilize the extensive publicly available genome-wide information to conduct a weighted GWAS of longitudinal BMI while accounting for both family and ethnic variation.
The National Longitudinal Study of Adolescent Health (Add Health) Sibling Pairs Genome-Wide Data
Year of Publication:2014
(2014). The National Longitudinal Study of Adolescent Health (Add Health) Sibling Pairs Genome-Wide Data. Behavior Genetics, 45(1), 12-23.