作者
Jiarui Li, Tomás González Zarzar, Julie D White, Karlijne Indencleef, Hanne Hoskens, Harry Matthews, Nele Nauwelaers, Arslan Zaidi, Ryan J Eller, Noah Herrick, Torsten Günther, Emma M Svensson, Mattias Jakobsson, Susan Walsh, Kristel Van Steen, Mark D Shriver, Peter Claes
发表日期
2020/7/16
期刊
Scientific reports
卷号
10
期号
1
页码范围
11850
出版商
Nature Publishing Group UK
简介
Estimates of individual-level genomic ancestry are routinely used in human genetics, and related fields. The analysis of population structure and genomic ancestry can yield insights in terms of modern and ancient populations, allowing us to address questions regarding admixture, and the numbers and identities of the parental source populations. Unrecognized population structure is also an important confounder to correct for in genome-wide association studies. However, it remains challenging to work with heterogeneous datasets from multiple studies collected by different laboratories with diverse genotyping and imputation protocols. This work presents a new approach and an accompanying open-source toolbox that facilitates a robust integrative analysis for population structure and genomic ancestry estimates for heterogeneous datasets. We show robustness against individual outliers and different protocols …
引用总数