作者
Huwenbo Shi, Bogdan Pasaniuc, Kenneth L Lange
发表日期
2015/11/1
期刊
Bioinformatics
卷号
31
期号
21
页码范围
3514-3521
出版商
Oxford University Press
简介
Motivation: Haplotype models enjoy a wide range of applications in population inference and disease gene discovery. The hidden Markov models traditionally used for haplotypes are hindered by the dubious assumption that dependencies occur only between consecutive pairs of variants. In this article, we apply the multivariate Bernoulli (MVB) distribution to model haplotype data. The MVB distribution relies on interactions among all sets of variants, thus allowing for the detection and exploitation of long-range and higher-order interactions. We discuss penalized estimation and present an efficient algorithm for fitting sparse versions of the MVB distribution to haplotype data. Finally, we showcase the benefits of the MVB model in predicting DNaseI hypersensitivity (DH) status—an epigenetic mark describing chromatin accessibility—from population-scale haplotype data.
Results: We fit the MVB …
引用总数
2018201920202021111