March 28, 2013

Refined IBD in Beagle 4

The Beagle page doesn't show version 4 yet, but I'm sure it will eventually turn up there since this paper has just been published.

Genetics doi: 10.1534/genetics.113.150029

Improving the Accuracy and Efficiency of Identity by Descent Detection in Population Data

Brian L. Browning and Sharon R. Browning

Segments of identity by descent (IBD) detected from high-density genetic data are useful for many applications, including long-range phase determination, phasing family data, imputation, IBD mapping and heritability analysis in founder populations. We present Refined IBD, a new method for IBD segment detection. Refined IBD achieves both computational efficiency and highly accurate IBD segment reporting by searching for IBD in two steps. The first step (identification) uses the GERMLINE algorithm to find shared haplotypes exceeding a length threshold. The second step (refinement), evaluates candidate segments with a probabilistic approach to assess the evidence for IBD. Like GERMLINE, Refined IBD allows for IBD reporting on a haplotype level, which facilitates determination of multi-individual IBD and allows for haplotype-based downstream analyses. To investigate the properties of Refined IBD, we simulate SNP data from a model with recent super-exponential population growth that is designed to match UK data. The simulation results show that Refined IBD achieves a better power/accuracy profile than fastIBD or GERMLINE. We find that a single run of Refined IBD achieves greater power than 10 runs of fastIBD. We also apply Refined IBD to SNP data for samples from the UK and from Northern Finland, and describe the IBD sharing in these data sets. Refined IBD is powerful, highly accurate, easy to use, and is implemented in Beagle version 4.

Link

No comments: