Estimation of recombination rate and maternal linkage disequilibrium in half-sibs

A livestock population can be characterized by different population genetic parameters, such as linkage disequilibrium and recombination rate between pairs of genetic markers. The population structure, which may be caused by family stratification, has an influence on the estimates of these parameters. An expectation maximization algorithm has been proposed for estimating these parameters in half-sibs without phasing the progeny. It, however, overlooks the fact that the underlying likelihood function may have two maxima. The magnitudes of the maxima depend on the maternal allele frequencies at the investigated marker pair. Which maximum the algorithm converges to depends on the chosen start values. We present a stepwise procedure in which the relationship between the two modes is exploited. The expectation maximization algorithm for the parameter estimation is applied twice using different start values, followed by a decision process to assess the most likely estimate. This approach was validated using simulated genotypes of half-sibs. It was also applied to a dairy cattle dataset consisting of multiple half-sib families and 39,780 marker genotypes, leading to estimates for 12,759,713 intrachromosomal marker pairs. Furthermore, the proper order of markers was verified by studying the mean of estimated recombination rates in a window adjacent to the investigated locus as well as in a window at its most distant chromosome end. Putatively misplaced markers or marker clusters were detected by comparing the results with the revised bovine genome assembly UMD 3.1.1. In total, 40 markers were identified as candidates of misplacement. This outcome may help improving the physical order of markers which is also required for refining the bovine genetic map.

Saved in:
Bibliographic Details
Main Authors: Hampel, A., Teuscher, F., Gómez Raya, Luis, Doschoris, M., Wittenburg, D.
Format: artículo biblioteca
Language:English
Published: Frontiers Media 2018
Subjects:Allele frequency, Expectation maximization, Algorithm, Genome assembly, Likelihood function, Linkage analysis,
Online Access:http://hdl.handle.net/20.500.12792/567
http://hdl.handle.net/10261/291211
Tags: Add Tag
No Tags, Be the first to tag this record!
id dig-inia-es-10261-291211
record_format koha
spelling dig-inia-es-10261-2912112023-02-20T07:15:44Z Estimation of recombination rate and maternal linkage disequilibrium in half-sibs Hampel, A. Teuscher, F. Gómez Raya, Luis Doschoris, M. Wittenburg, D. Allele frequency Expectation maximization Algorithm Genome assembly Likelihood function Linkage analysis A livestock population can be characterized by different population genetic parameters, such as linkage disequilibrium and recombination rate between pairs of genetic markers. The population structure, which may be caused by family stratification, has an influence on the estimates of these parameters. An expectation maximization algorithm has been proposed for estimating these parameters in half-sibs without phasing the progeny. It, however, overlooks the fact that the underlying likelihood function may have two maxima. The magnitudes of the maxima depend on the maternal allele frequencies at the investigated marker pair. Which maximum the algorithm converges to depends on the chosen start values. We present a stepwise procedure in which the relationship between the two modes is exploited. The expectation maximization algorithm for the parameter estimation is applied twice using different start values, followed by a decision process to assess the most likely estimate. This approach was validated using simulated genotypes of half-sibs. It was also applied to a dairy cattle dataset consisting of multiple half-sib families and 39,780 marker genotypes, leading to estimates for 12,759,713 intrachromosomal marker pairs. Furthermore, the proper order of markers was verified by studying the mean of estimated recombination rates in a window adjacent to the investigated locus as well as in a window at its most distant chromosome end. Putatively misplaced markers or marker clusters were detected by comparing the results with the revised bovine genome assembly UMD 3.1.1. In total, 40 markers were identified as candidates of misplacement. This outcome may help improving the physical order of markers which is also required for refining the bovine genetic map. 2023-02-20T07:15:44Z 2023-02-20T07:15:44Z 2018 artículo Frontiers in Genetics 9: e186 (2018) http://hdl.handle.net/20.500.12792/567 http://hdl.handle.net/10261/291211 10.3389/fgene.2018.00186 1664-8021 en open Frontiers Media
institution INIA ES
collection DSpace
country España
countrycode ES
component Bibliográfico
access En linea
databasecode dig-inia-es
tag biblioteca
region Europa del Sur
libraryname Biblioteca del INIA España
language English
topic Allele frequency
Expectation maximization
Algorithm
Genome assembly
Likelihood function
Linkage analysis
Allele frequency
Expectation maximization
Algorithm
Genome assembly
Likelihood function
Linkage analysis
spellingShingle Allele frequency
Expectation maximization
Algorithm
Genome assembly
Likelihood function
Linkage analysis
Allele frequency
Expectation maximization
Algorithm
Genome assembly
Likelihood function
Linkage analysis
Hampel, A.
Teuscher, F.
Gómez Raya, Luis
Doschoris, M.
Wittenburg, D.
Estimation of recombination rate and maternal linkage disequilibrium in half-sibs
description A livestock population can be characterized by different population genetic parameters, such as linkage disequilibrium and recombination rate between pairs of genetic markers. The population structure, which may be caused by family stratification, has an influence on the estimates of these parameters. An expectation maximization algorithm has been proposed for estimating these parameters in half-sibs without phasing the progeny. It, however, overlooks the fact that the underlying likelihood function may have two maxima. The magnitudes of the maxima depend on the maternal allele frequencies at the investigated marker pair. Which maximum the algorithm converges to depends on the chosen start values. We present a stepwise procedure in which the relationship between the two modes is exploited. The expectation maximization algorithm for the parameter estimation is applied twice using different start values, followed by a decision process to assess the most likely estimate. This approach was validated using simulated genotypes of half-sibs. It was also applied to a dairy cattle dataset consisting of multiple half-sib families and 39,780 marker genotypes, leading to estimates for 12,759,713 intrachromosomal marker pairs. Furthermore, the proper order of markers was verified by studying the mean of estimated recombination rates in a window adjacent to the investigated locus as well as in a window at its most distant chromosome end. Putatively misplaced markers or marker clusters were detected by comparing the results with the revised bovine genome assembly UMD 3.1.1. In total, 40 markers were identified as candidates of misplacement. This outcome may help improving the physical order of markers which is also required for refining the bovine genetic map.
format artículo
topic_facet Allele frequency
Expectation maximization
Algorithm
Genome assembly
Likelihood function
Linkage analysis
author Hampel, A.
Teuscher, F.
Gómez Raya, Luis
Doschoris, M.
Wittenburg, D.
author_facet Hampel, A.
Teuscher, F.
Gómez Raya, Luis
Doschoris, M.
Wittenburg, D.
author_sort Hampel, A.
title Estimation of recombination rate and maternal linkage disequilibrium in half-sibs
title_short Estimation of recombination rate and maternal linkage disequilibrium in half-sibs
title_full Estimation of recombination rate and maternal linkage disequilibrium in half-sibs
title_fullStr Estimation of recombination rate and maternal linkage disequilibrium in half-sibs
title_full_unstemmed Estimation of recombination rate and maternal linkage disequilibrium in half-sibs
title_sort estimation of recombination rate and maternal linkage disequilibrium in half-sibs
publisher Frontiers Media
publishDate 2018
url http://hdl.handle.net/20.500.12792/567
http://hdl.handle.net/10261/291211
work_keys_str_mv AT hampela estimationofrecombinationrateandmaternallinkagedisequilibriuminhalfsibs
AT teuscherf estimationofrecombinationrateandmaternallinkagedisequilibriuminhalfsibs
AT gomezrayaluis estimationofrecombinationrateandmaternallinkagedisequilibriuminhalfsibs
AT doschorism estimationofrecombinationrateandmaternallinkagedisequilibriuminhalfsibs
AT wittenburgd estimationofrecombinationrateandmaternallinkagedisequilibriuminhalfsibs
_version_ 1767603186597625856