Combining multi-population datasets for joint genome-wide association and meta-analyses: The case of bovine milk fat composition traits

In genome-wide association studies (GWAS), sample size is the most important factor affecting statistical power that is under control of the investigator, posing a major challenge in understanding the genetics underlying difficult-to-measure traits. Combining data sets available from different populations for joint or meta-analysis is a promising alternative to increasing sample sizes available for GWAS. Simulation studies indicate statistical advantages from combining raw data or GWAS summaries in enhancing quantitative trait loci (QTL) detection power. However, the complexity of genetics underlying most quantitative traits, which itself is not fully understood, is difficult to fully capture in simulated data sets. In this study, population-specific and combined-population GWAS as well as a meta-analysis of the population-specific GWAS summaries were carried out with the objective of assessing the advantages and challenges of different data-combining strategies in enhancing detection power of GWAS using milk fatty acid (FA) traits as examples. Gas chromatography (GC) quantified milk FA samples and high-density (HD) genotypes were available from 1,566 Dutch, 614 Danish, and 700 Chinese Holstein Friesian cows. Using the joint GWAS, 28 additional genomic regions were detected, with significant associations to at least 1 FA, compared with the population-specific analyses. Some of these additional regions were also detected using the implemented meta-analysis. Furthermore, using the frequently reported variants of the diacylglycerol acyltransferase 1 (DGAT1) and stearoyl-CoA desaturase (SCD1) genes, we show that significant associations were established with more FA traits in the joint GWAS than the remaining scenarios. However, there were few regions detected in the population-specific analyses that were not detected using the joint GWAS or the meta-analyses. Our results show that combining multi-population data set can be a powerful tool to enhance detection power in GWAS for seldom-recorded traits. Detection of a higher number of regions using the meta-analysis, compared with any of the population-specific analyses also emphasizes the utility of these methods in the absence of raw multi-population data sets to undertake joint GWAS.

Saved in:
Bibliographic Details
Main Authors: Gebreyesus, G., Buitenhuis, A.J., Poulsen, N.A., Visker, M.H.P.W., Zhang, Q., van Valenberg, H.J.F., Sun, D., Bovenhuis, H.
Format: Article/Letter to editor biblioteca
Language:English
Subjects:mega-analysis, meta-analysis, multi-population GWAS,
Online Access:https://research.wur.nl/en/publications/combining-multi-population-datasets-for-joint-genome-wide-associa
Tags: Add Tag
No Tags, Be the first to tag this record!
id dig-wur-nl-wurpubs-554610
record_format koha
spelling dig-wur-nl-wurpubs-5546102024-10-02 Gebreyesus, G. Buitenhuis, A.J. Poulsen, N.A. Visker, M.H.P.W. Zhang, Q. van Valenberg, H.J.F. Sun, D. Bovenhuis, H. Article/Letter to editor Journal of Dairy Science 102 (2019) 12 ISSN: 0022-0302 Combining multi-population datasets for joint genome-wide association and meta-analyses: The case of bovine milk fat composition traits 2019 In genome-wide association studies (GWAS), sample size is the most important factor affecting statistical power that is under control of the investigator, posing a major challenge in understanding the genetics underlying difficult-to-measure traits. Combining data sets available from different populations for joint or meta-analysis is a promising alternative to increasing sample sizes available for GWAS. Simulation studies indicate statistical advantages from combining raw data or GWAS summaries in enhancing quantitative trait loci (QTL) detection power. However, the complexity of genetics underlying most quantitative traits, which itself is not fully understood, is difficult to fully capture in simulated data sets. In this study, population-specific and combined-population GWAS as well as a meta-analysis of the population-specific GWAS summaries were carried out with the objective of assessing the advantages and challenges of different data-combining strategies in enhancing detection power of GWAS using milk fatty acid (FA) traits as examples. Gas chromatography (GC) quantified milk FA samples and high-density (HD) genotypes were available from 1,566 Dutch, 614 Danish, and 700 Chinese Holstein Friesian cows. Using the joint GWAS, 28 additional genomic regions were detected, with significant associations to at least 1 FA, compared with the population-specific analyses. Some of these additional regions were also detected using the implemented meta-analysis. Furthermore, using the frequently reported variants of the diacylglycerol acyltransferase 1 (DGAT1) and stearoyl-CoA desaturase (SCD1) genes, we show that significant associations were established with more FA traits in the joint GWAS than the remaining scenarios. However, there were few regions detected in the population-specific analyses that were not detected using the joint GWAS or the meta-analyses. Our results show that combining multi-population data set can be a powerful tool to enhance detection power in GWAS for seldom-recorded traits. Detection of a higher number of regions using the meta-analysis, compared with any of the population-specific analyses also emphasizes the utility of these methods in the absence of raw multi-population data sets to undertake joint GWAS. en application/pdf https://research.wur.nl/en/publications/combining-multi-population-datasets-for-joint-genome-wide-associa 10.3168/jds.2019-16676 https://edepot.wur.nl/502445 mega-analysis meta-analysis multi-population GWAS https://creativecommons.org/licenses/by-nc-nd/4.0/ Wageningen University & Research
institution WUR NL
collection DSpace
country Países bajos
countrycode NL
component Bibliográfico
access En linea
databasecode dig-wur-nl
tag biblioteca
region Europa del Oeste
libraryname WUR Library Netherlands
language English
topic mega-analysis
meta-analysis
multi-population GWAS
mega-analysis
meta-analysis
multi-population GWAS
spellingShingle mega-analysis
meta-analysis
multi-population GWAS
mega-analysis
meta-analysis
multi-population GWAS
Gebreyesus, G.
Buitenhuis, A.J.
Poulsen, N.A.
Visker, M.H.P.W.
Zhang, Q.
van Valenberg, H.J.F.
Sun, D.
Bovenhuis, H.
Combining multi-population datasets for joint genome-wide association and meta-analyses: The case of bovine milk fat composition traits
description In genome-wide association studies (GWAS), sample size is the most important factor affecting statistical power that is under control of the investigator, posing a major challenge in understanding the genetics underlying difficult-to-measure traits. Combining data sets available from different populations for joint or meta-analysis is a promising alternative to increasing sample sizes available for GWAS. Simulation studies indicate statistical advantages from combining raw data or GWAS summaries in enhancing quantitative trait loci (QTL) detection power. However, the complexity of genetics underlying most quantitative traits, which itself is not fully understood, is difficult to fully capture in simulated data sets. In this study, population-specific and combined-population GWAS as well as a meta-analysis of the population-specific GWAS summaries were carried out with the objective of assessing the advantages and challenges of different data-combining strategies in enhancing detection power of GWAS using milk fatty acid (FA) traits as examples. Gas chromatography (GC) quantified milk FA samples and high-density (HD) genotypes were available from 1,566 Dutch, 614 Danish, and 700 Chinese Holstein Friesian cows. Using the joint GWAS, 28 additional genomic regions were detected, with significant associations to at least 1 FA, compared with the population-specific analyses. Some of these additional regions were also detected using the implemented meta-analysis. Furthermore, using the frequently reported variants of the diacylglycerol acyltransferase 1 (DGAT1) and stearoyl-CoA desaturase (SCD1) genes, we show that significant associations were established with more FA traits in the joint GWAS than the remaining scenarios. However, there were few regions detected in the population-specific analyses that were not detected using the joint GWAS or the meta-analyses. Our results show that combining multi-population data set can be a powerful tool to enhance detection power in GWAS for seldom-recorded traits. Detection of a higher number of regions using the meta-analysis, compared with any of the population-specific analyses also emphasizes the utility of these methods in the absence of raw multi-population data sets to undertake joint GWAS.
format Article/Letter to editor
topic_facet mega-analysis
meta-analysis
multi-population GWAS
author Gebreyesus, G.
Buitenhuis, A.J.
Poulsen, N.A.
Visker, M.H.P.W.
Zhang, Q.
van Valenberg, H.J.F.
Sun, D.
Bovenhuis, H.
author_facet Gebreyesus, G.
Buitenhuis, A.J.
Poulsen, N.A.
Visker, M.H.P.W.
Zhang, Q.
van Valenberg, H.J.F.
Sun, D.
Bovenhuis, H.
author_sort Gebreyesus, G.
title Combining multi-population datasets for joint genome-wide association and meta-analyses: The case of bovine milk fat composition traits
title_short Combining multi-population datasets for joint genome-wide association and meta-analyses: The case of bovine milk fat composition traits
title_full Combining multi-population datasets for joint genome-wide association and meta-analyses: The case of bovine milk fat composition traits
title_fullStr Combining multi-population datasets for joint genome-wide association and meta-analyses: The case of bovine milk fat composition traits
title_full_unstemmed Combining multi-population datasets for joint genome-wide association and meta-analyses: The case of bovine milk fat composition traits
title_sort combining multi-population datasets for joint genome-wide association and meta-analyses: the case of bovine milk fat composition traits
url https://research.wur.nl/en/publications/combining-multi-population-datasets-for-joint-genome-wide-associa
work_keys_str_mv AT gebreyesusg combiningmultipopulationdatasetsforjointgenomewideassociationandmetaanalysesthecaseofbovinemilkfatcompositiontraits
AT buitenhuisaj combiningmultipopulationdatasetsforjointgenomewideassociationandmetaanalysesthecaseofbovinemilkfatcompositiontraits
AT poulsenna combiningmultipopulationdatasetsforjointgenomewideassociationandmetaanalysesthecaseofbovinemilkfatcompositiontraits
AT viskermhpw combiningmultipopulationdatasetsforjointgenomewideassociationandmetaanalysesthecaseofbovinemilkfatcompositiontraits
AT zhangq combiningmultipopulationdatasetsforjointgenomewideassociationandmetaanalysesthecaseofbovinemilkfatcompositiontraits
AT vanvalenberghjf combiningmultipopulationdatasetsforjointgenomewideassociationandmetaanalysesthecaseofbovinemilkfatcompositiontraits
AT sund combiningmultipopulationdatasetsforjointgenomewideassociationandmetaanalysesthecaseofbovinemilkfatcompositiontraits
AT bovenhuish combiningmultipopulationdatasetsforjointgenomewideassociationandmetaanalysesthecaseofbovinemilkfatcompositiontraits
_version_ 1813197432423448576