Supplemental Material for Vandenplas, Calus, and Gorjanc, 2018

This study presents a method for genomic prediction that uses individual-level data and summary statistics from multiple populations. Genome-wide markers are nowadays widely used to predict complex traits, and genomic prediction using multi-population data are an appealing approach to achieve higher prediction accuracies. However, sharing of individual-level data across populations is not always possible. We present a method that enables integration of summary statistics from separate analyses with the available individual-level data. The data can either consist of individuals with single or multiple (weighted) phenotype records per individual. We developed a method based on a hypothetical joint analysis model and absorption of population-specific information. We show that population-specific information is fully captured by estimated allele substitution effects and the accuracy of those estimates, i.e., the summary statistics. The method gives identical result as the joint analysis of all individual-level data when complete summary statistics are available. We provide a series of easy-to-use approximations that can be used when complete summary statistics are not available or impractical to share. Simulations show that approximations enable integration of different sources of information across a wide range of settings, yielding accurate predictions. The method can be readily extended to multiple-traits. In summary, the developed method enables integration of genome-wide data in the individual-level or summary statistics from multiple populations to obtain more accurate estimates of allele substitution effects and genomic predictions.

Saved in:
Bibliographic Details
Main Authors: Vandenplas, Jeremie, Calus, Mario, Gorjanc, Gregor
Format: Dataset biblioteca
Published: Wageningen University & Research
Subjects:genomic prediction, meta-analysis, multi-population, quantitative trait, statistical method, summary statistics,
Online Access:https://research.wur.nl/en/datasets/supplemental-material-for-vandenplas-calus-and-gorjanc-2018
Tags: Add Tag
No Tags, Be the first to tag this record!
id dig-wur-nl-wurpubs-561553
record_format koha
spelling dig-wur-nl-wurpubs-5615532024-12-23 Vandenplas, Jeremie Calus, Mario Gorjanc, Gregor Dataset Supplemental Material for Vandenplas, Calus, and Gorjanc, 2018 2018 This study presents a method for genomic prediction that uses individual-level data and summary statistics from multiple populations. Genome-wide markers are nowadays widely used to predict complex traits, and genomic prediction using multi-population data are an appealing approach to achieve higher prediction accuracies. However, sharing of individual-level data across populations is not always possible. We present a method that enables integration of summary statistics from separate analyses with the available individual-level data. The data can either consist of individuals with single or multiple (weighted) phenotype records per individual. We developed a method based on a hypothetical joint analysis model and absorption of population-specific information. We show that population-specific information is fully captured by estimated allele substitution effects and the accuracy of those estimates, i.e., the summary statistics. The method gives identical result as the joint analysis of all individual-level data when complete summary statistics are available. We provide a series of easy-to-use approximations that can be used when complete summary statistics are not available or impractical to share. Simulations show that approximations enable integration of different sources of information across a wide range of settings, yielding accurate predictions. The method can be readily extended to multiple-traits. In summary, the developed method enables integration of genome-wide data in the individual-level or summary statistics from multiple populations to obtain more accurate estimates of allele substitution effects and genomic predictions. Wageningen University & Research text/html https://research.wur.nl/en/datasets/supplemental-material-for-vandenplas-calus-and-gorjanc-2018 10.25386/genetics.6216533 https://edepot.wur.nl/515410 genomic prediction meta-analysis multi-population quantitative trait statistical method summary statistics Wageningen University & Research
institution WUR NL
collection DSpace
country Países bajos
countrycode NL
component Bibliográfico
access En linea
databasecode dig-wur-nl
tag biblioteca
region Europa del Oeste
libraryname WUR Library Netherlands
topic genomic prediction
meta-analysis
multi-population
quantitative trait
statistical method
summary statistics
genomic prediction
meta-analysis
multi-population
quantitative trait
statistical method
summary statistics
spellingShingle genomic prediction
meta-analysis
multi-population
quantitative trait
statistical method
summary statistics
genomic prediction
meta-analysis
multi-population
quantitative trait
statistical method
summary statistics
Vandenplas, Jeremie
Calus, Mario
Gorjanc, Gregor
Supplemental Material for Vandenplas, Calus, and Gorjanc, 2018
description This study presents a method for genomic prediction that uses individual-level data and summary statistics from multiple populations. Genome-wide markers are nowadays widely used to predict complex traits, and genomic prediction using multi-population data are an appealing approach to achieve higher prediction accuracies. However, sharing of individual-level data across populations is not always possible. We present a method that enables integration of summary statistics from separate analyses with the available individual-level data. The data can either consist of individuals with single or multiple (weighted) phenotype records per individual. We developed a method based on a hypothetical joint analysis model and absorption of population-specific information. We show that population-specific information is fully captured by estimated allele substitution effects and the accuracy of those estimates, i.e., the summary statistics. The method gives identical result as the joint analysis of all individual-level data when complete summary statistics are available. We provide a series of easy-to-use approximations that can be used when complete summary statistics are not available or impractical to share. Simulations show that approximations enable integration of different sources of information across a wide range of settings, yielding accurate predictions. The method can be readily extended to multiple-traits. In summary, the developed method enables integration of genome-wide data in the individual-level or summary statistics from multiple populations to obtain more accurate estimates of allele substitution effects and genomic predictions.
format Dataset
topic_facet genomic prediction
meta-analysis
multi-population
quantitative trait
statistical method
summary statistics
author Vandenplas, Jeremie
Calus, Mario
Gorjanc, Gregor
author_facet Vandenplas, Jeremie
Calus, Mario
Gorjanc, Gregor
author_sort Vandenplas, Jeremie
title Supplemental Material for Vandenplas, Calus, and Gorjanc, 2018
title_short Supplemental Material for Vandenplas, Calus, and Gorjanc, 2018
title_full Supplemental Material for Vandenplas, Calus, and Gorjanc, 2018
title_fullStr Supplemental Material for Vandenplas, Calus, and Gorjanc, 2018
title_full_unstemmed Supplemental Material for Vandenplas, Calus, and Gorjanc, 2018
title_sort supplemental material for vandenplas, calus, and gorjanc, 2018
publisher Wageningen University & Research
url https://research.wur.nl/en/datasets/supplemental-material-for-vandenplas-calus-and-gorjanc-2018
work_keys_str_mv AT vandenplasjeremie supplementalmaterialforvandenplascalusandgorjanc2018
AT calusmario supplementalmaterialforvandenplascalusandgorjanc2018
AT gorjancgregor supplementalmaterialforvandenplascalusandgorjanc2018
_version_ 1822268891826487296