Genomic prediction in maize breeding populations with genotyping-by sequencing

Genotyping-by-sequencing (GBS) technologies have proven capacity for delivering large numbers of marker genotypes with potentially less ascertainment bias than standard single nucleotide polymorphism (SNP) arrays. Therefore, GBS has become an attractive alternative technology for genomic selection. However, the use of GBS data poses important challenges, and the accuracy of genomic prediction using GBS is currently undergoing investigation in several crops, including maize, wheat, and cassava. The main objective of this study was to evaluate various methods for incorporating GBS information and compare them with pedigree models for predicting genetic values of lines from two maize populations evaluated for different traits measured in different environments (experiments 1 and 2). Given that GBS data come with a large percentage of uncalled genotypes, we evaluated methods using nonimputed, imputed, and GBS-inferred haplotypes of different lengths (short or long). GBS and pedigree data were incorporated into statistical models using either the genomic best linear unbiased predictors (GBLUP) or the reproducing kernel Hilbert spaces (RKHS) regressions, and prediction accuracy was quantified using cross-validation methods. The following results were found: relative to pedigree or marker-only models, there were consistent gains in prediction accuracy by combining pedigree and GBS data; there was increased predictive ability when using imputed or nonimputed GBS data over inferred haplotype in experiment 1, or nonimputed GBS and information-based imputed short and long haplotypes, as compared to the other methods in experiment 2; the level of prediction accuracy achieved using GBS data in experiment 2 is comparable to those reported by previous authors who analyzed this data set using SNP arrays; and GBLUP and RKHS models with pedigree with nonimputed and imputed GBS data provided the best prediction correlations for the three traits in experiment 1, whereas for experiment 2 RKHS provided slightly better prediction than GBLUP for drought-stressed environments, and both models provided similar predictions in well-watered environments.

Saved in:
Bibliographic Details
Main Authors: Crossa, J., Beyene, Y., Fentaye Kassa Semagn, Perez-Rodriguez, P., Hickey, J.M., Charles Chen, De los Campos, G., Burgueño, J., Windhausen, V.S., Buckler, E.S., Jannink, J.L., Lopez-Cruz, M., Babu, R.
Format: Article biblioteca
Language:English
Published: Genetics Society of America 2013
Subjects:AGRICULTURAL SCIENCES AND BIOTECHNOLOGY, GenPred, Shared Data Resources, Genotyping by Sequencing, Imputation, Genomic Selection, GBLUP, RKHS, NUCLEOTIDE SEQUENCE, ARTIFICIAL SELECTION, GENETICS, FORECASTING, DATA ANALYSIS, STATISTICAL METHODS,
Online Access:http://hdl.handle.net/10883/3440
Tags: Add Tag
No Tags, Be the first to tag this record!
id dig-cimmyt-10883-3440
record_format koha
spelling dig-cimmyt-10883-34402023-12-08T15:01:40Z Genomic prediction in maize breeding populations with genotyping-by sequencing Crossa, J. Beyene, Y. Fentaye Kassa Semagn Perez-Rodriguez, P. Hickey, J.M. Charles Chen De los Campos, G. Burgueño, J. Windhausen, V.S. Buckler, E.S. Jannink, J.L. Lopez-Cruz, M. Babu, R. AGRICULTURAL SCIENCES AND BIOTECHNOLOGY GenPred Shared Data Resources Genotyping by Sequencing Imputation Genomic Selection GBLUP RKHS NUCLEOTIDE SEQUENCE ARTIFICIAL SELECTION GENETICS FORECASTING DATA ANALYSIS STATISTICAL METHODS Genotyping-by-sequencing (GBS) technologies have proven capacity for delivering large numbers of marker genotypes with potentially less ascertainment bias than standard single nucleotide polymorphism (SNP) arrays. Therefore, GBS has become an attractive alternative technology for genomic selection. However, the use of GBS data poses important challenges, and the accuracy of genomic prediction using GBS is currently undergoing investigation in several crops, including maize, wheat, and cassava. The main objective of this study was to evaluate various methods for incorporating GBS information and compare them with pedigree models for predicting genetic values of lines from two maize populations evaluated for different traits measured in different environments (experiments 1 and 2). Given that GBS data come with a large percentage of uncalled genotypes, we evaluated methods using nonimputed, imputed, and GBS-inferred haplotypes of different lengths (short or long). GBS and pedigree data were incorporated into statistical models using either the genomic best linear unbiased predictors (GBLUP) or the reproducing kernel Hilbert spaces (RKHS) regressions, and prediction accuracy was quantified using cross-validation methods. The following results were found: relative to pedigree or marker-only models, there were consistent gains in prediction accuracy by combining pedigree and GBS data; there was increased predictive ability when using imputed or nonimputed GBS data over inferred haplotype in experiment 1, or nonimputed GBS and information-based imputed short and long haplotypes, as compared to the other methods in experiment 2; the level of prediction accuracy achieved using GBS data in experiment 2 is comparable to those reported by previous authors who analyzed this data set using SNP arrays; and GBLUP and RKHS models with pedigree with nonimputed and imputed GBS data provided the best prediction correlations for the three traits in experiment 1, whereas for experiment 2 RKHS provided slightly better prediction than GBLUP for drought-stressed environments, and both models provided similar predictions in well-watered environments. 1903-1926 2014-03-05T17:59:55Z 2014-03-05T17:59:55Z 2013 Article 2160-1836 http://hdl.handle.net/10883/3440 10.1534/g3.113.008227 English CIMMYT manages Intellectual Assets as International Public Goods. The user is free to download, print, store and share this work. In case you want to translate or create any other derivative work and share or distribute such translation/derivative work, please contact CIMMYT-Knowledge-Center@cgiar.org indicating the work you want to use and the kind of use you intend; CIMMYT will contact you with the suitable license for that purpose. Open Access PDF Genetics Society of America http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3815055/ 11 3 G3: Genes, Genomes, Genetics
institution CIMMYT
collection DSpace
country México
countrycode MX
component Bibliográfico
access En linea
databasecode dig-cimmyt
tag biblioteca
region America del Norte
libraryname CIMMYT Library
language English
topic AGRICULTURAL SCIENCES AND BIOTECHNOLOGY
GenPred
Shared Data Resources
Genotyping by Sequencing
Imputation
Genomic Selection
GBLUP
RKHS
NUCLEOTIDE SEQUENCE
ARTIFICIAL SELECTION
GENETICS
FORECASTING
DATA ANALYSIS
STATISTICAL METHODS
AGRICULTURAL SCIENCES AND BIOTECHNOLOGY
GenPred
Shared Data Resources
Genotyping by Sequencing
Imputation
Genomic Selection
GBLUP
RKHS
NUCLEOTIDE SEQUENCE
ARTIFICIAL SELECTION
GENETICS
FORECASTING
DATA ANALYSIS
STATISTICAL METHODS
spellingShingle AGRICULTURAL SCIENCES AND BIOTECHNOLOGY
GenPred
Shared Data Resources
Genotyping by Sequencing
Imputation
Genomic Selection
GBLUP
RKHS
NUCLEOTIDE SEQUENCE
ARTIFICIAL SELECTION
GENETICS
FORECASTING
DATA ANALYSIS
STATISTICAL METHODS
AGRICULTURAL SCIENCES AND BIOTECHNOLOGY
GenPred
Shared Data Resources
Genotyping by Sequencing
Imputation
Genomic Selection
GBLUP
RKHS
NUCLEOTIDE SEQUENCE
ARTIFICIAL SELECTION
GENETICS
FORECASTING
DATA ANALYSIS
STATISTICAL METHODS
Crossa, J.
Beyene, Y.
Fentaye Kassa Semagn
Perez-Rodriguez, P.
Hickey, J.M.
Charles Chen
De los Campos, G.
Burgueño, J.
Windhausen, V.S.
Buckler, E.S.
Jannink, J.L.
Lopez-Cruz, M.
Babu, R.
Genomic prediction in maize breeding populations with genotyping-by sequencing
description Genotyping-by-sequencing (GBS) technologies have proven capacity for delivering large numbers of marker genotypes with potentially less ascertainment bias than standard single nucleotide polymorphism (SNP) arrays. Therefore, GBS has become an attractive alternative technology for genomic selection. However, the use of GBS data poses important challenges, and the accuracy of genomic prediction using GBS is currently undergoing investigation in several crops, including maize, wheat, and cassava. The main objective of this study was to evaluate various methods for incorporating GBS information and compare them with pedigree models for predicting genetic values of lines from two maize populations evaluated for different traits measured in different environments (experiments 1 and 2). Given that GBS data come with a large percentage of uncalled genotypes, we evaluated methods using nonimputed, imputed, and GBS-inferred haplotypes of different lengths (short or long). GBS and pedigree data were incorporated into statistical models using either the genomic best linear unbiased predictors (GBLUP) or the reproducing kernel Hilbert spaces (RKHS) regressions, and prediction accuracy was quantified using cross-validation methods. The following results were found: relative to pedigree or marker-only models, there were consistent gains in prediction accuracy by combining pedigree and GBS data; there was increased predictive ability when using imputed or nonimputed GBS data over inferred haplotype in experiment 1, or nonimputed GBS and information-based imputed short and long haplotypes, as compared to the other methods in experiment 2; the level of prediction accuracy achieved using GBS data in experiment 2 is comparable to those reported by previous authors who analyzed this data set using SNP arrays; and GBLUP and RKHS models with pedigree with nonimputed and imputed GBS data provided the best prediction correlations for the three traits in experiment 1, whereas for experiment 2 RKHS provided slightly better prediction than GBLUP for drought-stressed environments, and both models provided similar predictions in well-watered environments.
format Article
topic_facet AGRICULTURAL SCIENCES AND BIOTECHNOLOGY
GenPred
Shared Data Resources
Genotyping by Sequencing
Imputation
Genomic Selection
GBLUP
RKHS
NUCLEOTIDE SEQUENCE
ARTIFICIAL SELECTION
GENETICS
FORECASTING
DATA ANALYSIS
STATISTICAL METHODS
author Crossa, J.
Beyene, Y.
Fentaye Kassa Semagn
Perez-Rodriguez, P.
Hickey, J.M.
Charles Chen
De los Campos, G.
Burgueño, J.
Windhausen, V.S.
Buckler, E.S.
Jannink, J.L.
Lopez-Cruz, M.
Babu, R.
author_facet Crossa, J.
Beyene, Y.
Fentaye Kassa Semagn
Perez-Rodriguez, P.
Hickey, J.M.
Charles Chen
De los Campos, G.
Burgueño, J.
Windhausen, V.S.
Buckler, E.S.
Jannink, J.L.
Lopez-Cruz, M.
Babu, R.
author_sort Crossa, J.
title Genomic prediction in maize breeding populations with genotyping-by sequencing
title_short Genomic prediction in maize breeding populations with genotyping-by sequencing
title_full Genomic prediction in maize breeding populations with genotyping-by sequencing
title_fullStr Genomic prediction in maize breeding populations with genotyping-by sequencing
title_full_unstemmed Genomic prediction in maize breeding populations with genotyping-by sequencing
title_sort genomic prediction in maize breeding populations with genotyping-by sequencing
publisher Genetics Society of America
publishDate 2013
url http://hdl.handle.net/10883/3440
work_keys_str_mv AT crossaj genomicpredictioninmaizebreedingpopulationswithgenotypingbysequencing
AT beyeney genomicpredictioninmaizebreedingpopulationswithgenotypingbysequencing
AT fentayekassasemagn genomicpredictioninmaizebreedingpopulationswithgenotypingbysequencing
AT perezrodriguezp genomicpredictioninmaizebreedingpopulationswithgenotypingbysequencing
AT hickeyjm genomicpredictioninmaizebreedingpopulationswithgenotypingbysequencing
AT charleschen genomicpredictioninmaizebreedingpopulationswithgenotypingbysequencing
AT deloscamposg genomicpredictioninmaizebreedingpopulationswithgenotypingbysequencing
AT burguenoj genomicpredictioninmaizebreedingpopulationswithgenotypingbysequencing
AT windhausenvs genomicpredictioninmaizebreedingpopulationswithgenotypingbysequencing
AT buckleres genomicpredictioninmaizebreedingpopulationswithgenotypingbysequencing
AT janninkjl genomicpredictioninmaizebreedingpopulationswithgenotypingbysequencing
AT lopezcruzm genomicpredictioninmaizebreedingpopulationswithgenotypingbysequencing
AT babur genomicpredictioninmaizebreedingpopulationswithgenotypingbysequencing
_version_ 1787232867409985536