Scalable sparse testing genomic selection strategy for early yield testing stage

To enable a scalable sparse testing genomic selection (GS) strategy at preliminary yield trials in the CIMMYT maize breeding program, optimal approaches to incorporate genotype by environment interaction (GEI) in genomic prediction models are explored. Two cross-validation schemes were evaluated: CV1, predicting the genetic merit of new bi-parental populations that have been evaluated in some environments and not others, and CV2, predicting the genetic merit of half of a bi-parental population that has been phenotyped in some environments and not others using the coefficient of determination (CDmean) to determine optimized subsets of a full-sib family to be evaluated in each environment. We report similar prediction accuracies in CV1 and CV2, however, CV2 has an intuitive appeal in that all bi-parental populations have representation across environments, allowing efficient use of information across environments. It is also ideal for building robust historical data because all individuals of a full-sib family have phenotypic data, albeit in different environments. Results show that grouping of environments according to similar growing/management conditions improved prediction accuracy and reduced computational requirements, providing a scalable, parsimonious approach to multi-environmental trials and GS in early testing stages. We further demonstrate that complementing the full-sib calibration set with optimized historical data results in improved prediction accuracy for the cross-validation schemes.

Saved in:
Bibliographic Details
Main Authors: Atanda, A.S., Olsen, M., Crossa, J., Burgueño, J., Rincent, R., Dzidzienyo, D., Beyene, Y., Gowda, M., Dreher, K.A., Prasanna, B.M., Tongoona, P., Danquah, E., Olaoye, G., Robbins, K.
Format: Article biblioteca
Language:English
Published: Frontiers 2021
Subjects:AGRICULTURAL SCIENCES AND BIOTECHNOLOGY, Genomic Selection, Preliminary Yield Trials, Prediction Accuracy, Unstructured Model, CDmean, MARKER-ASSISTED SELECTION, FACTOR ANALYSIS, MODELS,
Online Access:https://hdl.handle.net/10883/21621
Tags: Add Tag
No Tags, Be the first to tag this record!
id dig-cimmyt-10883-21621
record_format koha
spelling dig-cimmyt-10883-216212023-12-14T14:29:54Z Scalable sparse testing genomic selection strategy for early yield testing stage Atanda, A.S. Olsen, M. Crossa, J. Burgueño, J. Rincent, R. Dzidzienyo, D. Beyene, Y. Gowda, M. Dreher, K.A. Prasanna, B.M. Tongoona, P. Danquah, E. Olaoye, G. Robbins, K. AGRICULTURAL SCIENCES AND BIOTECHNOLOGY Genomic Selection Preliminary Yield Trials Prediction Accuracy Unstructured Model CDmean MARKER-ASSISTED SELECTION FACTOR ANALYSIS MODELS To enable a scalable sparse testing genomic selection (GS) strategy at preliminary yield trials in the CIMMYT maize breeding program, optimal approaches to incorporate genotype by environment interaction (GEI) in genomic prediction models are explored. Two cross-validation schemes were evaluated: CV1, predicting the genetic merit of new bi-parental populations that have been evaluated in some environments and not others, and CV2, predicting the genetic merit of half of a bi-parental population that has been phenotyped in some environments and not others using the coefficient of determination (CDmean) to determine optimized subsets of a full-sib family to be evaluated in each environment. We report similar prediction accuracies in CV1 and CV2, however, CV2 has an intuitive appeal in that all bi-parental populations have representation across environments, allowing efficient use of information across environments. It is also ideal for building robust historical data because all individuals of a full-sib family have phenotypic data, albeit in different environments. Results show that grouping of environments according to similar growing/management conditions improved prediction accuracy and reduced computational requirements, providing a scalable, parsimonious approach to multi-environmental trials and GS in early testing stages. We further demonstrate that complementing the full-sib calibration set with optimized historical data results in improved prediction accuracy for the cross-validation schemes. 2021-08-24T00:05:16Z 2021-08-24T00:05:16Z 2021 Article Published Version https://hdl.handle.net/10883/21621 10.3389/fpls.2021.658978 English https://figshare.com/collections/Scalable_Sparse_Testing_Genomic_Selection_Strategy_for_Early_Yield_Testing_Stage/5478072 CIMMYT manages Intellectual Assets as International Public Goods. The user is free to download, print, store and share this work. In case you want to translate or create any other derivative work and share or distribute such translation/derivative work, please contact CIMMYT-Knowledge-Center@cgiar.org indicating the work you want to use and the kind of use you intend; CIMMYT will contact you with the suitable license for that purpose Open Access Switzerland Frontiers 12 1664-462X Frontiers in Plant Science 658978
institution CIMMYT
collection DSpace
country México
countrycode MX
component Bibliográfico
access En linea
databasecode dig-cimmyt
tag biblioteca
region America del Norte
libraryname CIMMYT Library
language English
topic AGRICULTURAL SCIENCES AND BIOTECHNOLOGY
Genomic Selection
Preliminary Yield Trials
Prediction Accuracy
Unstructured Model
CDmean
MARKER-ASSISTED SELECTION
FACTOR ANALYSIS
MODELS
AGRICULTURAL SCIENCES AND BIOTECHNOLOGY
Genomic Selection
Preliminary Yield Trials
Prediction Accuracy
Unstructured Model
CDmean
MARKER-ASSISTED SELECTION
FACTOR ANALYSIS
MODELS
spellingShingle AGRICULTURAL SCIENCES AND BIOTECHNOLOGY
Genomic Selection
Preliminary Yield Trials
Prediction Accuracy
Unstructured Model
CDmean
MARKER-ASSISTED SELECTION
FACTOR ANALYSIS
MODELS
AGRICULTURAL SCIENCES AND BIOTECHNOLOGY
Genomic Selection
Preliminary Yield Trials
Prediction Accuracy
Unstructured Model
CDmean
MARKER-ASSISTED SELECTION
FACTOR ANALYSIS
MODELS
Atanda, A.S.
Olsen, M.
Crossa, J.
Burgueño, J.
Rincent, R.
Dzidzienyo, D.
Beyene, Y.
Gowda, M.
Dreher, K.A.
Prasanna, B.M.
Tongoona, P.
Danquah, E.
Olaoye, G.
Robbins, K.
Scalable sparse testing genomic selection strategy for early yield testing stage
description To enable a scalable sparse testing genomic selection (GS) strategy at preliminary yield trials in the CIMMYT maize breeding program, optimal approaches to incorporate genotype by environment interaction (GEI) in genomic prediction models are explored. Two cross-validation schemes were evaluated: CV1, predicting the genetic merit of new bi-parental populations that have been evaluated in some environments and not others, and CV2, predicting the genetic merit of half of a bi-parental population that has been phenotyped in some environments and not others using the coefficient of determination (CDmean) to determine optimized subsets of a full-sib family to be evaluated in each environment. We report similar prediction accuracies in CV1 and CV2, however, CV2 has an intuitive appeal in that all bi-parental populations have representation across environments, allowing efficient use of information across environments. It is also ideal for building robust historical data because all individuals of a full-sib family have phenotypic data, albeit in different environments. Results show that grouping of environments according to similar growing/management conditions improved prediction accuracy and reduced computational requirements, providing a scalable, parsimonious approach to multi-environmental trials and GS in early testing stages. We further demonstrate that complementing the full-sib calibration set with optimized historical data results in improved prediction accuracy for the cross-validation schemes.
format Article
topic_facet AGRICULTURAL SCIENCES AND BIOTECHNOLOGY
Genomic Selection
Preliminary Yield Trials
Prediction Accuracy
Unstructured Model
CDmean
MARKER-ASSISTED SELECTION
FACTOR ANALYSIS
MODELS
author Atanda, A.S.
Olsen, M.
Crossa, J.
Burgueño, J.
Rincent, R.
Dzidzienyo, D.
Beyene, Y.
Gowda, M.
Dreher, K.A.
Prasanna, B.M.
Tongoona, P.
Danquah, E.
Olaoye, G.
Robbins, K.
author_facet Atanda, A.S.
Olsen, M.
Crossa, J.
Burgueño, J.
Rincent, R.
Dzidzienyo, D.
Beyene, Y.
Gowda, M.
Dreher, K.A.
Prasanna, B.M.
Tongoona, P.
Danquah, E.
Olaoye, G.
Robbins, K.
author_sort Atanda, A.S.
title Scalable sparse testing genomic selection strategy for early yield testing stage
title_short Scalable sparse testing genomic selection strategy for early yield testing stage
title_full Scalable sparse testing genomic selection strategy for early yield testing stage
title_fullStr Scalable sparse testing genomic selection strategy for early yield testing stage
title_full_unstemmed Scalable sparse testing genomic selection strategy for early yield testing stage
title_sort scalable sparse testing genomic selection strategy for early yield testing stage
publisher Frontiers
publishDate 2021
url https://hdl.handle.net/10883/21621
work_keys_str_mv AT atandaas scalablesparsetestinggenomicselectionstrategyforearlyyieldtestingstage
AT olsenm scalablesparsetestinggenomicselectionstrategyforearlyyieldtestingstage
AT crossaj scalablesparsetestinggenomicselectionstrategyforearlyyieldtestingstage
AT burguenoj scalablesparsetestinggenomicselectionstrategyforearlyyieldtestingstage
AT rincentr scalablesparsetestinggenomicselectionstrategyforearlyyieldtestingstage
AT dzidzienyod scalablesparsetestinggenomicselectionstrategyforearlyyieldtestingstage
AT beyeney scalablesparsetestinggenomicselectionstrategyforearlyyieldtestingstage
AT gowdam scalablesparsetestinggenomicselectionstrategyforearlyyieldtestingstage
AT dreherka scalablesparsetestinggenomicselectionstrategyforearlyyieldtestingstage
AT prasannabm scalablesparsetestinggenomicselectionstrategyforearlyyieldtestingstage
AT tongoonap scalablesparsetestinggenomicselectionstrategyforearlyyieldtestingstage
AT danquahe scalablesparsetestinggenomicselectionstrategyforearlyyieldtestingstage
AT olaoyeg scalablesparsetestinggenomicselectionstrategyforearlyyieldtestingstage
AT robbinsk scalablesparsetestinggenomicselectionstrategyforearlyyieldtestingstage
_version_ 1787232996610277376