Application of genomic selection at the early stage of breeding pipeline in tropical maize

In maize, doubled haploid (DH) line production capacity of large-sized maize breeding programs often exceeds the capacity to phenotypically evaluate the complete set of testcross candidates in multi-location trials. The ability to partially select DH lines based on genotypic data while maintaining or improving genetic gains for key traits using phenotypic selection can result in significant resource savings. The present study aimed to evaluate genomic selection (GS) prediction scenarios for grain yield and agronomic traits of one of the tropical maize breeding pipelines of CIMMYT in eastern Africa, based on multi-year empirical data for designing a GS-based strategy at the early stages of the pipeline. We used field data from 3,068 tropical maize DH lines genotyped using rAmpSeq markers and evaluated as test crosses in well-watered (WW) and water-stress (WS) environments in Kenya from 2017 to 2019. Three prediction schemes were compared: (1) 1 year of performance data to predict a second year; (2) 2 years of pooled data to predict performance in the third year, and (3) using individual or pooled data plus converting a certain proportion of individuals from the testing set (TST) to the training set (TRN) to predict the next year's data. Employing five-fold cross-validation, the mean prediction accuracies for grain yield (GY) varied from 0.19 to 0.29 under WW and 0.22 to 0.31 under WS, when the 1-year datasets were used training set to predict a second year's data as a testing set. The mean prediction accuracies increased to 0.32 under WW and 0.31 under WS when the 2-year datasets were used as a training set to predict the third-year data set. In a forward prediction scenario, good predictive abilities (0.53 to 0.71) were found when the training set consisted of the previous year's breeding data and converting 30% of the next year's data from the testing set to the training set. The prediction accuracy for anthesis date and plant height across WW and WS environments obtained using 1-year data and integrating 10, 30, 50, 70, and 90% of the TST set to TRN set was much higher than those trained in individual years. We demonstrate that by increasing the TRN set to include genotypic and phenotypic data from the previous year and combining only 10–30% of the lines from the year of testing, the predicting accuracy can be increased, which in turn could be used to replace the first stage of field-based screening partially, thus saving significant costs associated with the testcross formation and multi-location testcross evaluation.

Saved in:
Bibliographic Details
Main Authors: Beyene, Y., Gowda, M., Perez-Rodriguez, P., Olsen, M., Robbins, K., Burgueño, J., Prasanna, B.M., Crossa, J.
Format: Article biblioteca
Language:English
Published: Frontiers 2021
Subjects:AGRICULTURAL SCIENCES AND BIOTECHNOLOGY, Early-Stage Testing, Genomic Selection, Prediction Accuracy, Tropical Maize, GBLUP, TESTING, MARKER-ASSISTED SELECTION, MAIZE,
Online Access:https://hdl.handle.net/10883/21592
Tags: Add Tag
No Tags, Be the first to tag this record!
id dig-cimmyt-10883-21592
record_format koha
spelling dig-cimmyt-10883-215922021-08-25T13:26:07Z Application of genomic selection at the early stage of breeding pipeline in tropical maize Beyene, Y. Gowda, M. Perez-Rodriguez, P. Olsen, M. Robbins, K. Burgueño, J. Prasanna, B.M. Crossa, J. AGRICULTURAL SCIENCES AND BIOTECHNOLOGY Early-Stage Testing Genomic Selection Prediction Accuracy Tropical Maize GBLUP TESTING MARKER-ASSISTED SELECTION MAIZE In maize, doubled haploid (DH) line production capacity of large-sized maize breeding programs often exceeds the capacity to phenotypically evaluate the complete set of testcross candidates in multi-location trials. The ability to partially select DH lines based on genotypic data while maintaining or improving genetic gains for key traits using phenotypic selection can result in significant resource savings. The present study aimed to evaluate genomic selection (GS) prediction scenarios for grain yield and agronomic traits of one of the tropical maize breeding pipelines of CIMMYT in eastern Africa, based on multi-year empirical data for designing a GS-based strategy at the early stages of the pipeline. We used field data from 3,068 tropical maize DH lines genotyped using rAmpSeq markers and evaluated as test crosses in well-watered (WW) and water-stress (WS) environments in Kenya from 2017 to 2019. Three prediction schemes were compared: (1) 1 year of performance data to predict a second year; (2) 2 years of pooled data to predict performance in the third year, and (3) using individual or pooled data plus converting a certain proportion of individuals from the testing set (TST) to the training set (TRN) to predict the next year's data. Employing five-fold cross-validation, the mean prediction accuracies for grain yield (GY) varied from 0.19 to 0.29 under WW and 0.22 to 0.31 under WS, when the 1-year datasets were used training set to predict a second year's data as a testing set. The mean prediction accuracies increased to 0.32 under WW and 0.31 under WS when the 2-year datasets were used as a training set to predict the third-year data set. In a forward prediction scenario, good predictive abilities (0.53 to 0.71) were found when the training set consisted of the previous year's breeding data and converting 30% of the next year's data from the testing set to the training set. The prediction accuracy for anthesis date and plant height across WW and WS environments obtained using 1-year data and integrating 10, 30, 50, 70, and 90% of the TST set to TRN set was much higher than those trained in individual years. We demonstrate that by increasing the TRN set to include genotypic and phenotypic data from the previous year and combining only 10–30% of the lines from the year of testing, the predicting accuracy can be increased, which in turn could be used to replace the first stage of field-based screening partially, thus saving significant costs associated with the testcross formation and multi-location testcross evaluation. 2021-07-31T00:05:19Z 2021-07-31T00:05:19Z 2021 Article Published Version https://hdl.handle.net/10883/21592 10.3389/fpls.2021.685488 English https://figshare.com/collections/Application_of_Genomic_Selection_at_the_Early_Stage_of_Breeding_Pipeline_in_Tropical_Maize/5486790 CIMMYT manages Intellectual Assets as International Public Goods. The user is free to download, print, store and share this work. In case you want to translate or create any other derivative work and share or distribute such translation/derivative work, please contact CIMMYT-Knowledge-Center@cgiar.org indicating the work you want to use and the kind of use you intend; CIMMYT will contact you with the suitable license for that purpose Open Access Switzerland Frontiers 12 1664-462X Frontiers in Plant Science 685488
institution CIMMYT
collection DSpace
country México
countrycode MX
component Bibliográfico
access En linea
databasecode dig-cimmyt
tag biblioteca
region America del Norte
libraryname CIMMYT Library
language English
topic AGRICULTURAL SCIENCES AND BIOTECHNOLOGY
Early-Stage Testing
Genomic Selection
Prediction Accuracy
Tropical Maize
GBLUP
TESTING
MARKER-ASSISTED SELECTION
MAIZE
AGRICULTURAL SCIENCES AND BIOTECHNOLOGY
Early-Stage Testing
Genomic Selection
Prediction Accuracy
Tropical Maize
GBLUP
TESTING
MARKER-ASSISTED SELECTION
MAIZE
spellingShingle AGRICULTURAL SCIENCES AND BIOTECHNOLOGY
Early-Stage Testing
Genomic Selection
Prediction Accuracy
Tropical Maize
GBLUP
TESTING
MARKER-ASSISTED SELECTION
MAIZE
AGRICULTURAL SCIENCES AND BIOTECHNOLOGY
Early-Stage Testing
Genomic Selection
Prediction Accuracy
Tropical Maize
GBLUP
TESTING
MARKER-ASSISTED SELECTION
MAIZE
Beyene, Y.
Gowda, M.
Perez-Rodriguez, P.
Olsen, M.
Robbins, K.
Burgueño, J.
Prasanna, B.M.
Crossa, J.
Application of genomic selection at the early stage of breeding pipeline in tropical maize
description In maize, doubled haploid (DH) line production capacity of large-sized maize breeding programs often exceeds the capacity to phenotypically evaluate the complete set of testcross candidates in multi-location trials. The ability to partially select DH lines based on genotypic data while maintaining or improving genetic gains for key traits using phenotypic selection can result in significant resource savings. The present study aimed to evaluate genomic selection (GS) prediction scenarios for grain yield and agronomic traits of one of the tropical maize breeding pipelines of CIMMYT in eastern Africa, based on multi-year empirical data for designing a GS-based strategy at the early stages of the pipeline. We used field data from 3,068 tropical maize DH lines genotyped using rAmpSeq markers and evaluated as test crosses in well-watered (WW) and water-stress (WS) environments in Kenya from 2017 to 2019. Three prediction schemes were compared: (1) 1 year of performance data to predict a second year; (2) 2 years of pooled data to predict performance in the third year, and (3) using individual or pooled data plus converting a certain proportion of individuals from the testing set (TST) to the training set (TRN) to predict the next year's data. Employing five-fold cross-validation, the mean prediction accuracies for grain yield (GY) varied from 0.19 to 0.29 under WW and 0.22 to 0.31 under WS, when the 1-year datasets were used training set to predict a second year's data as a testing set. The mean prediction accuracies increased to 0.32 under WW and 0.31 under WS when the 2-year datasets were used as a training set to predict the third-year data set. In a forward prediction scenario, good predictive abilities (0.53 to 0.71) were found when the training set consisted of the previous year's breeding data and converting 30% of the next year's data from the testing set to the training set. The prediction accuracy for anthesis date and plant height across WW and WS environments obtained using 1-year data and integrating 10, 30, 50, 70, and 90% of the TST set to TRN set was much higher than those trained in individual years. We demonstrate that by increasing the TRN set to include genotypic and phenotypic data from the previous year and combining only 10–30% of the lines from the year of testing, the predicting accuracy can be increased, which in turn could be used to replace the first stage of field-based screening partially, thus saving significant costs associated with the testcross formation and multi-location testcross evaluation.
format Article
topic_facet AGRICULTURAL SCIENCES AND BIOTECHNOLOGY
Early-Stage Testing
Genomic Selection
Prediction Accuracy
Tropical Maize
GBLUP
TESTING
MARKER-ASSISTED SELECTION
MAIZE
author Beyene, Y.
Gowda, M.
Perez-Rodriguez, P.
Olsen, M.
Robbins, K.
Burgueño, J.
Prasanna, B.M.
Crossa, J.
author_facet Beyene, Y.
Gowda, M.
Perez-Rodriguez, P.
Olsen, M.
Robbins, K.
Burgueño, J.
Prasanna, B.M.
Crossa, J.
author_sort Beyene, Y.
title Application of genomic selection at the early stage of breeding pipeline in tropical maize
title_short Application of genomic selection at the early stage of breeding pipeline in tropical maize
title_full Application of genomic selection at the early stage of breeding pipeline in tropical maize
title_fullStr Application of genomic selection at the early stage of breeding pipeline in tropical maize
title_full_unstemmed Application of genomic selection at the early stage of breeding pipeline in tropical maize
title_sort application of genomic selection at the early stage of breeding pipeline in tropical maize
publisher Frontiers
publishDate 2021
url https://hdl.handle.net/10883/21592
work_keys_str_mv AT beyeney applicationofgenomicselectionattheearlystageofbreedingpipelineintropicalmaize
AT gowdam applicationofgenomicselectionattheearlystageofbreedingpipelineintropicalmaize
AT perezrodriguezp applicationofgenomicselectionattheearlystageofbreedingpipelineintropicalmaize
AT olsenm applicationofgenomicselectionattheearlystageofbreedingpipelineintropicalmaize
AT robbinsk applicationofgenomicselectionattheearlystageofbreedingpipelineintropicalmaize
AT burguenoj applicationofgenomicselectionattheearlystageofbreedingpipelineintropicalmaize
AT prasannabm applicationofgenomicselectionattheearlystageofbreedingpipelineintropicalmaize
AT crossaj applicationofgenomicselectionattheearlystageofbreedingpipelineintropicalmaize
_version_ 1756087005725327360