Envirome-wide associations enhance multi-year genome-based prediction of historical wheat breeding data

Linking high-throughput environmental data (enviromics) to genomic prediction (GP) is a cost-effective strategy for increasing selection intensity under genotype-by-environment interactions (G × E). This study developed a data-driven approach based on Environment-Phenotype Associations (EPA) aimed at recycling important G × E information from historical breeding data. EPA was developed in two applications: (1) scanning a secondary source of genetic variation, weighted from the shared reaction-norms of past-evaluated genotypes; (2) pinpointing weights of the similarity among trial-sites (locations), given the historical impact of each envirotyping data variable for a given site. These results were then used as a dimensionality reduction strategy, integrating historical data to feed multi-environment GP models, which led to development of four new G × E kernels considering genomics, enviromics and EPA outcomes. The wheat trial data used included 36 locations, eight years and three target populations of environments (TPE) in India. Four prediction scenarios and six kernel-models within/across TPEs were tested. Our results suggest that the conventional GBLUP, without enviromic data or when omitting EPA, is inefficient in predicting the performance of wheat lines in future years. Nevertheless, when EPA was introduced as an intermediary learning step to reduce the dimensionality of the G × E kernels while connecting phenotypic and environmental-wide variation, a significant enhancement of G × E prediction accuracy was evident. EPA revealed that the effect of seasonality makes strategies such as “covariable selection” unfeasible because G × E is year-germplasm specific. We propose that the EPA effectively serves as a “reinforcement learner” algorithm capable of uncovering the effect of seasonality over the reaction-norms, with the benefits of better forecasting the similarities between past and future trialing sites. EPA combines the benefits of dimensionality reduction while reducing the uncertainty of genotype-by-year predictions and increasing the resolution of GP for the genotype-specific level.

Saved in:
Bibliographic Details
Main Authors: Costa-Neto, G., Crespo Herrera, L.A., Fradgley, N., Gardner, K.A., Bentley, A.R., Dreisigacker, S., Fritsche-Neto, R., Montesinos-Lopez, O.A., Crossa, J.
Format: Article biblioteca
Language:English
Published: Genetics Society of America 2023
Subjects:AGRICULTURAL SCIENCES AND BIOTECHNOLOGY, Genomic Selection, Wheat Breeding, Envirotyping, Target Population of Environments, MARKER-ASSISTED SELECTION, CLIMATE CHANGE, WHEAT, BREEDING, ADAPTABILITY, ENVIRONMENT,
Online Access:https://hdl.handle.net/10883/22364
Tags: Add Tag
No Tags, Be the first to tag this record!
id dig-cimmyt-10883-22364
record_format koha
spelling dig-cimmyt-10883-223642024-02-13T16:18:08Z Envirome-wide associations enhance multi-year genome-based prediction of historical wheat breeding data Costa-Neto, G. Crespo Herrera, L.A. Fradgley, N. Gardner, K.A. Bentley, A.R. Dreisigacker, S. Fritsche-Neto, R. Montesinos-Lopez, O.A. Crossa, J. AGRICULTURAL SCIENCES AND BIOTECHNOLOGY Genomic Selection Wheat Breeding Envirotyping Target Population of Environments MARKER-ASSISTED SELECTION CLIMATE CHANGE WHEAT BREEDING ADAPTABILITY ENVIRONMENT Linking high-throughput environmental data (enviromics) to genomic prediction (GP) is a cost-effective strategy for increasing selection intensity under genotype-by-environment interactions (G × E). This study developed a data-driven approach based on Environment-Phenotype Associations (EPA) aimed at recycling important G × E information from historical breeding data. EPA was developed in two applications: (1) scanning a secondary source of genetic variation, weighted from the shared reaction-norms of past-evaluated genotypes; (2) pinpointing weights of the similarity among trial-sites (locations), given the historical impact of each envirotyping data variable for a given site. These results were then used as a dimensionality reduction strategy, integrating historical data to feed multi-environment GP models, which led to development of four new G × E kernels considering genomics, enviromics and EPA outcomes. The wheat trial data used included 36 locations, eight years and three target populations of environments (TPE) in India. Four prediction scenarios and six kernel-models within/across TPEs were tested. Our results suggest that the conventional GBLUP, without enviromic data or when omitting EPA, is inefficient in predicting the performance of wheat lines in future years. Nevertheless, when EPA was introduced as an intermediary learning step to reduce the dimensionality of the G × E kernels while connecting phenotypic and environmental-wide variation, a significant enhancement of G × E prediction accuracy was evident. EPA revealed that the effect of seasonality makes strategies such as “covariable selection” unfeasible because G × E is year-germplasm specific. We propose that the EPA effectively serves as a “reinforcement learner” algorithm capable of uncovering the effect of seasonality over the reaction-norms, with the benefits of better forecasting the similarities between past and future trialing sites. EPA combines the benefits of dimensionality reduction while reducing the uncertainty of genotype-by-year predictions and increasing the resolution of GP for the genotype-specific level. 2022-12-24T01:10:13Z 2022-12-24T01:10:13Z 2023 Article Published Version https://hdl.handle.net/10883/22364 10.1093/g3journal/jkac313 English https://github.com/gcostaneto/EPA-PLS Poverty reduction, livelihoods & jobs Climate adaptation & mitigation Accelerated Breeding Genetic Innovation Bill & Melinda Gates Foundation (BMGF) Accelerating Genetic Gains in Maize and Wheat for Improved Livelihoods (AGG) United States Agency for International Development (USAID) CGIAR Research Program on Maize CGIAR Research Program on Wheat Foundation for Research Levy on Agricultural Products (FFL) Agricultural Agreement Research Fund https://hdl.handle.net/10568/126482 CIMMYT manages Intellectual Assets as International Public Goods. The user is free to download, print, store and share this work. In case you want to translate or create any other derivative work and share or distribute such translation/derivative work, please contact CIMMYT-Knowledge-Center@cgiar.org indicating the work you want to use and the kind of use you intend; CIMMYT will contact you with the suitable license for that purpose Open Access Bethesda, MD (USA) Genetics Society of America 2 13 2160-1836 G3: Genes, Genomes, Genetics jkac313
institution CIMMYT
collection DSpace
country México
countrycode MX
component Bibliográfico
access En linea
databasecode dig-cimmyt
tag biblioteca
region America del Norte
libraryname CIMMYT Library
language English
topic AGRICULTURAL SCIENCES AND BIOTECHNOLOGY
Genomic Selection
Wheat Breeding
Envirotyping
Target Population of Environments
MARKER-ASSISTED SELECTION
CLIMATE CHANGE
WHEAT
BREEDING
ADAPTABILITY
ENVIRONMENT
AGRICULTURAL SCIENCES AND BIOTECHNOLOGY
Genomic Selection
Wheat Breeding
Envirotyping
Target Population of Environments
MARKER-ASSISTED SELECTION
CLIMATE CHANGE
WHEAT
BREEDING
ADAPTABILITY
ENVIRONMENT
spellingShingle AGRICULTURAL SCIENCES AND BIOTECHNOLOGY
Genomic Selection
Wheat Breeding
Envirotyping
Target Population of Environments
MARKER-ASSISTED SELECTION
CLIMATE CHANGE
WHEAT
BREEDING
ADAPTABILITY
ENVIRONMENT
AGRICULTURAL SCIENCES AND BIOTECHNOLOGY
Genomic Selection
Wheat Breeding
Envirotyping
Target Population of Environments
MARKER-ASSISTED SELECTION
CLIMATE CHANGE
WHEAT
BREEDING
ADAPTABILITY
ENVIRONMENT
Costa-Neto, G.
Crespo Herrera, L.A.
Fradgley, N.
Gardner, K.A.
Bentley, A.R.
Dreisigacker, S.
Fritsche-Neto, R.
Montesinos-Lopez, O.A.
Crossa, J.
Envirome-wide associations enhance multi-year genome-based prediction of historical wheat breeding data
description Linking high-throughput environmental data (enviromics) to genomic prediction (GP) is a cost-effective strategy for increasing selection intensity under genotype-by-environment interactions (G × E). This study developed a data-driven approach based on Environment-Phenotype Associations (EPA) aimed at recycling important G × E information from historical breeding data. EPA was developed in two applications: (1) scanning a secondary source of genetic variation, weighted from the shared reaction-norms of past-evaluated genotypes; (2) pinpointing weights of the similarity among trial-sites (locations), given the historical impact of each envirotyping data variable for a given site. These results were then used as a dimensionality reduction strategy, integrating historical data to feed multi-environment GP models, which led to development of four new G × E kernels considering genomics, enviromics and EPA outcomes. The wheat trial data used included 36 locations, eight years and three target populations of environments (TPE) in India. Four prediction scenarios and six kernel-models within/across TPEs were tested. Our results suggest that the conventional GBLUP, without enviromic data or when omitting EPA, is inefficient in predicting the performance of wheat lines in future years. Nevertheless, when EPA was introduced as an intermediary learning step to reduce the dimensionality of the G × E kernels while connecting phenotypic and environmental-wide variation, a significant enhancement of G × E prediction accuracy was evident. EPA revealed that the effect of seasonality makes strategies such as “covariable selection” unfeasible because G × E is year-germplasm specific. We propose that the EPA effectively serves as a “reinforcement learner” algorithm capable of uncovering the effect of seasonality over the reaction-norms, with the benefits of better forecasting the similarities between past and future trialing sites. EPA combines the benefits of dimensionality reduction while reducing the uncertainty of genotype-by-year predictions and increasing the resolution of GP for the genotype-specific level.
format Article
topic_facet AGRICULTURAL SCIENCES AND BIOTECHNOLOGY
Genomic Selection
Wheat Breeding
Envirotyping
Target Population of Environments
MARKER-ASSISTED SELECTION
CLIMATE CHANGE
WHEAT
BREEDING
ADAPTABILITY
ENVIRONMENT
author Costa-Neto, G.
Crespo Herrera, L.A.
Fradgley, N.
Gardner, K.A.
Bentley, A.R.
Dreisigacker, S.
Fritsche-Neto, R.
Montesinos-Lopez, O.A.
Crossa, J.
author_facet Costa-Neto, G.
Crespo Herrera, L.A.
Fradgley, N.
Gardner, K.A.
Bentley, A.R.
Dreisigacker, S.
Fritsche-Neto, R.
Montesinos-Lopez, O.A.
Crossa, J.
author_sort Costa-Neto, G.
title Envirome-wide associations enhance multi-year genome-based prediction of historical wheat breeding data
title_short Envirome-wide associations enhance multi-year genome-based prediction of historical wheat breeding data
title_full Envirome-wide associations enhance multi-year genome-based prediction of historical wheat breeding data
title_fullStr Envirome-wide associations enhance multi-year genome-based prediction of historical wheat breeding data
title_full_unstemmed Envirome-wide associations enhance multi-year genome-based prediction of historical wheat breeding data
title_sort envirome-wide associations enhance multi-year genome-based prediction of historical wheat breeding data
publisher Genetics Society of America
publishDate 2023
url https://hdl.handle.net/10883/22364
work_keys_str_mv AT costanetog enviromewideassociationsenhancemultiyeargenomebasedpredictionofhistoricalwheatbreedingdata
AT crespoherrerala enviromewideassociationsenhancemultiyeargenomebasedpredictionofhistoricalwheatbreedingdata
AT fradgleyn enviromewideassociationsenhancemultiyeargenomebasedpredictionofhistoricalwheatbreedingdata
AT gardnerka enviromewideassociationsenhancemultiyeargenomebasedpredictionofhistoricalwheatbreedingdata
AT bentleyar enviromewideassociationsenhancemultiyeargenomebasedpredictionofhistoricalwheatbreedingdata
AT dreisigackers enviromewideassociationsenhancemultiyeargenomebasedpredictionofhistoricalwheatbreedingdata
AT fritschenetor enviromewideassociationsenhancemultiyeargenomebasedpredictionofhistoricalwheatbreedingdata
AT montesinoslopezoa enviromewideassociationsenhancemultiyeargenomebasedpredictionofhistoricalwheatbreedingdata
AT crossaj enviromewideassociationsenhancemultiyeargenomebasedpredictionofhistoricalwheatbreedingdata
_version_ 1792501486859583488