Knowledge Center Catalog

Local cover image
Local cover image

Envirome-wide associations enhance multi-year genome-based prediction of historical wheat breeding data

By: Contributor(s): Material type: ArticleArticleLanguage: English Publication details: Genetics Society of America, 2023. Bethesda, MD (USA) :ISSN:
  • 2160-1836 (Online)
Subject(s): Online resources: In: G3: Genes, Genomes, Genetics v. 13, no. 2, art. jkac313Summary: Linking high-throughput environmental data (enviromics) to genomic prediction (GP) is a cost-effective strategy for increasing selection intensity under genotype-by-environment interactions (G × E). This study developed a data-driven approach based on Environment-Phenotype Associations (EPA) aimed at recycling important G × E information from historical breeding data. EPA was developed in two applications: (1) scanning a secondary source of genetic variation, weighted from the shared reaction-norms of past-evaluated genotypes; (2) pinpointing weights of the similarity among trial-sites (locations), given the historical impact of each envirotyping data variable for a given site. These results were then used as a dimensionality reduction strategy, integrating historical data to feed multi-environment GP models, which led to development of four new G × E kernels considering genomics, enviromics and EPA outcomes. The wheat trial data used included 36 locations, eight years and three target populations of environments (TPE) in India. Four prediction scenarios and six kernel-models within/across TPEs were tested. Our results suggest that the conventional GBLUP, without enviromic data or when omitting EPA, is inefficient in predicting the performance of wheat lines in future years. Nevertheless, when EPA was introduced as an intermediary learning step to reduce the dimensionality of the G × E kernels while connecting phenotypic and environmental-wide variation, a significant enhancement of G × E prediction accuracy was evident. EPA revealed that the effect of seasonality makes strategies such as “covariable selection” unfeasible because G × E is year-germplasm specific. We propose that the EPA effectively serves as a “reinforcement learner” algorithm capable of uncovering the effect of seasonality over the reaction-norms, with the benefits of better forecasting the similarities between past and future trialing sites. EPA combines the benefits of dimensionality reduction while reducing the uncertainty of genotype-by-year predictions and increasing the resolution of GP for the genotype-specific level.
Tags from this library: No tags from this library for this title. Log in to add tags.
Star ratings
    Average rating: 0.0 (0 votes)
Holdings
Item type Current library Collection Call number Status Date due Barcode Item holds
Article CIMMYT Knowledge Center: John Woolston Library CIMMYT Staff Publications Collection Available
Total holds: 0

Peer review

Open Access

Linking high-throughput environmental data (enviromics) to genomic prediction (GP) is a cost-effective strategy for increasing selection intensity under genotype-by-environment interactions (G × E). This study developed a data-driven approach based on Environment-Phenotype Associations (EPA) aimed at recycling important G × E information from historical breeding data. EPA was developed in two applications: (1) scanning a secondary source of genetic variation, weighted from the shared reaction-norms of past-evaluated genotypes; (2) pinpointing weights of the similarity among trial-sites (locations), given the historical impact of each envirotyping data variable for a given site. These results were then used as a dimensionality reduction strategy, integrating historical data to feed multi-environment GP models, which led to development of four new G × E kernels considering genomics, enviromics and EPA outcomes. The wheat trial data used included 36 locations, eight years and three target populations of environments (TPE) in India. Four prediction scenarios and six kernel-models within/across TPEs were tested. Our results suggest that the conventional GBLUP, without enviromic data or when omitting EPA, is inefficient in predicting the performance of wheat lines in future years. Nevertheless, when EPA was introduced as an intermediary learning step to reduce the dimensionality of the G × E kernels while connecting phenotypic and environmental-wide variation, a significant enhancement of G × E prediction accuracy was evident. EPA revealed that the effect of seasonality makes strategies such as “covariable selection” unfeasible because G × E is year-germplasm specific. We propose that the EPA effectively serves as a “reinforcement learner” algorithm capable of uncovering the effect of seasonality over the reaction-norms, with the benefits of better forecasting the similarities between past and future trialing sites. EPA combines the benefits of dimensionality reduction while reducing the uncertainty of genotype-by-year predictions and increasing the resolution of GP for the genotype-specific level.

Text in English

Costa-Neto, G. : No CIMMYT Affiliation

Montesinos-Lopez, O.A. : No CIMMYT Affiliation

Click on an image to view it in the image viewer

Local cover image

International Maize and Wheat Improvement Center (CIMMYT) © Copyright 2021.
Carretera México-Veracruz. Km. 45, El Batán, Texcoco, México, C.P. 56237.
If you have any question, please contact us at
CIMMYT-Knowledge-Center@cgiar.org