The use of unbalanced historical data for genomic selection in an international wheat breeding program

By:

Dawson, J.C

Contributor(s):

Material type: Article

ArticlePublication details: 2013ISSN:

No (Revista en electrónico)
0378-4290

Subject(s):

Online resources:

Open Access through DSpace

In: Field Crops Research v. 154, p. 12-22Summary: Genomic selection (GS) offers breeders the possibility of using historic data and unbalanced breeding trials to form training populations for predicting the performance of new lines. However, when using datasets that are unbalanced over time and space, there is increasing exposure to different genotype ? environment combinations and interactions that may make predictions less accurate. Global cross-validated genomic prediction accuracies may be high when using large historic datasets but accuracies for individual years using a forward-prediction approach, or accuracies for individual locations, are often much lower. The objective of this study was to evaluate the overall accuracy of genomic predictions for untested genotypes using an unbalanced dataset to train a genomic selection model, and to explore ways of combining genomic selection and genotype-by-environment (G×E) interaction models to better target untested lines to different locations. Using the International Center for Maize and Wheat Improvement's (CIMMYT) Semi-Arid Wheat Yield Trials (SAWYT) we assessed the accuracy of genomic predictions and the potential to subset these nurseries using the concept of mega-environments (ME) adapted to a genomic selection context. We found that there was no difference in accuracy between models accounting for G×E interactions and global models. Data-driven methods of clustering locations based on similarities in genomic predictions also failed to improve accuracies within clusters. Using a simulation based on the empirical SAWYT data, we found that if there were different true genotypic values between clusters, there was an advantage to modeling G×E in prediction models. In the SAWYT dataset it appears that there is not a consistent pattern of genotype-by-environment interaction among the ME, and this dataset is not balanced enough to partition into new clusters that have predictive power.

Tags from this library: No tags from this library for this title. Log in to add tags.

Average rating: 0.0 (0 votes)

Holdings
Item type	Current library	Collection	Call number	Status	Date due	Barcode	Item holds
Article	CIMMYT Knowledge Center: John Woolston Library	CIMMYT Staff Publications Collection	CIS-7452 (Browse shelf(Opens below))	Available

Total holds: 0

Browsing CIMMYT Knowledge Center: John Woolston Library shelves, Collection: CIMMYT Staff Publications Collection Close shelf browser (Hides shelf browser)

Previous								Next
Previous	CIS-745 Contrasting farming systems in Morogoro Region, Tanzania	CIS-7450 Using molecular marker order to compare genetic structure in plant populations undergoing selection	CIS-7451 Sample size for detecting transgenic plants using inverse binomial group testing with dilution effect	CIS-7452 The use of unbalanced historical data for genomic selection in an international wheat breeding program	CIS-7453 An assessment of wheat yield sensitivity and breeding gains in hot environments	CIS-7454 A reaction norm model for genomic selection using high-dimensional genomic and environmental data	CIS-7455 Evaluation of fungal isolates as possible biocontrol agents against Striga hermonthica	Next

Peer-review: Yes - Open Access: Yes|http://science.thomsonreuters.com/cgi-bin/jrnlst/jlresults.cgi?PC=MASTER&ISSN=0378-4290

Peer review

Open Access

Genomic selection (GS) offers breeders the possibility of using historic data and unbalanced breeding trials to form training populations for predicting the performance of new lines. However, when using datasets that are unbalanced over time and space, there is increasing exposure to different genotype ? environment combinations and interactions that may make predictions less accurate. Global cross-validated genomic prediction accuracies may be high when using large historic datasets but accuracies for individual years using a forward-prediction approach, or accuracies for individual locations, are often much lower. The objective of this study was to evaluate the overall accuracy of genomic predictions for untested genotypes using an unbalanced dataset to train a genomic selection model, and to explore ways of combining genomic selection and genotype-by-environment (G×E) interaction models to better target untested lines to different locations. Using the International Center for Maize and Wheat Improvement's (CIMMYT) Semi-Arid Wheat Yield Trials (SAWYT) we assessed the accuracy of genomic predictions and the potential to subset these nurseries using the concept of mega-environments (ME) adapted to a genomic selection context. We found that there was no difference in accuracy between models accounting for G×E interactions and global models. Data-driven methods of clustering locations based on similarities in genomic predictions also failed to improve accuracies within clusters. Using a simulation based on the empirical SAWYT data, we found that if there were different true genotypic values between clusters, there was an advantage to modeling G×E in prediction models. In the SAWYT dataset it appears that there is not a consistent pattern of genotype-by-environment interaction among the ME, and this dataset is not balanced enough to partition into new clusters that have predictive power.

Genetic Resources Program|Global Wheat Program

English

Elsevier|CIMMYT Informa No. 1875

CCJL01|INT2692

CIMMYT Staff Publications Collection

Click on an image to view it in the image viewer

Knowledge Center Catalog

The use of unbalanced historical data for genomic selection in an international wheat breeding program

Browsing CIMMYT Knowledge Center: John Woolston Library shelves, Collection: CIMMYT Staff Publications Collection Close shelf browser (Hides shelf browser)

International Maize and Wheat Improvement Center (CIMMYT) © Copyright 2021.
Carretera México-Veracruz. Km. 45, El Batán, Texcoco, México, C.P. 56237.
If you have any question, please contact us at
CIMMYT-Knowledge-Center@cgiar.org

Knowledge Center Catalog

The use of unbalanced historical data for genomic selection in an international wheat breeding program

Browsing CIMMYT Knowledge Center: John Woolston Library shelves, Collection: CIMMYT Staff Publications Collection Close shelf browser (Hides shelf browser)

International Maize and Wheat Improvement Center (CIMMYT) © Copyright 2021. Carretera México-Veracruz. Km. 45, El Batán, Texcoco, México, C.P. 56237. If you have any question, please contact us at CIMMYT-Knowledge-Center@cgiar.org

International Maize and Wheat Improvement Center (CIMMYT) © Copyright 2021.
Carretera México-Veracruz. Km. 45, El Batán, Texcoco, México, C.P. 56237.
If you have any question, please contact us at
CIMMYT-Knowledge-Center@cgiar.org