Knowledge Center Catalog

Genomic-enabled prediction with classification algorithms (Record no. 30447)

MARC details
000 -LEADER
fixed length control field 05337nab a22005537a 4500
001 - CONTROL NUMBER
control field G98583
003 - CONTROL NUMBER IDENTIFIER
control field MX-TxCIM
005 - DATE AND TIME OF LATEST TRANSACTION
control field 20240919020947.0
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION
fixed length control field 121211b |||p||p||||||| |z||| |
022 ## - INTERNATIONAL STANDARD SERIAL NUMBER
International Standard Serial Number 1365-2540 (Revista en electrónico)
022 0# - INTERNATIONAL STANDARD SERIAL NUMBER
International Standard Serial Number 0018-067X
024 8# - OTHER STANDARD IDENTIFIER
Standard number or code https://doi.org/10.1038/hdy.2013.144
040 ## - CATALOGING SOURCE
Original cataloging agency MX-TxCIM
041 0# - LANGUAGE CODE
Language code of text/sound track or separate title En
090 ## - LOCALLY ASSIGNED LC-TYPE CALL NUMBER (OCLC); LOCAL CALL NUMBER (RLIN)
Classification number (OCLC) (R) ; Classification number, CALL (RLIN) (NR) CIS-7464
100 1# - MAIN ENTRY--PERSONAL NAME
Personal name Ornella, L.
245 10 - TITLE STATEMENT
Title Genomic-enabled prediction with classification algorithms
260 ## - PUBLICATION, DISTRIBUTION, ETC. (IMPRINT)
Date of publication, distribution, etc. 2014
500 ## - GENERAL NOTE
General note Peer-review: Yes - Open Access: Yes |http://science.thomsonreuters.com/cgi-bin/jrnlst/jlresults.cgi?PC=MASTER&ISSN=0018-067X
500 ## - GENERAL NOTE
General note Peer review
500 ## - GENERAL NOTE
General note Open Access
520 ## - SUMMARY, ETC.
Summary, etc. Pearson?s correlation coefficient (ρ) is the most commonly reported metric of the success of prediction in genomic selection (GS). However, in real breeding ρ may not be very useful for assessing the quality of the regression in the tails of the distribution, where individuals are chosen for selection. This research used 14 maize and 16 wheat data sets with different trait?environment combinations. Six different models were evaluated by means of a cross-validation scheme (50 random partitions each, with 90% of the individuals in the training set and 10% in the testing set). The predictive accuracy of these algorithms for selecting individuals belonging to the best α=10, 15, 20, 25, 30, 35, 40% of the distribution was estimated using Cohen?s kappa coefficient (κ) and an ad hoc measure, which we call relative efficiency (RE), which indicates the expected genetic gain due to selection when individuals are selected based on GS exclusively. We put special emphasis on the analysis for α=15%, because it is a percentile commonly used in plant breeding programmes (for example, at CIMMYT). We also used ρ as a criterion for overall success. The algorithms used were: Bayesian LASSO (BL), Ridge Regression (RR), Reproducing Kernel Hilbert Spaces (RHKS), Random Forest Regression (RFR), and Support Vector Regression (SVR) with linear (lin) and Gaussian kernels (rbf). The performance of regression methods for selecting the best individuals was compared with that of three supervised classification algorithms: Random Forest Classification (RFC) and Support Vector Classification (SVC) with linear (lin) and Gaussian (rbf) kernels. Classification methods were evaluated using the same cross-validation scheme but with the response vector of the original training sets dichotomised using a given threshold. For α=15%, SVC-lin presented the highest κ coefficients in 13 of the 14 maize data sets, with best values ranging from 0.131 to 0.722 (statistically significant in 9 data sets) and the best RE in the same 13 data sets, with values ranging from 0.393 to 0.948 (statistically significant in 12 data sets). RR produced the best mean for both κ and RE in one data set (0.148 and 0.381, respectively). Regarding the wheat data sets, SVC-lin presented the best κ in 12 of the 16 data sets, with outcomes ranging from 0.280 to 0.580 (statistically significant in 4 data sets) and the best RE in 9 data sets ranging from 0.484 to 0.821 (statistically significant in 5 data sets). SVC-rbf (0.235), RR (0.265) and RHKS (0.422) gave the best κ in one data set each, while RHKS and BL tied for the last one (0.234). Finally, BL presented the best RE in two data sets (0.738 and 0.750), RFR (0.636) and SVC-rbf (0.617) in one and RHKS in the remaining three (0.502, 0.458 and 0.586). The difference between the performance of SVC-lin and that of the rest of the models was not so pronounced at higher percentiles of the distribution. The behaviour of regression and classification algorithms varied markedly when selection was done at different thresholds, that is, κ and RE for each algorithm depended strongly on the selection percentile. Based on the results, we propose classification method as a promising alternative for GS in plant breeding.
536 ## - FUNDING INFORMATION NOTE
Text of note Global Maize Program|Genetic Resources Program|Global Wheat Program
546 ## - LANGUAGE NOTE
Language note English
591 ## - CATALOGING NOTES
Affiliation CIMMYT Informa No. 1876|Nature Publishing Group
594 ## - STAFFID
StaffID INT3239|INT3400|INT3098|INT3035|INT2902|INT2692|INT0610|CCJL01
595 ## - COLLECTION
Collection CIMMYT Staff Publications Collection
650 10 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element Genomic selection
9 (RLIN) 1513
650 #7 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element Maize
Miscellaneous information AGROVOC
Source of heading or term
9 (RLIN) 1173
650 10 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element support vector machines
650 #7 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element Wheat
Miscellaneous information AGROVOC
Source of heading or term
9 (RLIN) 1310
650 #7 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element Artificial Selection
9 (RLIN) 8685
Source of heading or term AGROVOC
650 #7 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element Statistical methods
9 (RLIN) 2624
Source of heading or term AGROVOC
700 1# - ADDED ENTRY--PERSONAL NAME
Personal name Gonzlez-Camacho, J.M.,
Relator term coaut.
700 1# - ADDED ENTRY--PERSONAL NAME
Personal name Long, N.,
Relator term coaut.
9 (RLIN) 576
700 1# - ADDED ENTRY--PERSONAL NAME
Personal name Perez, P.,
Relator term coaut.
700 1# - ADDED ENTRY--PERSONAL NAME
Personal name Tapia, E.,
Relator term coaut.
700 1# - ADDED ENTRY--PERSONAL NAME
Personal name Vicente, F.S.,
Relator term coaut.
700 1# - ADDED ENTRY--PERSONAL NAME
9 (RLIN) 892
Personal name Sukhwinder-Singh
Miscellaneous information Genetic Resources Program
Field link and sequence number INT3098
Relator term coaut.
700 1# - ADDED ENTRY--PERSONAL NAME
9 (RLIN) 907
Personal name Burgueño, J.
Miscellaneous information Genetic Resources Program
Field link and sequence number INT3239
700 0# - ADDED ENTRY--PERSONAL NAME
Personal name Xuecai Zhang
Miscellaneous information Global Maize Program
Field link and sequence number INT3400
9 (RLIN) 951
700 1# - ADDED ENTRY--PERSONAL NAME
Personal name Singh, R.P.
Miscellaneous information Global Wheat Program
Field link and sequence number INT0610
9 (RLIN) 825
700 1# - ADDED ENTRY--PERSONAL NAME
9 (RLIN) 851
Personal name Dreisigacker, S.
Miscellaneous information Global Wheat Program
Field link and sequence number INT2692
700 1# - ADDED ENTRY--PERSONAL NAME
9 (RLIN) 871
Personal name Bonnett, D.G.
Miscellaneous information Global Wheat Program
Field link and sequence number INT2902
Relator term coaut.
700 1# - ADDED ENTRY--PERSONAL NAME
Personal name Crossa, J.
Miscellaneous information Genetic Resources Program
Field link and sequence number CCJL01
9 (RLIN) 59
773 0# - HOST ITEM ENTRY
Title Heredity
Related parts v. 112, p. 616-626
856 4# - ELECTRONIC LOCATION AND ACCESS
Uniform Resource Identifier https://hdl.handle.net/10883/19766
Link text Open Access through DSpace
942 ## - ADDED ENTRY ELEMENTS (KOHA)
Koha item type Article
Source of classification or shelving scheme Dewey Decimal Classification
Suppress in OPAC No
Holdings
Date last seen Total Checkouts Full call number Price effective from Koha item type Lost status Damaged status Not for loan Collection code Withdrawn status Home library Current library Date acquired
07/03/2017   CIS-7464 07/03/2017 Article Not Lost     CIMMYT Staff Publications Collection   CIMMYT Knowledge Center: John Woolston Library CIMMYT Knowledge Center: John Woolston Library 07/03/2017

International Maize and Wheat Improvement Center (CIMMYT) © Copyright 2021.
Carretera México-Veracruz. Km. 45, El Batán, Texcoco, México, C.P. 56237.
If you have any question, please contact us at
CIMMYT-Knowledge-Center@cgiar.org