The modified location model for classifying genetic resources : II. Unrestricted variance–covariance matrices
Franco, J.
The modified location model for classifying genetic resources : II. Unrestricted variance–covariance matrices - Madison (USA) : CSSA, 2002. - Printed
Peer-review: Yes - Open Access: Yes|http://science.thomsonreuters.com/cgi-bin/jrnlst/jlresults.cgi?PC=MASTER&ISSN=0011-183X
When evaluating genetic resources and forming core subsets, gene bank accessions are classified into homogeneous and well-separated groups. The modified location model (MLM) is used in the context of a two-stage clustering strategy in which initial groups are first defined using a hierarchical clustering method (such as Ward) and then the MLM is applied to the groups that are formed (Ward-MLM). The MLM allows assuming correlations (between attributes) and variances (of the attributes) among subpopulations (SPs) to be equal (homogeneous, HOM) or different (heterogeneous, HET). The objectives of this study were (i) to compare the effect of assuming homogeneity with the effect of assuming heterogeneity of variance–covariance matrices on the classification of two simulated data sets using the Ward-MLM strategy; and (ii) to make the same type of comparison using data from maize (Zea mays L.) accessions from nine countries. When simulated HOM data were analyzed with the HOM model and the simulated HET data were analyzed with the HET model, some of the original SPs were represented in the resulting clusters but others changed and formed more separated groups. The HET model always formed the most compact and separated clusters, even for HOM data. Classification of 10 real data sets showed that the HET model produced more compact and well-separated groups than the HOM model. However, only the HOM model identified and grouped a small number of observations with very peculiar attributes. Although the HET model may suffice in most situations, the recommended strategy when classifying genetic resources would be to use both models.
Text in English
0011-183X 1435-0653 (Online)
https://doi.org/10.2135/cropsci2002.1727
Data analysis
Gene banks
Maize
Methods
Models
Genetic resources
The modified location model for classifying genetic resources : II. Unrestricted variance–covariance matrices - Madison (USA) : CSSA, 2002. - Printed
Peer-review: Yes - Open Access: Yes|http://science.thomsonreuters.com/cgi-bin/jrnlst/jlresults.cgi?PC=MASTER&ISSN=0011-183X
When evaluating genetic resources and forming core subsets, gene bank accessions are classified into homogeneous and well-separated groups. The modified location model (MLM) is used in the context of a two-stage clustering strategy in which initial groups are first defined using a hierarchical clustering method (such as Ward) and then the MLM is applied to the groups that are formed (Ward-MLM). The MLM allows assuming correlations (between attributes) and variances (of the attributes) among subpopulations (SPs) to be equal (homogeneous, HOM) or different (heterogeneous, HET). The objectives of this study were (i) to compare the effect of assuming homogeneity with the effect of assuming heterogeneity of variance–covariance matrices on the classification of two simulated data sets using the Ward-MLM strategy; and (ii) to make the same type of comparison using data from maize (Zea mays L.) accessions from nine countries. When simulated HOM data were analyzed with the HOM model and the simulated HET data were analyzed with the HET model, some of the original SPs were represented in the resulting clusters but others changed and formed more separated groups. The HET model always formed the most compact and separated clusters, even for HOM data. Classification of 10 real data sets showed that the HET model produced more compact and well-separated groups than the HOM model. However, only the HOM model identified and grouped a small number of observations with very peculiar attributes. Although the HET model may suffice in most situations, the recommended strategy when classifying genetic resources would be to use both models.
Text in English
0011-183X 1435-0653 (Online)
https://doi.org/10.2135/cropsci2002.1727
Data analysis
Gene banks
Maize
Methods
Models
Genetic resources