Allelic Variation at the VERNALIZATION-A1, VRN-B1, VRN-B3 and PHOTOPERIOD-A1 Genes In
Total Page:16
File Type:pdf, Size:1020Kb
Allelic variation at the VERNALIZATION-A1, VRN-B1, VRN-B3 and PHOTOPERIOD-A1 genes in cultivars of Triticum durum Desf. Alexandr Muterko, Ruslan Kalendar, Elena Salina
Supplementary material 2
Algorithm of the controlled hierarchical clustering In present time there are numerous algorithms for sorting data sets based on hierarchical clustering. The main task of these methods is calculation of the measure similarity between objects and the further grouping of similar objects in clusters. However, in case when need to perform a sorting of objects where it is important not so much definition of similarity between objects, for their classification, but their grouping, based on specified hierarchy of compared attributes, these methods make no sense, because does not allow a priori determine the rules of clustering, i.e. are "uncontrolled". The algorithm of controlled hierarchical clustering of data set with categorical variables (attributes), based on agglomerative method of sequential clustering was developed. According to this algorithm in the first time the hierarchical structure of attributes is formed in required order (with specified distribution on tiers) based on their combinations in given data set, with the following clustering of this data set based on inferred hierarchy. Thus, the key moment of present method is possibility of a priori determination of order of attributes on which the sequential clustering of data set is carried out. This allows us to manipulate by the clustering, forming hierarchy of attributes depending on required representation of the final graph (dendrogram). Such, for a data set containing combination from n attributes the n! variants of hierarchy structure and same number of cluster representation can be formed. Thus, in contrast to most of the existing, "uncontrolled" methods of cluster analysis, where final dendrogram is one of the n! variants, which mostly is dependent on selected clustering algorithm, the method of controlled hierarchical clustering allows to increase informativeness of cluster analysis to maximum level through using of all potential variants (n!) of hierarchical structure and final dendrogram. PHP implementation
#======Controlled sequential hierarchical clustering ======#======Alexandr Muterko (2015) ======# first row must contain names of columns # first column must contain names of objects (varieties) #======HTML interface ======if(!$_POST['DataCSV']){//for example $_POST['DataCSV']="Variety;Country;Gene_A;Gene_B;Gene_C;Gene_D\nVar1 ;Ukraine;A1;B2;C1;D1\nVar2;Ukraine;A1;B1;C1;D1\nVar3;Ukraine;A2;B2;C 1;D1\nVar4;China;A2;B1;C2;D2\nVar5;China;A1;B2;C1;D1\nVar6;Russia;A1 ;B2;C2;D1\nVar7;Russia;A1;B1;C1;D1\nVar8;USA;A2;B2;C2;D1\nVar9;USA;A 1;B1;C2;D1\nVar10;Czech Republic;A1;B2;C2;D1\nVar11;Czech Republic;A1;B2;C1;D1"; $_POST['Order']="13240"; echo 'Example
'; } echo'
'.++$ind.' | '.join($comb,'|').' | '. $nameS.' | '.round($freqComb*100,2).'% |
Order traits: '.rtrim($OrdTrSt,'=> ').'
Combination tree '. $Tree.' | Hierarchy clustering '. $ClustTree.' |