The database contains distributions representing 90 populations (N = 16,751 males) by the frequencies of the published and unpublished Y-chromosome Hgs. These Hgs were combined into 18 different Hgs (C, E, ABDF*, G, H, I1, I2, J1, J2, K*, L, N, O, Q, R1a, R1b, R2, T), so that published sources could be used for comparisons.
As shown in Fig. 1, Middle Eastern (Class 7) and Central European Classes (Class 8) form one non-separable cluster in the central part of the figure. All of the others of the 10 classes can be identified in different well separable areas around this central region. The Central Asian (Class 4) and Northwest Caucasian (Class 9) Classes are in neighbouring areas in the upper and upper-left parts, while the Arab-Dagestanian Class (Class 1) occupies the opposite, lower-left part of the map. The North-Central and Western European (Class 3 + Class 6) as well as the Atlantic (Class 10) Classes form a common branch in the lower-left part of the figure. The opposite, upper-right branch contains the East Baltic (Class 5) and North Eurasian (Class 2) Classes.Forensic Science International: Genetics Supplement Series Available online 26 October 2013
Classification of the Y-haplogroup distributions of Western Eurasian populations using a self-learning algorithm
H. Pamjav et al.
The understanding of historical relationship between populations is a core aspect of human population history studies. We have compared the frequency of 18 different Y-SNP haplogroups in 90 Western Eurasian populations. Classification of haplogroup distribution vectors using a new self-learning classification algorithm so called “self-organizing cloud (SOC)” proved to be an effective tool to identify population groups, which share common paternal genetic features. By means of the algorithm, we have determined 10 different classes of populations based on the similarity of haplogroup composition. The analysis showed that paternal genetic markers tend to reflect geographical proximity of populations better than linguistic relationship, although certain Y-SNP haplogroups have relatively good correlation with specific language families. These observations are based on the comparative analysis of the Hg distributions of contemporary populations may reflect demographic history of them in the past.