Showing posts with label haplogroup. Show all posts
Showing posts with label haplogroup. Show all posts

July 11, 2016

Y-chromosome haplogroup N phylogeny resolved

AJHG Volume 99, Issue 1, p163–173, 7 July 2016

Human Y Chromosome Haplogroup N: A Non-trivial Time-Resolved Phylogeography that Cuts across Language Families

Anne-Mai Ilumäe et al.

The paternal haplogroup (hg) N is distributed from southeast Asia to eastern Europe. The demographic processes that have shaped the vast extent of this major Y chromosome lineage across numerous linguistically and autosomally divergent populations have previously been unresolved. On the basis of 94 high-coverage re-sequenced Y chromosomes, we establish and date a detailed hg N phylogeny. We evaluate geographic structure by using 16 distinguishing binary markers in 1,631 hg N Y chromosomes from a collection of 6,521 samples from 56 populations. The more southerly distributed sub-clade N4 emerged before N2a1 and N3, found mostly in the north, but the latter two display more elaborate branching patterns, indicative of regional contrasts in recent expansions. In particular, a number of prominent and well-defined clades with common N3a3’6 ancestry occur in regionally dissimilar northern Eurasian populations, indicating almost simultaneous regional diversification and expansion within the last 5,000 years. This patrilineal genetic affinity is decoupled from the associated higher degree of language diversity.

Link

May 19, 2016

35,000 year old mtDNA haplogroup U6 from Romania

I wouldn't be very surprised if many of the markers supposedly signifying recent gene flow Africa and Eurasia were actually quite old in Eurasia. The trouble is that reports of such gene flow were often based on simply observing that marker "X" occurs at a higher frequency in Africa than in Eurasia, so a common sense explanation is that it reflects limited recent gene flow between the continents. But, it is now known that common sense is not always the best guide, as e.g., ancient Europeans had mtDNA haplogroup M (in the past considered evidence of Asian admixture), Y-chromosome haplogroup C (ditto), and now U6.

The same should also apply to the Middle East where there has been admixture with Africans since the Islamic period at least. The existence of such admixture does not mean that every single lineage that occurs at low frequency in the Middle East and high frequency in Africa is diagnostic of this later period of admixture. Some of them could well be relics of old Middle Eastern populations. Who knows what people inhabited the presently inhospitable landscape of the Saharan-Arabian desert zone? The living populations can certainly make no claim to being the first ones there, but the genetic heritage of those earlier occupants may still persist in them in traces.

Similarly for the New World; in that case, there is a better case that European-looking lineages are indeed due to the colonization of the Americas over the last five centuries. However, that does not mean that all of them are, and we should be mindful of the possibility of pre-Columbian contact between the Old and New worlds.

Scientific Reports 6, Article number: 25501 (2016)

The mitogenome of a 35,000-year-old Homo sapiens from Europe supports a Palaeolithic back-migration to Africa

M. Hervella et al.

After the dispersal of modern humans (Homo sapiens) Out of Africa, hominins with a similar morphology to that of present-day humans initiated the gradual demographic expansion into Eurasia. The mitogenome (33-fold coverage) of the Peştera Muierii 1 individual (PM1) from Romania (35 ky cal BP) we present in this article corresponds fully to Homo sapiens, whilst exhibiting a mosaic of morphological features related to both modern humans and Neandertals. We have identified the PM1 mitogenome as a basal haplogroup U6*, not previously found in any ancient or present-day humans. The derived U6 haplotypes are predominantly found in present-day North-Western African populations. Concomitantly, those found in Europe have been attributed to recent gene-flow from North Africa. The presence of the basal haplogroup U6* in South East Europe (Romania) at 35 ky BP confirms a Eurasian origin of the U6 mitochondrial lineage. Consequently, we propose that the PM1 lineage is an offshoot to South East Europe that can be traced to the Early Upper Paleolithic back migration from Western Asia to North Africa, during which the U6 lineage diversified, until the emergence of the present-day U6 African lineages.

Link

April 25, 2016

Bursts in human male demography

From the paper:
When the tree is calibrated with a mutation rate estimate of 0.76 × 10-9 mutations per base pair per year9, the time to the most recent common ancestor (TMRCA) of the tree is ~190,000 years, but we consider the implications of alternative mutation rate estimates below. Of the clades resulting from the four deepest branching events, all but one are exclusive to Africa, and the TMRCA of all non-African lineages (that is, the TMRCA of haplogroups DE and CF) is ~76,000 years (Fig. 1, Supplementary Figs. 18 and 19, Supplementary Table 10, and Supplementary Note). We saw a notable increase in the number of lineages outside Africa ~50–55 kya, perhaps reflecting the geographical expansion and differentiation of Eurasian populations as they settled the vast expanse of these continents. Consistent with previous proposals14, a parsimonious interpretation of the phylogeny is that the predominant African haplogroup, haplogroup E, arose outside the continent. This model of geographical segregation within the CT clade requires just one continental haplogroup exchange (E to Africa), rather than three (D, C, and F out of Africa). Furthermore, the timing of this putative return to Africa—between the emergence of haplogroup E and its differentiation within Africa by 58 kya—is consistent with proposals, based on non–Y chromosome data, of abundant gene flow between Africa and nearby regions of Asia 50–80 kya15.
I've long argued for the Y-chromosome haplogroup E migration into Africa and it is nice to see this common-sense interpretation finally adopted. Too much focus has been placed on figuring out which routes modern humans took out of Africa, and not at all to figure out how Eurasian males came to overwhelm the African Y-chromosome gene pool so decisively. The Eurasian migration into Africa must have taken place in the ~70-60kya window, constrained by the D/E split and  the deepest intra-African E splits. I think that the Out-of-Arabia scenario I outlined in 2012 continues to make a lot of sense. It would be awesome to get data from the first Later Stone Age people from Africa which are probably the best bet to trace this migration from Eurasia into Sub-Saharan Africa.

Nature Genetics (2016) doi:10.1038/ng.3559

Punctuated bursts in human male demography inferred from 1,244 worldwide Y-chromosome sequences

G David Poznik et al.

We report the sequences of 1,244 human Y chromosomes randomly ascertained from 26 worldwide populations by the 1000 Genomes Project. We discovered more than 65,000 variants, including single-nucleotide variants, multiple-nucleotide variants, insertions and deletions, short tandem repeats, and copy number variants. Of these, copy number variants contribute the greatest predicted functional impact. We constructed a calibrated phylogenetic tree on the basis of binary single-nucleotide variants and projected the more complex variants onto it, estimating the number of mutations for each class. Our phylogeny shows bursts of extreme expansion in male numbers that have occurred independently among each of the five continental superpopulations examined, at times of known migrations and technological innovations.

Link

December 22, 2015

Refining Y-chromosome phylogeny with South African sequences

bioRxiv http://dx.doi.org/10.1101/034983

Refining the Y chromosome phylogeny with southern African sequences

Chiara Barbieri, Alexander Hübner, Enrico Macholdt, Shengyu Ni, Sebastian Lippold, Roland Schröder, Sununguko Wata Mpoloka, Josephine Purps, Lutz Roewer, Mark Stoneking, Brigitte Pakendorf

The recent availability of large-scale sequence data for the human Y chromosome has revolutionized analyses of and insights gained from this non-recombining, paternally inherited chromosome. However, the studies to date focus on Eurasian variation, and hence the diversity of early-diverging branches found in Africa has not been adequately documented. Here we analyze over 900 kb of Y chromosome sequence obtained from 547 individuals from southern African Khoisan and Bantu-speaking populations, identifying 232 new sequences from basal haplogroups A and B. We find new branches within haplogroups A2 and A3b1 and suggest that the prehistory of haplogroup B2a is more complex than previously suspected; this haplogroup is likely to have existed in Khoisan groups before the arrival of Bantu-speakers, who brought additional B2a lineages to southern Africa. Furthermore, we estimate older dates than obtained previously for both the A2-T node within the human Y chromosome phylogeny and for some individual haplogroups. Finally, there is pronounced variation in branch length between major haplogroups; haplogroups associated with Bantu-speakers have significantly longer branches. This likely reflects a combination of biases in the SNP calling process and demographic factors, such as an older average paternal age (hence a higher mutation rate), a higher effective population size, and/or a stronger effect of population expansion for Bantu-speakers than for Khoisan groups.

Link

November 16, 2015

West_Asian in the flesh (hunter-gatherers from Georgia) (Jones et al. 2015)

Years ago, I detected the presence of a West_Asian genetic component (with dual modes in "Caucasus" and "Gedrosia") whose origins I placed in the "highlands of West Asia" and which I proposed spread into Europe post-5kya with Indo-European languages.

Earlier this year, the study by Haak et al. showed that steppe invaders after 5kya brought into Europe a 50/50 mix of "Eastern European Hunter-Gatherer" (EHG) ancestry/An unknown population from the Near East/Caucasus. The "unknown population" was most similar to Caucasians/Near Easterners like Armenians but did not correspond to any ancient sample.

A new paper in Nature Communications by Jones et al. finds this "missing link" in the flesh in Upper Paleolithic/Mesolithic hunter-gatherers from Georgia which they call "Caucasus Hunter-Gatherers" (CHG). From the paper:
The separation between CHG and both EF and WHG ended during the Early Bronze Age when a major ancestral component linked to CHG was carried west by migrating herders from the Eurasian Steppe. The foundation group for this seismic change was the Yamnaya, who we estimate to owe half of their ancestry to CHG-linked sources.
The authors also make the connection to South Asia:
In modern populations, the impact of CHG also stretches beyond Europe to the east. Central and South Asian populations received genetic influx from CHG (or a population close to them), as shown by a prominent CHG component in ADMIXTURE (Supplementary Fig. 5; Supplementary Note 9) and admixture f3-statistics, which show many samples as a mix of CHG and another South Asian population (Fig. 4b; Supplementary Table 9).
Also of interest:
Both Georgian hunter-gatherer samples were assigned to haplogroup J with Kotias belonging to the subhaplogroup J2a (see methods).
The paper is open access, so go ahead and read it for other details.

Nature Communications 6, Article number: 8912 doi:10.1038/ncomms9912

Upper Palaeolithic genomes reveal deep roots of modern Eurasians

Eppie R. Jones et al.

We extend the scope of European palaeogenomics by sequencing the genomes of Late Upper Palaeolithic (13,300 years old, 1.4-fold coverage) and Mesolithic (9,700 years old, 15.4-fold) males from western Georgia in the Caucasus and a Late Upper Palaeolithic (13,700 years old, 9.5-fold) male from Switzerland. While we detect Late Palaeolithic–Mesolithic genomic continuity in both regions, we find that Caucasus hunter-gatherers (CHG) belong to a distinct ancient clade that split from western hunter-gatherers ~45 kya, shortly after the expansion of anatomically modern humans into Europe and from the ancestors of Neolithic farmers ~25 kya, around the Last Glacial Maximum. CHG genomes significantly contributed to the Yamnaya steppe herders who migrated into Europe ~3,000 BC, supporting a formative Caucasus influence on this important Early Bronze age culture. CHG left their imprint on modern populations from the Caucasus and also central and south Asia possibly marking the arrival of Indo-Aryan languages.

Link

July 12, 2015

Phylogeographic refinement of haplogroup E

Genome Biol Evol (2015) 7 (7): 1940-1950.

Phylogeographic Refinement and Large Scale Genotyping of Human Y Chromosome Haplogroup E Provide New Insights into the Dispersal of Early Pastoralists in the African Continent

Beniamino Trombetta et al.

Haplogroup E, defined by mutation M40, is the most common human Y chromosome clade within Africa. To increase the level of resolution of haplogroup E, we disclosed the phylogenetic relationships among 729 mutations found in 33 haplogroup DE Y-chromosomes sequenced at high coverage in previous studies. Additionally, we dissected the E-M35 subclade by genotyping 62 informative markers in 5,222 samples from 118 worldwide populations. The phylogeny of haplogroup E showed novel features compared with the previous topology, including a new basal dichotomy. Within haplogroup E-M35, we resolved all the previously known polytomies and assigned all the E-M35* chromosomes to five new different clades, all belonging to a newly identified subhaplogroup (E-V1515), which accounts for almost half of the E-M35 chromosomes from the Horn of Africa. Moreover, using a Bayesian phylogeographic analysis and a single nucleotide polymorphism-based approach we localized and dated the origin of this new lineage in the northern part of the Horn, about 12 ka. Time frames, phylogenetic structuring, and sociogeographic distribution of E-V1515 and its subclades are consistent with a multistep demic spread of pastoralism within north-eastern Africa and its subsequent diffusion to subequatorial areas. In addition, our results increase the discriminative power of the E-M35 haplogroup for use in forensic genetics through the identification of new ancestry-informative markers.

Link

May 21, 2015

More Y-chromosome super-fathers

The time estimates are based on a mutation rate of 1x10-9 mutations/bp/year which is ~1/3 higher than mutation rate of Karmin et al.  So the values on the table may be a little lower.

There may be additional founders with recent time depths than shown in the table, e.g., a very shallow clusters within E-M35 (probably E-V13?) and a couple of shallow clusters within I-P215

Also of interest is the fact that Greeks and Anatolian Turks do not show evidence of the recent Y-chromosomal bottleneck:
The plots are consistent with patterns seen in the relative numbers of singletons, described above, in that the Saami and Palestinians show markedly different demographic histories compared with the rest, featuring very recent reductions, while the Turks and Greeks show evidence of general expansion, with increased growth rate around 14 KYA. A different pattern is seen in the remaining majority (13/17) of populations, which share remarkably similar histories featuring a minimum effective population size ~2.1–4.2 KYA (considering the 95% confidence intervals (CIs) reported in Supplementary Table 4), followed by expansion to the present.


Related:
Nature Communications 6, Article number: 7152 doi:10.1038/ncomms8152

Large-scale recent expansion of European patrilineages shown by population resequencing

Chiara Batini, Pille Hallast et al.

The proportion of Europeans descending from Neolithic farmers ~10 thousand years ago (KYA) or Palaeolithic hunter-gatherers has been much debated. The male-specific region of the Y chromosome (MSY) has been widely applied to this question, but unbiased estimates of diversity and time depth have been lacking. Here we show that European patrilineages underwent a recent continent-wide expansion. Resequencing of 3.7 Mb of MSY DNA in 334 males, comprising 17 European and Middle Eastern populations, defines a phylogeny containing 5,996 single-nucleotide polymorphisms. Dating indicates that three major lineages (I1, R1a and R1b), accounting for 64% of our sample, have very recent coalescent times, ranging between 3.5 and 7.3 KYA. A continuous swathe of 13/17 populations share similar histories featuring a demographic expansion starting ~2.1–4.2 KYA. Our results are compatible with ancient MSY DNA data, and contrast with data on mitochondrial DNA, indicating a widespread male-specific phenomenon that focuses interest on the social structure of Bronze Age Europe.

Link

May 12, 2015

mtDNA haplogroup A10 in Bronze Age West Siberia

PLoS ONE 10(5): e0127182. doi:10.1371/journal.pone.0127182

MtDNA Haplogroup A10 Lineages in Bronze Age Samples Suggest That Ancient Autochthonous Human Groups Contributed to the Specificity of the Indigenous West Siberian Population

Aleksandr S. Pilipenko et al.

Abstract

Background

The craniometric specificity of the indigenous West Siberian human populations cannot be completely explained by the genetic interactions of the western and eastern Eurasian groups recorded in the archaeology of the area from the beginning of the 2nd millennium BC. Anthropologists have proposed another probable explanation: contribution to the genetic structure of West Siberian indigenous populations by ancient human groups, which separated from western and eastern Eurasian populations before the final formation of their phenotypic and genetic features and evolved independently in the region over a long period of time. This hypothesis remains untested. From the genetic point of view, it could be confirmed by the presence in the gene pool of indigenous populations of autochthonous components that evolved in the region over long time periods. The detection of such components, particularly in the mtDNA gene pool, is crucial for further clarification of early regional genetic history.

Results and Conclusion

We present the results of analysis of mtDNA samples (n = 10) belonging to the A10 haplogroup, from Bronze Age populations of West Siberian forest-steppe (V—I millennium BC), that were identified in a screening study of a large diachronic sample (n = 96). A10 lineages, which are very rare in modern Eurasian populations, were found in all the Bronze Age groups under study. Data on the A10 lineages’ phylogeny and phylogeography in ancient West Siberian and modern Eurasian populations suggest that A10 haplogroup underwent a long-term evolution in West Siberia or arose there autochthonously; thus, the presence of A10 lineages indicates the possible contribution of early autochthonous human groups to the genetic specificity of modern populations, in addition to contributions of later interactions of western and eastern Eurasian populations.

Link

May 03, 2015

Structure of Y-haplogroup N

arXiv:1504.06463 [q-bio.PE]

The dichotomy structure of Y chromosome Haplogroup N

Kang Hu et al.

Haplogroup N-M231 of human Y chromosome is a common clade from Eastern Asia to Northern Europe, being one of the most frequent haplogroups in Altaic and Uralic-speaking populations. Using newly discovered bi-allelic markers from high-throughput DNA sequencing, we largely improved the phylogeny of Haplogroup N, in which 16 subclades could be identified by 33 SNPs. More than 400 males belonging to Haplogroup N in 34 populations in China were successfully genotyped, and populations in Northern Asia and Eastern Europe were also compared together. We found that all the N samples were typed as inside either clade N1-F1206 (including former N1a-M128, N1b-P43 and N1c-M46 clades), most of which were found in Altaic, Uralic, Russian and Chinese-speaking populations, or N2-F2930, common in Tibeto-Burman and Chinese-speaking populations. Our detailed results suggest that Haplogroup N developed in the region of China since the final stage of late Paleolithic Era.

Link

April 13, 2015

Haplogroup G1, Y-chromosome mutation rate and migrations of Iranic speakers

The origin of Iranian speakers is a big puzzle as in ancient times there were two quite different groups of such speakers: nomadic steppe people such as Scythians and settled farmers such as Persians and Medes.

I am guessing that the story of Iranian origins will only be solved in correlation to their Indo-Aryan brethren and their more distant Indo-European relations.

Clearly, G1 cannot be Proto-Indo-European as it has a rather limited distribution in Eurasia, but it could very well have been a marker of a subset of Indo-Europeans. If it was present in ancestral Iranians, then this would geographically constrain the places where ancestral Iranians were formed.

PLoS ONE 10(4): e0122968. doi:10.1371/journal.pone.0122968

Deep Phylogenetic Analysis of Haplogroup G1 Provides Estimates of SNP and STR Mutation Rates on the Human Y-Chromosome and Reveals Migrations of Iranic Speakers

Oleg Balanovsky et al.

Y-chromosomal haplogroup G1 is a minor component of the overall gene pool of South-West and Central Asia but reaches up to 80% frequency in some populations scattered within this area. We have genotyped the G1-defining marker M285 in 27 Eurasian populations (n= 5,346), analyzed 367 M285-positive samples using 17 Y-STRs, and sequenced ~11 Mb of the Y-chromosome in 20 of these samples to an average coverage of 67X. This allowed detailed phylogenetic reconstruction. We identified five branches, all with high geographical specificity: G1-L1323 in Kazakhs, the closely related G1-GG1 in Mongols, G1-GG265 in Armenians and its distant brother clade G1-GG162 in Bashkirs, and G1-GG362 in West Indians. The haplotype diversity, which decreased from West Iran to Central Asia, allows us to hypothesize that this rare haplogroup could have been carried by the expansion of Iranic speakers northwards to the Eurasian steppe and via founder effects became a predominant genetic component of some populations, including the Argyn tribe of the Kazakhs. The remarkable agreement between genetic and genealogical trees of Argyns allowed us to calibrate the molecular clock using a historical date (1405 AD) of the most recent common genealogical ancestor. The mutation rate for Y-chromosomal sequence data obtained was 0.78×10-9 per bp per year, falling within the range of published rates. The mutation rate for Y-chromosomal STRs was 0.0022 per locus per generation, very close to the so-called genealogical rate. The “clan-based” approach to estimating the mutation rate provides a third, middle way between direct farther-to-son comparisons and using archeologically known migrations, whose dates are subject to revision and of uncertain relationship to genetic events.

Link

January 14, 2015

Y chromosome super-fathers

This is a very exciting paper. Most of these lineages are so young that there are good chances that their founders were figures from history or mythology.

The most interesting one is DC2 which was also found in south Siberian Kurgans, belongs to haplogroup R1a1 and is given an age of 3,284 years by the authors (with some almost impossibly wide confidence intervals). Based on its distribution, and if a Bronze Age date is right, it is difficult to see in it anything other than a patrilineage that was present in Proto-Indo-Iranians.


European Journal of Human Genetics advance online publication 14 January 2015; doi: 10.1038/ejhg.2014.285

Y-chromosome descent clusters and male differential reproductive success: young lineage expansions dominate Asian pastoral nomadic populations

Patricia Balaresque et al.

High-frequency microsatellite haplotypes of the male-specific Y-chromosome can signal past episodes of high reproductive success of particular men and their patrilineal descendants. Previously, two examples of such successful Y-lineages have been described in Asia, both associated with Altaic-speaking pastoral nomadic societies, and putatively linked to dynasties descending, respectively, from Genghis Khan and Giocangga. Here we surveyed a total of 5321 Y-chromosomes from 127 Asian populations, including novel Y-SNP and microsatellite data on 461 Central Asian males, to ask whether additional lineage expansions could be identified. Based on the most frequent eight-microsatellite haplotypes, we objectively defined 11 descent clusters (DCs), each within a specific haplogroup, that represent likely past instances of high male reproductive success, including the two previously identified cases. Analysis of the geographical patterns and ages of these DCs and their associated cultural characteristics showed that the most successful lineages are found both among sedentary agriculturalists and pastoral nomads, and expanded between 2100 BCE and 1100 CE. However, those with recent origins in the historical period are almost exclusively found in Altaic-speaking pastoral nomadic populations, which may reflect a shift in political organisation in pastoralist economies and a greater ease of transmission of Y-chromosomes through time and space facilitated by the use of horses.

Link

January 01, 2015

Y-chromosome tree bursts to leaf (Hallast, Batini, Zadik et al. 2014)

This is an extremely important study of Y-chromosome variation, the most intriguing part of which are the copious references to a yet to come manuscript:
Elsewhere (Batini C, Hallast P, Zadik D, Maisano Delser P, Benazzo A, Ghirotto S, Arroyo-Pardo E, Cavalleri GL, de Knijff P, Dupuy BM, Eriksen H, King TE, López de Munain A, López-Parra AM, Milasin J, Novelletto A, Pamjav H, Sajantila A, Tolun A, Winney B and Jobling MA, submitted.) we have described an NGS-based MSY phylogeny based on 5,996 SNPs ascertained in 334 human Y chromosomes comprising 17 population samples from Europe and the Near East, focused on illuminating the origins and histories of European patrilineages.
Anyway, the current paper is openly available, so do read it if you haven't already. Of interest to long-time readers of this blog is this bit:
Generally, the STRs perform poorly, giving a wide variety of TMRCAs for nodes with similar SNP-based dates, and correlation coefficients consistently below 0.6. Considering the variables described above: 1) ASD generally outperforms rho, and choice of rooting method (ancestral or modal) makes little difference. For rho, rooting through the ancestral haplotype performs much worse than through the modal haplotype; 2) removal of RM-YSTRs, and STRs showing repeat array complexity, does not have a major influence on relationships between SNP- and STR-based estimates of TMRCA, and the effects depend upon how the root is specified; and 3) the evolutionary STR mutation rate consistently overestimates, and the pedigree rate underestimates, the TMRCAs of nodes (fig. 4a). As expected, the pedigree mutation rate performs better for young nodes (less than 10 ka; supplementary table S6, Supplementary Material online), whereas the evolutionary rate performs better for older nodes.
Mol Biol Evol (2014) doi: 10.1093/molbev/msu327

The Y-Chromosome Tree Bursts into Leaf: 13,000 High-Confidence SNPs Covering the Majority of Known Clades

Pille Hallast, Chiara Batini, Daniel Zadik et al.

Many studies of human populations have used the male-specific region of the Y chromosome (MSY) as a marker, but MSY sequence variants have traditionally been subject to ascertainment bias. Also, dating of haplogroups has relied on Y-specific short tandem repeats (STRs), involving problems of mutation rate choice, and possible long-term mutation saturation. Next-generation sequencing can ascertain single nucleotide polymorphisms (SNPs) in an unbiased way, leading to phylogenies in which branch-lengths are proportional to time, and allowing the times-to-most-recent-common-ancestor (TMRCAs) of nodes to be estimated directly. Here we describe the sequencing of 3.7 Mb of MSY in each of 448 human males at a mean coverage of 51×, yielding 13,261 high-confidence SNPs, 65.9% of which are previously unreported. The resulting phylogeny covers the majority of the known clades, provides date estimates of nodes, and constitutes a robust evolutionary framework for analyzing the history of other classes of mutation. Different clades within the tree show subtle but significant differences in branch lengths to the root. We also apply a set of 23 Y-STRs to the same samples, allowing SNP- and STR-based diversity and TMRCA estimates to be systematically compared. Ongoing purifying selection is suggested by our analysis of the phylogenetic distribution of nonsynonymous variants in 15 MSY single-copy genes.

Link

December 02, 2014

Remains of Richard III identified

From the paper:
Four of the modern relatives were found to belong to Y-haplogroup R1b-U152 (x L2, Z36, Z56, M160, M126 and Z192)13, 14 with STR haplotypes being consistent with them comprising a single patrilinear group. One individual (Somerset 3) was found to belong to haplogroup I-M170 (x M253, M223) and therefore could not be a patrilinear relative of the other four within the time span considered, indicating that a false-paternity event had occurred within the last four generations. 
... 
In contrast to the Y-haplotypes of the putative modern relatives, Skeleton 1 belongs to haplogroup G-P287, with a corresponding Y-STR haplotype. Thus, the putative modern patrilinear relatives of Richard III are not genetically related to Skeleton 1 through the male line over the time period considered. However, this is not surprising, given an estimated average false-paternity rate of ~1–2% (refs 12, 17, 18). The putative modern relatives and Richard III are related through a male relative (Edward III) four generations up from Richard III (Fig. 1a and Supplementary Fig. 2), and a false-paternity event could have happened in any of the 19 generations separating Richard III and the 5th Duke of Beaufort, on either branch of the genealogy descending from Edward III. Indeed, even with a conservative false-paternity rate18 (see Supplementary Methods) the chance of a false-paternity occuring in this number of generations is 16%.

Nature Communications 5, Article number: 5631 doi:10.1038/ncomms6631

Identification of the remains of King Richard III

Turi E. King et al.

Abstract

In 2012, a skeleton was excavated at the presumed site of the Grey Friars friary in Leicester, the last-known resting place of King Richard III. Archaeological, osteological and radiocarbon dating data were consistent with these being his remains. Here we report DNA analyses of both the skeletal remains and living relatives of Richard III. We find a perfect mitochondrial DNA match between the sequence obtained from the remains and one living relative, and a single-base substitution when compared with a second relative. Y-chromosome haplotypes from male-line relatives and the remains do not match, which could be attributed to a false-paternity event occurring in any of the intervening generations. DNA-predicted hair and eye colour are consistent with Richard’s appearance in an early portrait. We calculate likelihood ratios for the non-genetic and genetic data separately, and combined, and conclude that the evidence for the remains being those of Richard III is overwhelming.

Link

November 25, 2014

E-M81 in Morocco

Hum Biol. 2014 May;86(2):105-12.

Phylogeography of e1b1b1b-m81 haplogroup and analysis of its subclades in morocco.

Reguig A, Harich N, Barakat A, Rouba H.

Abstract

In this study we analyzed 295 unrelated Berber-speaking men from northern, central, and southern Morocco to characterize frequency of the E1b1b1b-M81 haplogroup and to refine the phylogeny of its subclades: E1b1b1b1-M107, E1b1b1b2-M183, and E1b1b1b2a-M165. For this purpose, we typed four biallelic polymorphisms: M81, M107, M183, and M165. A large majority of the Berber-speaking male lineages belonged to the Y-chromosomal E1b1b1b-M81 haplogroup. The frequency ranged from 79.1% to 98.5% in all localities sampled. E1b1b1b2-M183 was the most dominant subclade in our samples, ranging from 65.1% to 83.1%. In contrast, the E1b1b1b1-M107 and E1b1b1b2a-M165 subclades were not found in our samples. Our results suggest a predominance of the E1b1b1b-M81 haplogroup among Moroccan Berber-speaking males with a decreasing gradient from south to north. The most prevalent subclade in this haplogroup was E1b1b1b2-M183, for which diffferences among these three groups were statistically significant between central and southern groups.

Link

Paternal lineages and languages in the Caucasus

An interesting new study on Y chromosome and languages in the Caucasus. The distribution of haplogroups is on the left. The authors make some associations of haplogroups with language families:

  • R1b: Indo-European
  • R1a: Scytho-Sarmatian
  • J2: Hurro-Urartian
  • G2: Kartvelian

Hum Biol. 2014 May;86(2):113-30.

Human paternal lineages, languages, and environment in the caucasus.

Tarkhnishvili D1, Gavashelishvili A1, Murtskhvaladze M1, Gabelaia M1, Tevzadze G2.

Abstract

Publications that describe the composition of the human Y-DNA haplogroup in diffferent ethnic or linguistic groups and geographic regions provide no explicit explanation of the distribution of human paternal lineages in relation to specific ecological conditions. Our research attempts to address this topic for the Caucasus, a geographic region that encompasses a relatively small area but harbors high linguistic, ethnic, and Y-DNA haplogroup diversity. We genotyped 224 men that identified themselves as ethnic Georgian for 23 Y-chromosome short tandem-repeat markers and assigned them to their geographic places of origin. The genotyped data were supplemented with published data on haplogroup composition and location of other ethnic groups of the Caucasus. We used multivariate statistical methods to see if linguistics, climate, and landscape accounted for geographical diffferences in frequencies of the Y-DNA haplogroups G2, R1a, R1b, J1, and J2. The analysis showed significant associations of (1) G2 with wellforested mountains, (2) J2 with warm areas or poorly forested mountains, and (3) J1 with poorly forested mountains. R1b showed no association with environment. Haplogroups J1 and R1a were significantly associated with Daghestanian and Kipchak speakers, respectively, but the other haplogroups showed no such simple associations with languages. Climate and landscape in the context of competition over productive areas among diffferent paternal lineages, arriving in the Caucasus in diffferent times, have played an important role in shaping the present-day spatial distribution of patrilineages in the Caucasus. This spatial pattern had formed before linguistic subdivisions were finally shaped, probably in the Neolithic to Bronze Age. Later historical turmoil had little influence on the patrilineage composition and spatial distribution. Based on our results, the scenario of postglacial expansions of humans and their languages to the Caucasus from the Middle East, western Eurasia, and the East European Plain is plausible.

Link (pdf)

October 22, 2014

High coverage genome from 45,000-year old Siberian (Ust'-Ishim)

This is the oldest full genome of a modern human published to date and it also comes from a time (45 thousand years ago) that coincides with the Upper Paleolithic revolution in Eurasia.

45 thousand years ago is probably close to when Eurasians started diverging from each other as they spread in all directions. So, we expect that a human from that time would be "undifferentiated Eurasian" and indeed this seems to be the case.

First the Y-chromosome:
The Y chromosome sequence of the Ust’-Ishim individual is similarly inferred to be ancestral to a group of related Y chromosomes (haplogroup K(xLT)) that occurs across Eurasia today6 (Supplementary Information section 9).
and mtDNA:
The Ust’-Ishim mtDNA sequence falls at the root of a large group of related mtDNAs (the ‘R haplogroup’), which occurs today across Eurasia (Supplementary Information section 8).
It is clear that this was a Eurasian individual:
Based on genotyping data for 87 African and 108 non-African individuals (Supplementary Information section 11), the Ust’-Ishim genome shares more alleles with non-Africans than with sub-Saharan Africans (|Z| = 41–89), consistent with the principal component analysis, mtDNA and Y chromosome results.
It was also more like East Asians than Europeans:
Among the non-Africans, the Ust’-Ishim genome shares more derived alleles with present-day people from East Asia than with present-day Europeans (|Z| = 2.1–6.4).
But, when they compared East Asians with La Brana and MA-1 they didn't see a difference:
However, when an ~8,000-year-old genome from western Europe (La Braña)9 or a 24,000-year-old genome from Siberia (Mal’ta 1)10 were analysed, there is no evidence that the Ust’-Ishim genome shares more derived alleles with present-day East Asians than with these prehistoric individuals (|Z| < 2). This suggests that the population to which the Ust’-Ishim individual belonged diverged from the ancestors of present-day West Eurasian and East Eurasian populations before—or simultaneously with—their divergence from each other. The finding that the Ust’-Ishim individual is equally closely related to present-day Asians and to 8,000- to 24,000-year-old individuals from western Eurasia, but not to present-day Europeans, is compatible with the hypothesis that present-day Europeans derive some of their ancestry from a population that did not participate in the initial dispersals of modern humans into Europe and Asia11.
So it seems that the Ust'-Ishim individual belonged to the same branch as Asians and WHG/ANE and modern Europeans are less like it because they also have "Basal Eurasian" admixture which they inherited via the EEF in the model of Lazaridis et al.

The authors could also get estimates of the mutation rate because this is a 45,000 year old individual that hasn't experienced 45,000 years worth of mutations:
Assuming that this corresponds to the number of mutations that have accumulated over around 45,000 years, we estimate a mutation rate of 0.43 × 10−9 per site per year (95% CI 0.38 × 10−9 to 0.49 × 10−9) that is consistent across all non-African genomes regardless of their coverage (Supplementary Information section 14). This overall rate, as well as the relative rates inferred for different mutational classes (transversions, non-CpG transitions, and CpG transitions), is similar to the rate observed for de novo estimates from human pedigrees (~0.5 × 10−9 per site per year14, 15) and to the direct estimate of branch shortening (Supplementary Information section 10). As discussed elsewhere14, 16, 17, these rates are slower than those estimated using calibrations based on the fossil record and thus suggest older dates for the splits of modern human and archaic populations.
This is a very direct confirmation of the "slow" autosomal rate of ~1.2x10-8 mutations/generation/bp using a technology much different than those used before to estimate this. The slower mutation rate implies that major splits in human history (such as the Out-of-Africa event) took place much earlier than the Upper Paleolithic revolution and the spread of humans across Eurasia. Modern humans probably established an early presence in the Levant/Arabia (consistent with Out-of-Arabia), and invented the Upper Paleolithic-related tools/behaviors there much later, and only then spread across Eurasia.

The authors write:
we estimate that the admixture between the ancestors of the Ust’-Ishim individual and Neanderthals occurred approximately 50,000 to 60,000 years BP, which is close to the time of the major expansion of modern humans out of Africa and the Middle East.
This clinches the hypothesis of Neandertal introgression in Eurasians, as Ust'-Ishim has longer Neandertal segments than modern humans, as one might expect from an individual who experienced this admixture more recently in its evolutionary past than modern humans did. It's probably in the Middle East that the Levantine/Arabian modern humans that expanded Out-of-Africa more than 100 thousand years ago came into contact with Neandertals, admixed with them and later carried this ancestry to the rest of Eurasia. I tend to think that the AMH "colony" was first limited to Arabia and only later (post-70kya) expanded north as the climate deteriorated there. The authors estimate the common ancestor of non-African Y-chromosomes (including E, which is probably a back-migration to Africa) to around 70 thousand years ago which may coincide with the Arabian Exodus event.

Nature 514, 445–449 (23 October 2014) doi:10.1038/nature13810

Genome sequence of a 45,000-year-old modern human from western Siberia

Qiaomei Fu et al.

We present the high-quality genome sequence of a ~45,000-year-old modern human male from Siberia. This individual derives from a population that lived before—or simultaneously with—the separation of the populations in western and eastern Eurasia and carries a similar amount of Neanderthal ancestry as present-day Eurasians. However, the genomic segments of Neanderthal ancestry are substantially longer than those observed in present-day individuals, indicating that Neanderthal gene flow into the ancestors of this individual occurred 7,000–13,000 years before he lived. We estimate an autosomal mutation rate of 0.4 × 10−9 to 0.6 × 10−9 per site per year, a Y chromosomal mutation rate of 0.7 × 10−9 to 0.9 × 10−9 per site per year based on the additional substitutions that have occurred in present-day non-Africans compared to this genome, and a mitochondrial mutation rate of 1.8 × 10−8 to 3.2 × 10−8 per site per year based on the age of the bone.

Link

September 28, 2014

A limited genetic link between Mansi and Hungarians

Mol Genet Genomics. 2014 Sep 26. [Epub ahead of print]

Y-SNP L1034: limited genetic link between Mansi and Hungarian-speaking populations.

Fehér T1, Németh E, Vándor A, Kornienko IV, Csáji LK, Pamjav H.

Abstract

Genetic studies noted that the Hungarian Y-chromosomal gene pool significantly differs from other Uralic-speaking populations. Hungarians show very limited or no presence of haplogroup N-Tat, which is frequent among other Uralic-speaking populations. We proposed that some genetic links need to be observed between the linguistically related Hungarian and Mansi populations.This is the first attempt to divide haplogroup N-Tat into subhaplogroups by testing new downstream SNP markers L708 and L1034. Sixty Northern Mansi samples were collected in Western Siberia and genotyped for Y-chromosomal haplotypes and haplogroups. We found 14 Mansi and 92 N-Tat samples from 7 populations. Comparative results showed that all N-Tat samples carried the N-L708 mutation. Some Hungarian, Sekler, and Uzbek samples were L1034 SNP positive, while all Mongolians, Buryats, Khanty, Finnish, and Roma samples yielded a negative result for this marker. Based on the above, L1034 marker seems to be a subgroup of N-Tat, which is typical for Mansi and Hungarian-speaking ethnic groups so far. Based on our time to most recent common ancestor data, the L1034 marker arose 2,500 years before present. The overall frequency of the L1034 is very low among the analyzed populations, thus it does not necessarily mean that proto-Hungarians and Mansi descend from common ancestors. It does provide, however, a limited genetic link supporting language contact. Both Hungarians and Mansi have much more complex genetic population history than the traditional tree-based linguistic model would suggest.

Link

September 13, 2014

Ancient mtDNA from southern Africa related to San

Genome Biol Evol (2014) doi: 10.1093/gbe/evu202

First Ancient Mitochondrial Human Genome from a Pre-Pastoralist Southern African

Alan G. Morris et al.

The oldest contemporary human mitochondrial lineages arose in Africa. The earliest divergent extant maternal offshoot, namely haplogroup L0d, is represented by click-speaking forager peoples of Southern Africa. Broadly defined as Khoesan, contemporary Khoesan are today largely restricted to the semi-desert regions of Namibia and Botswana, while archeological, historical and genetic evidence promotes a once broader southerly dispersal of click-speaking peoples including southward migrating pastoralists and indigenous marine-foragers. Today extinct, no genetic data has been recovered from the indigenous peoples that once sustained life along the southern coastal waters of Africa pre-pastoral arrival. In this study we generate a complete mitochondrial genome from a 2,330 year old male skeleton, confirmed via osteological and archeological analysis as practicing a marine-based forager existence. The ancient mtDNA represents a new L0d2c lineage (L0d2c1c) that is today, unlike its Khoe-language based sister-clades (L0d2c1a and L0d2c1b) most closely related to contemporary indigenous San-speakers (specifically Ju). Providing the first genomic evidence that pre-pastoral Southern African marine foragers carried the earliest diverged maternal modern human lineages, this study emphasizes the significance of Southern African archeological remains in defining early modern human origins.

Link

September 04, 2014

Y chromosomes and mtDNA of early farmers from Hungary

A new preprint has just appeared on the bioRxiv. It's free to read so I'll just summarize some results. First:
The haplotype of the Mesolithic skeleton from the Croatian Island Korčula belongs to the mtDNA haplogroup U5b2a5 (Dataset S3). The sub-haplogroup U5b has been shown to be frequent in pre-Neolithic hunter-gatherer communities across Europe [28–30,32,33,45,46]. 
But:
Contrary to the low mtDNA diversity reported from hunter-gatherers of Central/North Europe [28–30], we identify substantially higher variability in early farming communities of the Carpathian Basin  including the haplogroups N1a, T1, T2, J, K, H, HV, V, W, X, U2, U3, U4, and U5a (Table 1). Previous studies have shown that haplogroups N1a, T2, J, K, HV, V, W and X are most characteristic for the Central European LBK and have described these haplogroups as the mitochondrial ʻNeolithic packageʼ that had reached Central Europe in the 6th millennium BC [36,37]. Interestingly, most of these haplogroups show comparable frequencies between the STA, LBKT and LBK,
N1a is the "signature group" of the LBK based on previous publications and now it seems that it was also found in the Starcevo culture of Hungary. The mtDNA PCA plot (right) shows clearly that the Hungarian farmers are very similar to the German ones so it seems that the LBK is a direct outgrowth of the Carpathian Neolithic; some earlier models of "demic diffusion" argued that Neolithic farmers spread slowly across Europe, picking up hunter-gatherer ancestry as they went along, but now it seems that at least in the Hungary->Germany part of this journey interaction with hunter-gatherers was minimum.

mtDNA change over time in Europe is pictured in Figure 3 (left) showing a shared haplotype analysis. The Y-chromosome data genetic distance is shown on the right and shows the Balkan-Anatolian-Caucasian-Mesopotamian relationship of the early farmer Y-chromosomes. Practically, this is due to haplogroup G2a (and especially G2a2b), which has turned up in lots of ancient European farmers (including the famous Iceman):
Three STA individuals belong to the NRY haplogroup F* (M89) and two specimens can be assigned to the G2a2b (S126) haplogroup, and one each to G2a (P15) and I2a1 (P37.2) (Dataset S3, S5). The two investigated LBKT samples carry haplogroups G2a2b (S126) and I1 (M253). Furthermore, the incomplete SNP profiles of eight specimens potentially belong to the same haplogroups; STA: three G2a2b (S126), two G2a (P15), and one I (M170); LBKT: one G2a2b (S126) and one F* (M89) (Dataset S5).
I believe this is the first ancient finding of haplogroup I1 which attains a peak in modern Swedes. This might be useful to those who have tied this to Germanic migrations because of this, as it was already in Central Europe with the earliest farmers.

Interestingly:
Surprisingly, Y chromosome haplogroups, such as E1b1b1 (M35), E1b1b1a1 (M78), E1b1b1b2a (M123), J2 (M172), J1 (M267), and R1b1a2 (M269), which were claimed to be associated with the Neolithic expansion [23–25], have not been found so far in the 6th millennium BC of the Carpathian Basin and Central Europe. Intriguingly, R1a and R1b, which represent the most frequent European Y chromosome haplogroups today, have been reported from cultures that emerged in Central Europe during the 3rd/2nd millennium BC, while a basal R type has been reported from a Palaeolithic sample in Siberia [60] in agreement with a proposed Central Asian/Siberian origin of this lineage. In contrast, G2a has not been detected yet in late Neolithic cultures [42,43]. This suggests further demographic events in later Neolithic or post-Neolithic periods.
A cautionary tale against over-reliance on modern distributions to trace ancient origins.

Also:
Considering the entire set of 32 published NRY records available for Neolithic Europe thus far, the low paternal diversity is indeed quite remarkable: G2a is the prevailing haplogroup in the Central European and Carpathian Basin Neolithic, and in French and Iberian Neolithic datasets [36,40,41]. There are only two exceptions, namely one E1b1b (V13) [41] individual from the Avellaner cave in Spain (~5,000-4,500 BC), and two I2a [40] individuals from Treilles, France (~3,000 BC).


biorxiv http://dx.doi.org/10.1101/008664

Tracing the genetic origin of Europe's first farmers reveals insights into their social organization

Anna Szécsényi-Nagy et al.

Farming was established in Central Europe by the Linearbandkeramik culture (LBK), a well-investigated archaeological horizon, which emerged in the Carpathian Basin, in today's Hungary. However, the genetic background of the LBK genesis has not been revealed yet. Here we present 9 Y chromosomal and 84 mitochondrial DNA profiles from Mesolithic, Neolithic Starčevo and LBK sites (7th/6th millennium BC) from the Carpathian Basin and south-eastern Europe. We detect genetic continuity of both maternal and paternal elements during the initial spread of agriculture, and confirm the substantial genetic impact of early farming south-eastern European and Carpathian Basin cultures on Central European populations of the 6th-4th millennium BC. Our comprehensive Y chromosomal and mitochondrial DNA population genetic analyses demonstrate a clear affinity of the early farmers to the modern Near East and Caucasus, tracing the expansion from that region through south-eastern Europe and the Carpathian Basin into Central Europe. Our results also reveal contrasting patterns for male and female genetic diversity in the European Neolithic, suggesting patrilineal descent system and patrilocal residential rules among the early farmers.

Link

August 21, 2014

Ancient Y-DNA from China

From the paper:
Dividing the samples further using social status shows that the six aristocrats had haplogroups Q1a1, O3a, and N, the 14 commoners had haplogroups Q1a1, O3a, and O*, and the seven slaves had haplogroups O3a, O2a, and O* (Fig. 2).
Am. J. Hum. Biol. DOI: 10.1002/ajhb.22604

Ancient DNA evidence reveals that the Y chromosome haplogroup Q1a1 admixed into the Han Chinese 3,000 years ago

Yong-Bin Zhao et al.

Objectives

Y chromosome haplogroup Q1a1 is found almost only in Han Chinese populations. However, it has not been found in ancient Han Chinese samples until now. Thus, the origin of haplogroup Q1a1 in Han Chinese is still obscure. This study attempts to provide answer to this question, and to uncover the origin and paternal genetic structure of the ancestors of the Han Chinese.

Methods

Eighty-nine ancient human remains that were excavated from the presumed geographic source of the Han Chinese and dated to approximately 3,000 years ago were treated by the amelogenin gene polymerase chain reaction test, to determine their sex. Then, Y chromosome single nucleotide polymorphisms were subsequently analyzed from the samples detected as male.

Results 

Samples from 27 individuals were successfully amplified. Their haplotypes could be attributed to haplogroups N, O*, O2a, O3a, and Q1a1. Analyses showed that the assigned haplogroup of each sample is correlated to the suspected social status and observed burial custom associated with the sample.

Conclusions

The origins of the observed haplotypes and their distribution in present day Han Chinese and in the samples suggest that haplogroup Q1a1 was probably introduced into the Han Chinese population approximately 3,000 years ago. Am. J. Hum. Biol., 2014. © 2014 Wiley Periodicals, Inc.

Link