Showing posts with label Bantu. Show all posts
Showing posts with label Bantu. Show all posts

December 06, 2014

African Genome Variation project paper

A choice quote:
To assess the effect of gene flow on population differentiation in SSA, we masked Eurasian ancestry across the genome (Supplementary Methods and Supplementary Note 6). This markedly reduced population differentiation, as measured by a decline in mean pairwise FST from 0.021 to 0.015 (Supplementary Note 6), suggests that Eurasian ancestry has a substantial impact on differentiation among SSA populations. We speculate that residual differentiation between Ethiopian and other SSA populations after masking Eurasian ancestry (pairwise FST = 0.027) may be a remnant of East African diversity pre-dating the Bantu expansion10.
I think this should be highlighted for a couple of reasons.

1. In too many papers to count, decreasing genetic diversity from East Africa was taken as evidence of an origin of H. sapiens in that locality and its expansion from there to Eurasia. This "East Africa=cradle of mankind" theory has, as far as I can tell, nothing really to stand on. Granted, the oldest anatomically modern human remains have been found in East Africa 200-150 thousand years ago. But, the fact that old sapiens have been found in East Africa and not elsewhere is easily explained by the excellent conditions for preservation (as opposed, e.g., deserts or rainforests of Africa or elsewhere), and by the extraordinary effort by palaeoanthropologists in that area. One also needs to overlook a century of physical anthropology that concluded that East Africa was a contact zone between Caucasoids and Sub-Saharan Africans. We now know that there is no deep lineage of humans in modern east Africans. Take out the Eurasian ancestry and only a paltry Fst=0.027 remains with other Sub-Saharan Africans, a fraction of the Fst between, say, Europeans and East Asians.

2. There has been enormous literature about phenotypic variation in Africans. The ultra-migrationism of old was replaced by ultra-selectionism that sought to explain every phenotypic marker of Eurasian admixture in Africa not as evidence of such admixture, but as a parallel process of evolution whereby some Africans tended to resemble some Eurasians not because of admixture but because of adaptation to similar environmental conditions.

But:
This suggests that a large proportion of differentiation observed among African populations could be due to Eurasian admixture, rather than adaptation to selective forces (Supplementary Note 6).
This study also confirms the presence of Eurasian admixture in the Yoruba
Our finding of ancient Eurasian admixture corroborates findings of non-zero Neanderthal ancestry in Yoruba, which is likely to have been introduced through Eurasian admixture and back migration, possibly facilitated by greening of the Sahara desert during this period13, 14.

Nature (2014) doi:10.1038/nature13997

The African Genome Variation Project shapes medical genetics in Africa

Deepti Gurdasani, Tommy Carstensen, Fasil Tekola-Ayele, Luca Pagani, Ioanna Tachmazidou, et al.

Given the importance of Africa to studies of human origins and disease susceptibility, detailed characterization of African genetic diversity is needed. The African Genome Variation Project provides a resource with which to design, implement and interpret genomic studies in sub-Saharan Africa and worldwide. The African Genome Variation Project represents dense genotypes from 1,481 individuals and whole-genome sequences from 320 individuals across sub-Saharan Africa. Using this resource, we find novel evidence of complex, regionally distinct hunter-gatherer and Eurasian admixture across sub-Saharan Africa. We identify new loci under selection, including loci related to malaria susceptibility and hypertension. We show that modern imputation panels (sets of reference genotypes from which unobserved or missing genotypes in study sets can be inferred) can identify association signals at highly differentiated loci across populations in sub-Saharan Africa. Using whole-genome sequencing, we demonstrate further improvements in imputation accuracy, strengthening the case for large-scale sequencing efforts of diverse African haplotypes. Finally, we present an efficient genotype array design capturing common genetic variation in Africa.

Link

February 13, 2014

Human admixture common in human history (Hellenthal et al. 2014)

A string of recent papers argued for admixture in human populations at time scales from the Middle Pleistocene to recent centuries. A new paper in Science makes the point convincingly for extensive admixture in humans over the last few thousand years. The authors include the creators of Chromopainter/fineStructure software; the new "Globetrotter" method appears to be a natural extension of that method that seemed to work wonderfully well except for the limitation of producing only a tree of the studied populations.

The paper has a companion website in which you can look up the admixture history of individual populations.

While reading this study, it is important to remember its limitations. Two are immediately obvious: (i) admixture events can only be detected for the last few thousand years, as this method depends on pattern of linkage disequilibrium which decays exponentially with time due to recombination, and (ii) detection of admixture seems to depend on the presence of maximally differentiated populations from the edges of the human geographical range; for example, the Japanese appear unadmixed even though they are clearly of dual Jomon/Yayoi ancestry. On the other hand, the method does detect the admixture present in the San at a similar time scale.

The case of Northwestern Europe appears especially striking as none of the populations from the region show evidence of admixture. This may be because the mixtures taking place there (e.g., between "Celts" and "Anglo-Saxons" in Great Britain) involved populations that were not strongly differentiated. Alternatively, population admixture history may have preceded the last few thousand years and is thus beyond the temporal scope of this method.

An exception to the rule that populations at the edges of the human range appear to be unadmixed are the Armenians who appear to be the only * between the Atlantic and Pacific in Figure 2D (shown at the beginning of this post). The companion site lists their status as "uncertain".

Other results are more questionable; for example, the authors assert that Sardinians are an admixed population with one side being "Egyptian-like" and the other "French-like" whereas the ancient DNA evidence as it stands would rather indicate that Sardinians are the best approximation of Neolithic Europeans currently in existence and so are more likely to (mostly) possess a gene pool that traces back to ~8-9 thousand years in Europe. It will be quite the surprise if so many Europeans from 5kya or earlier look like modern Sardinians and ancient Sardinians don't!

The analysis of Eastern Europe is particularly interesting as it documents three way admixture (Northern/Southern/NE Asian) in most populations but two way admixture (Northern/Southern) in Greeks, estimated at ~37%. The authors claim that this is related to the Slavs, which seems reasonable given the 1,054AD age estimate. On the other hand, according to the companion website, the southern element in Greeks is inferred to be Cypriot-like and it's far from clear that the pre-Slavic population of Greece was Cypriot-like or indeed represented by any of the populations in the authors' dataset.

The three-way admixture in much of eastern Europe is not particularly surprising as history furnishes ample evidence for groups of steppe origin in the region during historical times. Some bequeathed their both language and name (e.g., Magyars), others only their name (e.g., Bulgarians) on the local Europeans, but records indicate a widespread presence of "eastern" groups in Europe from the time of the Huns to that of the Ottomans. A study of late Antique eastern Europeans from the Baltic to the Aegean may help better document how the twin phenomena of the eastern invasions and the spread of the Slavs shaped the present-day genetic diversity of the region.

I suspect that a few ancient samples will be far more informative for understanding the recent history of our species than the most sophisticated modeling of modern populations. Nonetheless, it's great to have a new method that maximizes what can be learned about the past from the messy palimpsest of the present.

Science 14 February 2014: Vol. 343 no. 6172 pp. 747-751 DOI: 10.1126/science.1243518

A Genetic Atlas of Human Admixture History

Garrett Hellenthal et al.

Modern genetic data combined with appropriate statistical methods have the potential to contribute substantially to our understanding of human history. We have developed an approach that exploits the genomic structure of admixed populations to date and characterize historical mixture events at fine scales. We used this to produce an atlas of worldwide human admixture history, constructed by using genetic data alone and encompassing over 100 events occurring over the past 4000 years. We identified events whose dates and participants suggest they describe genetic impacts of the Mongol empire, Arab slave trade, Bantu expansion, first millennium CE migrations in Eastern Europe, and European colonialism, as well as unrecorded events, revealing admixture to be an almost universal force shaping human populations.

Link

January 29, 2014

Austronesian (~1/3) Bantu (~2/3) admixture in Madagascar

Illumina data from this study can be found here.

PNAS doi: 10.1073/pnas.1321860111

Genome-wide evidence of Austronesian–Bantu admixture and cultural reversion in a hunter-gatherer group of Madagascar

Denis Pierron et al.

Linguistic and cultural evidence suggest that Madagascar was the final point of two major dispersals of Austronesian- and Bantu-speaking populations. Today, the Mikea are described as the last-known Malagasy population reported to be still practicing a hunter-gatherer lifestyle. It is unclear, however, whether the Mikea descend from a remnant population that existed before the arrival of Austronesian and Bantu agriculturalists or whether it is only their lifestyle that separates them from the other contemporary populations of South Madagascar. To address these questions we have performed a genome-wide analysis of >700,000 SNP markers on 21 Mikea, 24 Vezo, and 24 Temoro individuals, together with 50 individuals from Bajo and Lebbo populations from Indonesia. Our analyses of these data in the context of data available from other Southeast Asian and African populations reveal that all three Malagasy populations are derived from the same admixture event involving Austronesian and Bantu sources. In contrast to the fact that most of the vocabulary of the Malagasy speakers is derived from the Barito group of the Austronesian language family, we observe that only one-third of their genetic ancestry is related to the populations of the Java-Kalimantan-Sulawesi area. Because no additional ancestry components distinctive for the Mikea were found, it is likely that they have adopted their hunter-gatherer way of life through cultural reversion, and selection signals suggest a genetic adaptation to their new lifestyle.

Link

May 09, 2013

Phylogeography of Bantu languages (Currie et al. 2013)

Proc. R. Soc. B 7 July 2013 vol. 280 no. 1762 20130695

Cultural phylogeography of the Bantu Languages of sub-Saharan Africa

Thomas E. Currie et al.

There is disagreement about the routes taken by populations speaking Bantu languages as they expanded to cover much of sub-Saharan Africa. Here, we build phylogenetic trees of Bantu languages and map them onto geographical space in order to assess the likely pathway of expansion and test between dispersal scenarios. The results clearly support a scenario in which groups first moved south through the rainforest from a homeland somewhere near the Nigeria–Cameroon border. Emerging on the south side of the rainforest, one branch moved south and west. Another branch moved towards the Great Lakes, eventually giving rise to the monophyletic clade of East Bantu languages that inhabit East and Southeastern Africa. These phylogenies also reveal information about more general processes involved in the diversification of human populations into distinct ethnolinguistic groups. Our study reveals that Bantu languages show a latitudinal gradient in covering greater areas with increasing distance from the equator. Analyses suggest that this pattern reflects a true ecological relationship rather than merely being an artefact of shared history. The study shows how a phylogeographic approach can address questions relating to the specific histories of certain groups, as well as general cultural evolutionary processes.

Link

March 15, 2013

Admixture in Southern Africa (Petersen et al. 2013)

Related:



PLoS Genet 9(3): e1003309. doi:10.1371/journal.pgen.1003309

Complex Patterns of Genomic Admixture within Southern Africa

Desiree C. Petersen et al.

Within-population genetic diversity is greatest within Africa, while between-population genetic diversity is directly proportional to geographic distance. The most divergent contemporary human populations include the click-speaking forager peoples of southern Africa, broadly defined as Khoesan. Both intra- (Bantu expansion) and inter-continental migration (European-driven colonization) have resulted in complex patterns of admixture between ancient geographically isolated Khoesan and more recently diverged populations. Using gender-specific analysis and almost 1 million autosomal markers, we determine the significance of estimated ancestral contributions that have shaped five contemporary southern African populations in a cohort of 103 individuals. Limited by lack of available data for homogenous Khoesan representation, we identify the Ju/'hoan (n = 19) as a distinct early diverging human lineage with little to no significant non-Khoesan contribution. In contrast to the Ju/'hoan, we identify ancient signatures of Khoesan and Bantu unions resulting in significant Khoesan- and Bantu-derived contributions to the Southern Bantu amaXhosa (n = 15) and Khoesan !Xun (n = 14), respectively. Our data further suggests that contemporary !Xun represent distinct Khoesan prehistories. Khoesan assimilation with European settlement at the most southern tip of Africa resulted in significant ancestral Khoesan contributions to the Coloured (n = 25) and Baster (n = 30) populations. The latter populations were further impacted by 170 years of East Indian slave trade and intra-continental migrations resulting in a complex pattern of genetic variation (admixture). The populations of southern Africa provide a unique opportunity to investigate the genomic variability from some of the oldest human lineages to the implications of complex admixture patterns including ancient and recently diverged human lineages.

Link

March 01, 2013

Extremely old (237–581 kya) root of human Y-chromosome phylogeny

I had mentioned this research before, and now it has officially been published. There are three things to be excited about this new paper:

First, it forces us to consider the possibility of either (i) archaic admixture in Africa, or (ii) a much more ancient time depth of modern humans than the first fossils from Ethiopia dated to about ~200 thousand years ago.

Second, it underscores the importance of collaboration between academia and regular folk, since it was the combined contributions of academics, genetic genealogists, and the owners of the new A00 basal Y-chromosomes that made this new discovery possible.

And, third, it shows that the extraordinary can be discovered without contacting isolated tribes or seeking human bones in remote regions, but rather through careful scrutiny of large volumes of data for the proverbial needle in the haystack.

The paper developed a model of Y-chromosome mutation based on the estimate of Kong et al. Significantly, though:

If we were to use the higher mutation rate (1.0x10^-9 per base per year6) rather than a realistic range derived from whole-genome sequencing (4.39x10^-9  -   7.07x10^- 9), the estimated TMRCA for the tree incorporating A00 as the basal lineage would be 209 kya, which is only slightly older than current estimates of the TMRCA of mtDNA and the age of the oldest AMH fossil remains. We note, however, that the higher mutation rate produces an estimate for the common ancestor of all non-African Y chromosome haplogroups (C through T) of ~39 kya6 (i.e., versus ~63 kya for the mutation rate used here).
A 39kya common ancestor for Eurasian Y-chromosomes makes no sense, since we now know for sure that by that time, the differentiation of Eurasians was already well on its way and modern humans in remote parts of the Old World have been documented much earlier than that time.

A ~63kya common ancestor, on the other hand, fits nicely with my "two deserts" theory of modern human origins, according to which the ancestors of Eurasians faced an ecological crisis in Arabia when it became much drier post-70kya; that seems like a most opportune time for the major Eurasian bottleneck and the corresponding coalescence of Eurasian Y-chromosomes to a single man. And, while there is no a priori reason for Y chromosomes and mtDNA to behave similarly, the age of the "older" Eurasian ur-mother, haplogroup N at 59 thousand years, with presumably an older ancestor within mtDNA haplogroup L3 founding the Eurasian population.

Also, if modern human-Neandertal admixture had occurred  "most likely 47,000–65,000 years ago", then the expansion of modern Proto-Eurasians within a 70-60kya timeframe north Out-of-Arabia would have brought them in contact with their northern Neandertal neighbors. On the other hand, it would be incredible if modern humans experienced admixture with Neandertals but were still much later a very small population (to allow for the coalescence of their Y-chromosomes to one man ~39kya).


So, in summary, the mutation rate used by the authors seems consistent with what we know about an important calibration point of the human story.

But, who were the people in Africa responsible for the introgression of A00 chromosomes? Mendez et al. used the haplotype of the African American A00 individual and discovered his patrilineal kin among the Mbo of Cameroon, who are Bantu farmers. 

I have observed before that Pygmies and Bushmen represent only a tiny fraction of pre-existing African genetic diversity, the part that had not yet been absorbed into the farmers' expanding population by the time that Africa came to the attention of of modern science. We see traces of Pygmy and Bushman ancestry in some African farmers, and there were probably other groups, no longer extant as distinct ethno-cultural entities, but, nonetheless, surviving as genetic fragments in the genomes of the farmers.

Thus, while it still makes sense to study the surviving hunter-gatherers of Africa who make up perhaps a percent or less of the population of Africa, it may be equally important to study different groups of African farmers who may possess a much richer treasure trove (albeit diluted) of such "Palaeoafrican" ancestry.

Finally:

Although the stochastic nature of the evolutionary process can explain the aforementioned incongruences, the extreme age and rarity of the A00 lineage point to the possibility of a highly structured ancestral population, consistent with recent work on the autosomes.40,41,43,44 This could take the form of long-standing population structure among AMH populations45 or archaic introgression from an archaic form into the ancestors of AMHs.46 Interestingly, the Mbo live less than 800 km away from a Nigerian site known as Iwo Eleru, where human skeletal remains with both archaic and modern features were found and dated to ~13 kya.47 Further surveys in sub- Saharan Africa and in the African Diaspora might uncover more diverged basal lineages, which will help to disentangle some of the complex evolutionary processes that shape patterns of Y chromosome diversity.


AJHG 10.1016/j.ajhg.2013.02.002

An African American Paternal Lineage Adds an Extremely Ancient Root to the Human Y Chromosome Phylogenetic Tree 

Fernando L. Mendez et al.

We report the discovery of an African American Y chromosome that carries the ancestral state of all SNPs that defined the basal portion of the Y chromosome phylogenetic tree. We sequenced ∼240 kb of this chromosome to identify private, derived mutations on this lineage, which we named A00. We then estimated the time to the most recent common ancestor (TMRCA) for the Y tree as 338 thousand years ago (kya) (95% confidence interval = 237–581 kya). Remarkably, this exceeds current estimates of the mtDNA TMRCA, as well as those of the age of the oldest anatomically modern human fossils. The extremely ancient age combined with the rarity of the A00 lineage, which we also find at very low frequency in central Africa, point to the importance of considering more complex models for the origin of Y chromosome diversity. These models include ancient population structure and the possibility of archaic introgression of Y chromosomes into anatomically modern humans. The A00 lineage was discovered in a large database of consumer samples of African Americans and has not been identified in traditional hunter-gatherer populations from sub-Saharan Africa. This underscores how the stochastic nature of the genealogical process can affect inference from a single locus and warrants caution during the interpretation of the geographic location of divergent branches of the Y chromosome phylogenetic tree for the elucidation of human origins.

Link

February 05, 2013

mtDNA from southeastern Kenyan Bantu

Am J Phys Anthropol DOI: 10.1002/ajpa.22227

Mitochondrial DNA Diversity in Two Ethnic Groups in Southeastern Kenya: Perspectives from the Northeastern Periphery of the Bantu Expansion

Ken Batai et al.

The Bantu languages are widely distributed throughout sub-Saharan Africa. Genetic research supports linguists and historians who argue that migration played an important role in the spread of this language family, but the genetic data also indicates a more complex process involving substantial gene flow with resident populations. In order to understand the Bantu expansion process in east Africa, mtDNA hypervariable region I variation in 352 individuals from the Taita and Mijikenda ethnic groups was analyzed, and we evaluated the interactions that took place between the Bantu- and non-Bantu-speaking populations in east Africa. The Taita and Mijikenda are Bantu-speaking agropastoralists from southeastern Kenya, at least some of whose ancestors probably migrated into the area as part of Bantu migrations that began around 3,000 BCE. Our analyses indicate that they show some distinctive differences that reflect their unique cultural histories. The Taita are genetically more diverse than the Mijikenda with larger estimates of genetic diversity. The Taita cluster with other east African groups, having high frequencies of haplogroups from that region, while the Mijikenda have high frequencies of central African haplogroups and cluster more closely with central African Bantu-speaking groups. The non-Bantu speakers who lived in southeastern Kenya before Bantu speaking groups arrived were at least partially incorporated into what are now Bantu-speaking Taita groups. In contrast, gene flow from non-Bantu speakers into the Mijikenda was more limited. These results suggest a more complex demographic history where the nature of Bantu and non-Bantu interactions varied throughout the area.

Link

January 17, 2013

Deep mtDNA substructure in southern Africa (Barbieri et al. 2013)

The Khoisan have been used in many different ways in reconstructions of human history.

Being probably the most genetically diverse modern human population, they are occasionally viewed as akin to the ur-humans, with everyone else shedding diversity via founder effects as they moved away from a south African modern human urheimat.

They are also sometimes viewed as a basal branch of the human family tree, and they probably are -if modern humans are made to fit a tree model. But, modern humans didn't really evolve tree-like (some African farmers have Khoisan-like admixture, and the Khoisan themselves have relatively "shallow" common ancestry with other Africans and many Eurasians on account of their possession of a respectable frequency of Y-haplogroup E).

I have sometimes noted that in the case of South African groups were are lucky that the Khoisan exist as a discrete set of populations, making it easier to discern the legacy of South African hunter-gatherers in the genomes of immigrant farmers and pastoralists who converged southwards over the last few thousand years. This can be contrasted with the presumable situation in places like West Africa (the cradle of Sub-Saharan African farming), in which any indigenous hunter-gatherer groups have ceased to exist as distinct entities a long time ago.

A new AJHG paper sample south African genomes extensively and arrives at a startling conclusion. In the words of the authors:
Overall, the results of this analysis indicate that it is very unlikely that the highly divergent L0k1b/L0k2 lineages were incorporated into the Bantu-speaking populations via gene flow from a population that was ancestral to a Khoisan population in our sample but subsequently lost from the Khoisan population via drift. Instead, these results support the hypothesis that the ancestors of the Bantu-speaking populations carrying the divergent L0k lineages (who now live mainly in Zambia) experienced gene flow from a pre-Bantu population that is nowadays extinct. Alternatively, it is possible that descendants from this pre-Bantu population do exist but have not yet been included in population genetic studies; however, our extensive sampling of populations from Botswana, Namibia, andWest Zambia (which includes representatives of nearly all known Khoisan groups) makes it highly unlikely that this pre-Bantu Khoisan population has not yet been sampled.
In other words, we must resist the tendency to think of the Khoisan as representatives of all pre-Bantu south Africans. The Khoisan are certainly descendants of old south Africans, and represent a part of the pre-Bantu genetic landscape that retained its cultural distinctiveness (and hence can be nowadays sampled as a distinct population). But, there were other, now submerged, peaks in that landscape that are no longer extant in distinct form, but only in absorbed form in the gene pool of south African farmers.

This is fairly interesting in itself, and certainly ought to change our belief about what Africa looked like pre-Bantu expansion. We ought to think of, perhaps, a cornucopia of groups: many of them may have gone extinct; some may have been completely absorbed into more successful ones, and perhaps only a handful survive as distinct entities. Such a view would agree with the conclusions of physical anthropology about the persistence of archaic-leaning groups in parts of Africa down to the Holocene boundary.

The American Journal of Human Genetics, 17 January 2013 doi:10.1016/j.ajhg.2012.12.010

Ancient Substructure in Early mtDNA Lineages of Southern Africa

Chiara Barbieri et al.


Among the deepest-rooting clades in the human mitochondrial DNA (mtDNA) phylogeny are the haplogroups defined as L0d and L0k, which are found primarily in southern Africa. These lineages are typically present at high frequency in the so-called Khoisan populations of hunter-gatherers and herders who speak non-Bantu languages, and the early divergence of these lineages led to the hypothesis of ancient genetic substructure in Africa. Here we update the phylogeny of the basal haplogroups L0d and L0k with 500 full mtDNA genome sequences from 45 southern African Khoisan and Bantu-speaking populations. We find previously unreported subhaplogroups and greatly extend the amount of variation and time-depth of most of the known subhaplogroups. Our major finding is the definition of two ancient sublineages of L0k (L0k1b and L0k2) that are present almost exclusively in Bantu-speaking populations from Zambia; the presence of such relic haplogroups in Bantu speakers is most probably due to contact with ancestral pre-Bantu populations that harbored different lineages than those found in extant Khoisan. We suggest that although these populations went extinct after the immigration of the Bantu-speaking populations, some traces of their haplogroup composition survived through incorporation into the gene pool of the immigrants. Our findings thus provide evidence for deep genetic substructure in southern Africa prior to the Bantu expansion that is not represented in extant Khoisan populations.


Link

September 22, 2012

ADMIXTURE analysis of Schlebusch et al. (2012) data

The ADMIXTURE analysis of Schlebusch et al. (2012) did not include Eurasian references, but thanks to the fact that the authors have made their data publicly available, anyone can carry out additional analyses on it. I am sure that this data will be very useful in the future. The list of included populations, with sample sizes are:


  • ColouredColesberg_Sch 20
  • ColouredWellington_Sch 20
  • Khomani_Sch 39
  • Karretjie_Sch 20
  • Khwe_Sch 17
  • GuiGhanaKgal_Sch 15
  • Juhoansi_Sch 18
  • Nama_Sch 20
  • Xun_Sch 19
  • SEBantu_Sch 20
  • SWBantu_Sch 12

As is my convention, the _Sch ending denotes that these populations are from the Schlebusch et al. paper


As always with a new dataset, after processing it, I ran a quick test to make sure everything seemed to be alright. This time, I included the 220 individuals in the released datasets together with 28 HGDP Sardinians and 10 HGDP Dai, and ran a quick K=4 ADMIXTURE analysis:


These appear to make sense. The "green" Dai-like element in the Coloured samples is probably a stand-in for Indian ancestry in that population. The plot of individuals shows considerable variation within several populations:

September 12, 2012

How some Bantu got clickin' (Barbieri et al. 2012)

Eur J Hum Genet. 2012 Aug 29. doi: 10.1038/ejhg.2012.192. [Epub ahead of print]

Genetic perspectives on the origin of clicks in Bantu languages from southwestern Zambia. 

Barbieri C, Butthof A, Bostoen K, Pakendorf B. Source Max Planck Research Group on Comparative Population Linguistics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany.

Abstract 

Some Bantu languages spoken in southwestern Zambia and neighboring regions of Botswana, Namibia, and Angola are characterized by the presence of click consonants, whereas their closest linguistic relatives lack such clicks. As clicks are a typical feature not of the Bantu language family, but of Khoisan languages, it is highly probable that the Bantu languages in question borrowed the clicks from Khoisan languages. In this paper, we combine complete mitochondrial genome sequences from a representative sample of populations from the Western Province of Zambia speaking Bantu languages with and without clicks, with fine-scaled analyses of Y-chromosomal single nucleotide polymorphisms and short tandem repeats to investigate the prehistoric contact that led to this borrowing of click consonants. Our results reveal complex population-specific histories, with female-biased admixture from Khoisan-speaking groups associated with the incorporation of click sounds in one Bantu-speaking population, while concomitant levels of potential Khoisan admixture did not result in sound change in another. Furthermore, the lack of sequence sharing between the Bantu-speaking groups from southwestern Zambia investigated here and extant Khoisan populations provides an indication that there must have been genetic substructure in the Khoisan-speaking indigenous groups of southern Africa that did not survive until the present or has been substantially reduced. 

  Link

August 17, 2012

African Y chromosome news (E1b1a and R-V88)

European Journal of Human Genetics advance online publication 15 August 2012; doi: 10.1038/ejhg.2012.176

Evidence from Y-chromosome analysis for a late exclusively eastern expansion of the Bantu-speaking people

Naser Ansari Pour1, Christopher A Plaster1 and Neil Bradman1

Abstract

The expansion of the Bantu-speaking people (EBSP) during the past 3000–5000 years is an event of great importance in the history of humanity. Anthropology, archaeology, linguistics and, in recent decades, genetics have been used to elucidate some of the events and processes involved. Although it is generally accepted that the EBSP has its origin in the so-called Bantu Homeland situated in the area of the border between Nigeria and the Grassfields of Cameroon, and that it followed both western and eastern routes, much less is known about the number and dates of those expansions, if more than one. Mitochondrial, Y-chromosome and autosomal DNA analyses have been carried out in attempts to understand the demographic events that have taken place. There is an increasing evidence that the expansion was a more complex process than originally thought and that neither a single demographic event nor an early split between western and eastern groups occurred. In this study, we analysed unique event polymorphism and short tandem repeat variation in non-recombining Y-chromosome haplogroups contained within the E1b1a haplogroup, which is exclusive to individuals of recent African ancestry, in a large, geographically widely distributed, set of sub-Saharan Africans (groups=43, n=2757), all of whom, except one Nilo-Saharan-speaking group, spoke a Niger-Congo language and most a Bantu tongue. Analysis of diversity and rough estimates of times to the most recent common ancestors of haplogroups provide evidence of multiple expansions along eastern and western routes and a late, exclusively eastern route, expansion.

Link

European Journal of Human Genetics advance online publication 15 August 2012; doi: 10.1038/ejhg.2012.167

The genetic landscape of Equatorial Guinea and the origin and migration routes of the Y chromosome haplogroup R-V88

Miguel González1, Verónica Gomes1,2, Ana Maria López-Parra3, António Amorim1,4, Ángel Carracedo2, Paula Sánchez-Diz2, Eduardo Arroyo-Pardo3 and Leonor Gusmão1

Abstract

Human Y chromosomes belonging to the haplogroup R1b1-P25, although very common in Europe, are usually rare in Africa. However, recently published studies have reported high frequencies of this haplogroup in the central-western region of the African continent and proposed that this represents a ‘back-to-Africa’ migration during prehistoric times. To obtain a deeper insight into the history of these lineages, we characterised the paternal genetic background of a population in Equatorial Guinea, a Central-West African country located near the region in which the highest frequencies of the R1b1 haplogroup in Africa have been found to date. In our sample, the large majority (78.6%) of the sequences belong to subclades in haplogroup E, which are the most frequent in Bantu groups. However, the frequency of the R1b1 haplogroup in our sample (17.0%) was higher than that previously observed for the majority of the African continent. Of these R1b1 samples, nine are defined by the V88 marker, which was recently discovered in Africa. As high microsatellite variance was found inside this haplogroup in Central-West Africa and a decrease in this variance was observed towards Northeast Africa, our findings do not support the previously hypothesised movement of Chadic-speaking people from the North across the Sahara as the explanation for these R1b1 lineages in Central-West Africa. The present findings are also compatible with an origin of the V88-derived allele in the Central-West Africa, and its presence in North Africa may be better explained as the result of a migration from the south during the mid-Holocene.

Link

June 24, 2012

SMBE 2012 abstracts (Part II)

Some more abstracts from SMBE 2012.


The Neolithic trace in mitochondrial haplogroup U8 
Joana Barbosa Pereira 1,2 , Marta Daniela Costa 1,2 , Pedro Soares 2 , Luísa Pereira 2,3 , Martin Brian Richards 1,4 1 Institute of Integrative and Comparative Biology, Faculty of Biological Sciences, University of Leeds, Leeds, UK, 2 Instituto de Patologia e Imunologia Molecular da Universidade do Porto, Porto, Portugal,  3 Faculdade de Medicina da  Universidade do Porto, Porto, Portugal,  4 School of Applied Sciences, University of Huddersfield, Huddersfield, UK  

The mitochondrial DNA (mtDNA) still remains an important marker in the study of human history, especially if  considering the increasing amount of data available. Among the several questions regarding human history that are  under debate, the model of expansion of agriculture into Europe from its source in the Near East is still unclear. Recent  studies have indicated that clusters belonging to haplogroup K, a major clade from U8, might be related with the  Neolithic expansions. Therefore, it is crucial to identify the founder lineages of the Neolithic in Europe so that we may  understand the real genetic input of the first Near Eastern farmers in the current European population and comprehend  how agriculture spread so quickly throughout all Europe.  In order to achieve this goal, a total of 55 U8 samples from the Near East, Europe and North Africa were selected for  complete characterisation of mtDNA. A maximum-parsimonious phylogenetic tree was constructed using all published  sequences available so far. Coalescence ages of specific clades were estimated using ρ statistic, maximum likelihood  and Bayesian methods considering a mutation rate for the complete molecule corrected for purifying selection.   Our results show that U8 dates to ~37-54 thousand years ago (ka) suggesting that this haplogroup might have been  carried by the first modern humans to arrive in Europe, ~50 ka. Haplogroup K most likely originated in the Near East  ~23-32 ka where it might have remained during the Last Glacial Maximum, between 26-19 years ago. The majority of K  subclades date to the Late Glacial and are related with the repopulation of Europe from the southern refugia areas. Only  a few lineages appear to reflect post glacial, Neolithic or post-Neolithic expansions, mostly occurring within Europe. The  major part of the lineages dating to the Neolithic period seems to have an European origin with exception of haplogroup  K1a4 and K1a3. Clade K1a4 appears to be originated from the Near East where it also reaches its highest peak of  diversity. Despite the main clades of K1a4 arose in the Near East during the Late Glacial, its subclade K1a4a1 dates to  ~9-11 ka and is most likely related with the Neolithic dispersal to Europe. Similarly, K1a3 probably originated in the Near  East during the Late Glacial and its subclade K1a1a dispersed into Europe ~11-13 ka alongside with the expansion of  agriculture. 
Late Glacial Expansions in Europe revealed through the fine-resolution characterisation of mtDNA haplogroup  U8 
Marta Daniela Costa 1,2 , Joana Barbosa Pereira 1,2 , Pedro Soares 2 , Luisa Pereira 2,3 , Martin Brian Richards 1,4 1 Institute of Integrative and Comparative Biology, Faculty of Biological Sciences, University of Leeds, Leeds, UK, 2 IPATIMUP - Instituto de Patologia e Imunologia Molecular da Universidade do Porto, Porto, Portugal,  3 Faculdade de  Medicina, Universidade do Porto, Porto, Portugal,  4 School of Applied Sciences, University of Huddersfield, Huddersfield,  UK  

The maternally inherited and fast evolving mitochondrial DNA (mtDNA) molecule is a highly informative tool with which  to reconstruct human prehistory. This has become even more true in recent years, as mtDNA based studies are  becoming more robust and powerful due to the availability of complete mtDNA genomes. These allow better mutation  rate estimates and fine-resolution characterisation of the phylogeography of mtDNA haplogroups, or named  clades.  MtDNA haplogroup K, the major subclade of U8, occurs at low frequencies through West Eurasian populations,  and is much more common in Ashkenazi Jews. However, the lack of variation on the first hypervariable segment (HVSI) has precluded any meaningful phylogeographic analysis to date. We therefore completely sequenced 50 haplogroup  K and 5 non-K U8 mtDNA samples from across Europe and the Near East, and combined them with 343 genomes  previously deposited in GenBank, in order to reconstruct a detailed phylogenetic tree. By combining several inference  methods, including maximum parsimony, maximum likelihood and Bayesian inference it was possible to trace the  timescale and geography of the main expansions and dispersals associated with this lineage. We confirmed that  haplogroup K, dating to ~32 thousand years (ka) ago, descended from the U8 clade, which coalesces ~48 ka ago. The  latter is close to the timing of the first arrival of modern humans in Europe and U8 could be one of the few surviving  mtDNA lineages brought by the first settlers from the Near East. U8 split into the widespread U8b, at ~43 ka, and U8a,  which seems to have expanded only in Europe ~24 ka ago. Considering the pattern of diversity and the geographic  distribution, haplogroup K is most likely to have arisen in the Near East, ~32 ka ago. However, some subclades were  evidently carried to Europe during the Last Glacial Maximum (LGM). We observed significant expansions of haplogroup  K lineages in the Late Glacial period (14-19 ka), reflecting expansions out of refuge areas in southwest and possibly  also southeast Europe. 

Reticulated origin of domesticated tetraploid wheat 
Peter Civan Centro de Ciencias do Mar, Universidade do Algarve, Faro, Portugal  

The past 15 years have witnessed a notable scientific interest in the topic of crop domestication and the emergence of  agriculture in the Near East. Multi-disciplinary approaches brought a significant amount of new data and a multitude of  hypotheses and interpretations. However, some seemingly conflicting evidence, especially in the case of emmer wheat,  caused certain controversy and a broad scientific consensus on the circumstances of the wheat domestication has not  been reached, yet.  The past phylogenetic research has translated the issue of wheat domestication into somewhat simplistic mono- /polyphyletic dilemma, where the monophyletic origin of a crop signalizes rapid and geographically localized  domestication, while the polyphyletic evidence suggests independent, geographically separated domestication events.  Interestingly, the genome-wide and haplotypic data analyzed in several studies did not yield consistent results and the  proposed scenarios are usually in conflict with the archaeological evidence of lengthy domestication.  Here I suggest that the main cause of the above mentioned inconsistencies might lie in the inadequacy of the divergent,  tree-like evolutional model. The inconsistent phylogenetic results and implicit archaeological evidence indicate a  reticulate (rather than divergent) origin of domesticated emmer. Reticulated genealogy cannot be properly represented  on a phylogenetic tree; hence different sets of samples and genetic loci are prone to conclude different domestication  scenarios. On a genome-wide super-tree, the conflicting phylogenetic signals are suppressed and the origin of  domesticated crop may appear monophyletic, leading to misinterpretations of the circumstances of the Neolithic  transition.  The network analysis of multi-locus sequence data available for tetraploid wheat clearly supports the reticulated origin of  domesticated emmer and durum wheat. The concept of reticulated genealogy of domesticated wheat sheds new light  onto the emergence of Near-Eastern agriculture and is in agreement with current archaeological evidence of protracted  and dispersed emmer domestication.

High-coverage population genomics of diverse African hunter-gatherers 
Joseph Lachance 1 , Benjamin Vernot 2 , Clara Elbers 1 , Bart Ferwerda 1 , Alain Froment 3 , Jean-Marie Bodo 4 , Godfrey  Lema 5 , Thomas Nyambo 5 , Timothy Rebbeck 1 , Kun Zhang 6 , Joshua Akey 2 , Sarah Tishkoff 1 1 University of Pennsylvania, Philadelphia, PA, USA,  2 University of Washington, Seattle, WA, USA,  3 IRD-MNHN, Musee  de l'Homme, Paris, France,  4 Ministere de la Recherche Scientifique et de l’Innovation, Yaounde, Cameroon,  5 Muhimbili  University College of Health Sciences, Dar es Salaam, Tanzania,  6 University of California at San Diego, San Diego, CA,  USA     
In addition to their distinctive subsistence patterns, African hunter-gatherers belong to some of the most genetically  diverse populations on Earth.  To infer demographic history and detect signatures of natural selection, we sequenced  the whole genomes of five individuals in each of three geographically and linguistically diverse African hunter-gatherer  populations at >60x coverage.  In these 15 genomes we identify 13.4 million variants, many of which are novel,  substantially increasing the set of known human variation.  These variants result in allele frequency distributions that are  free of SNP ascertainment bias.  This genetic data is used to infer population divergence times and demographic history  (including population bottlenecks and inbreeding).  We find that natural selection continues to shape the genomes of  hunter-gatherers, and that deleterious genetic variation is found at similar levels for hunter-gatherers and African  populations with agricultural or pastoral subsistence patterns.  In addition, the genomes of each hunter-gatherer  population contain unique signatures of local adaptation.  These highly-divergent genomic regions include genes  involved in immunity, metabolism, olfactory and taste perception, reproduction, and wound healing.

Reconstructing past Native American genetic diversity in Puerto Rico from contemporary populations Marina Muzzio 1,2 , Fouad Zakharia 1 , Karla Sandoval 1 , Jake K. Byrnes 3 , Andres Moreno-Estrada 1 , Simon Gravel 1 , Eimear  Kenny 1 , Juan L. Rodriguez-Flores 5 , Chris R. Gignoux 6 , Wilfried Guiblet 4 , Julie Dutil 7 , The 1000 Genomes Consortium 0 ,  Andres Ruiz-Linares 8 , David Reich 9,10 , Taras K. Oleksyk 4 , Juan Carlos Martinez-Cruzado 4 , Esteban Gonzalez  Burchard 6 , Carlos D. Bustamante 1 1 Department of Genetics, Stanford University School of Medicine, Stanford, California, USA,  2 Facultad de Ciencias  Naturales, Universidad Nacional de La Plata, La Plata, Buenos Aires, Argentina,  3 Ancestry. com®, San Francisco,  California, USA,  4 Department of Biology, University of Puerto Rico at Mayagüez, Mayagüez, Puerto Rico,  5 Department  of Genetic Medicine, Weill Cornell Medical College, New York, New York, USA,  6 Institute for Human Genetics,  University of California San Francisco, San Francisco, California, USA,  7 Ponce School of Medicine, Ponce, Puerto Rico, 8 Department of Genetics, Evolution and Environment. University College London, London, UK,  9 Department of  Genetics, Harvard Medical School, Boston, Massachusetts, USA,  10 Broad Institute of MIT and Harvard, Cambridge,  Massachusetts, USA  

The Caribbean region has a rich cultural and biological diversity, including several countries with different languages,  and important historical events like the arrival of the Europeans in the late fifteenth century affected it deeply. Although it  has been said that two main Native American groups peopled the Caribbean at the time of Columbus’s voyages—the  Arawakan-speaking Tainos and the Caribs—this model has been questioned because it comes from the descriptions  written by the conquerors. The archaeological record shows a richer picture of trade among the islands, cultural change  and diversity than what colonial documents depict, from the early settlements around 8000 B.P. to the chiefdoms and  towns at the time of contact. How this area was peopled and how its inhabitants interacted with the surrounding  continent are questions that remain to be answered due to the fragmentary nature of the historical and archaeological  records.   
We aim to reconstruct the Native American genetic diversity from the time of the Spanish arrival at the island of Puerto  Rico from its contemporary population. We seek to find out how the original peopling of Puerto Rico occurred, along  with which contemporary Native American populations are the most closely related to the Native tracks found. We used  PCAdmix to trace Native American segments in admixed individuals, thus enabling us to reconstruct the original native  lineages previous to the European and African contact.   

Specifically, we generated local ancestry calls for the 70 parents of the 35 complete Puerto Rican trios from the wholegenome and Illumina Omni 2.5M chip Genotype data of the 1000 Genomes Project, both to examine genome-wide  admixture patterns and to infer demographic historical events from ancestry tract length distributions and an ancestryspecific PCA approach, adding 55 Native American groups as potential source populations (N=475 genotyped through  Illumina’s 650K array) and 15 selected Mexican trios (genotyped on Affymetrix’s 6.0 array, including about 906,000  SNPs) to provide population context. ADMIXTURE analysis has shown that in Puerto Rico there is no single source of  contribution for the Native component. Rather, this component seems to include a mixture of major Mexican and  Andean components with little contributions from the Amazonian isolates. On the other hand, the ancestry-specific PCA  plotted the Puerto Rican Native segments tightly clustered with the Native segments of groups from the same language  family as the Tainos (Equatorial-Tucanoan), showing a clear association between linguistics and genetics instead of a  geographical one.
 Inference of demographic history and natural selection in African Pygmy populations from whole-genome  sequencing data
 Martin Sikora 1 , Etienne Patin 2 , Helio Costa 1 , Katherine Siddle 2 , Brenna M Henn 1 , Jeffrey M Kidd 1,3 , Ryosuke Kita 1 ,  Carlos D Bustamante 1 , Lluis Quintana-Murci 2 1 Department of Genetics, School of Medicine, Stanford Uni, Stanford, CA, USA,  2 Unit of Human Evolutionary Genetics,  Institut Pasteur, CNRS URA3012, Paris, France,  3 Departments of Human Genetics and Computational Medicine and  Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA     

The Pygmy populations of Central Africa are some of the last remaining hunter-gatherers among present-day human  populations, and can be broadly classified into two geographically separated groups, the Western and Eastern Pygmies.  Compared to their neighboring populations of predominantly Bantu origin, Pygmy populations show distinct cultural and  physical characteristics, most notably short stature, often referred to as the “Pygmy phenotype”. Given their distinct  physical characteristics, the questions of the demographic history and origin of the Pygmy phenotype have attracted  much attention. Previous studies have shown an ancient divergence (~60,000 years ago) of the ancestors of modernday Pygmies from non-Pygmies, and a more recent split of the Eastern and Western Pygmy groups. However, these  studies were generally based on a relatively small set of markers, precluding accurate estimations of demographic  parameters. Furthermore, despite the considerable interest, to date there is still little known about the genetic basis of  the small stature phenotype of Pygmy populations.   
In order to address these questions, we sequenced the genomes of 47 individuals from three populations: 20 Baka, a  Pygmy hunter-gatherer population from the Western subgroup of the African Pygmies; 20 Nzebi, a neighboring nonPygmy agriculturist population from the Bantu ethnolinguistic group; as well as 7 Mbuti, Eastern Pygmy population, from  the Human Genome Diversity Project (HGDP). We performed whole-genome sequencing using Illumina Hi-Seq 2000 to  a median sequencing depth of 5.5x per individual. After stringent quality control filters, we call over 17 Million SNPs  across the three populations, 32% of them novel (relative to dbSNP 132). Genotype accuracy after imputation was  assessed using genotype data from the Illumina OMNI1 SNP array, and error rates were found to be comparable to  other low-coverage studies (< 3% for most individuals). Preliminary results show relatively low genetic differentiation  between the Baka and the Nzebi (mean FST = 0.026), whereas the Mbuti show higher differentiation to both Baka and  Nzebi (mean FST = 0.060 and 0.070, respectively). Furthermore, we find that alleles previously found to be associated with height in other populations are not enriched for the “small” alleles in the Pygmy populations. We find a number of  highly differentiated genomic regions as candidate loci for height differentiation, which will be verified using simulations  under the best-fit demographic model, inferred from multi-dimensional allele frequency spectra using DaDi. Our dataset  will allow a detailed investigation of the demographic history and the genomics of adaptation in these populations.
Genetic structure in North African human populations and the gene flow to Southern Europe
Laura R Botigué 1 , Brenna M Henn 2 , Simon Gravel 2 , Jaume Bertranpetit 1 , Carlos D Bustamante 2 , David Comas 1 1 Institut de Biologia Evolutiva (IBE, CSIC-UPF), Barcelona, Spain,  2 Stanford University, Stanford CA, USA Despite being in the African continent and at the shores of the Mediterranean, North African populations might have  experienced a different population history compared to their neighbours. However, the extent of their genetic divergence  and gene flow from neighbouring populations is poorly understood. In order to establish the genetic structure of North  Africans and the gene flow with the Near East, Europe and sub-Saharan Africa, a genomewide SNP genotyping array  data (730,000 sites) from several North African and Spanish populations were analysed and compared to a set of  African, European and Middle Eastern samples. We identify a complex pattern of autochthonous, European, Near  Eastern, and sub-Saharan components in extant North African populations; where the autochthonous component  diverged from the European and Near Eastern component more than 12,000 years ago, pointing to a pre-Neolithic  ‘‘back-to-Africa’’ gene flow. To estimate the time of migration from sub-Saharan populations into North Africa, we  implement a maximum likelihood dating method based on the frequency and length distribution of migrant tracts, which  has suggested a migration of western African origin into Morocco ~1,200 years ago and a migration of individuals with  Nilotic ancestry into Egypt ~ 750 years ago.  We characterize broad patterns of recent gene flow between Europe and Africa, with a gradient of recent African  ancestry that is highest in southwestern Europe and decreases in northern latitudes. The elevated shared African  ancestry in SW Europe (up to 20% of the individuals’ genomes) can be traced to populations in the North African  Maghreb. Our results, based on both allele-frequencies and shared haplotypes, demonstrate that recent migrations from  North Africa substantially contribute to the higher genetic diversity in southwestern Europe

Estimating a date of mixture of ancestral South Asian populations
Priya Moorjani 1,2 , Nick Patterson 2 , Periasamy Govindaraj 3 , Danish Saleheen 4 , John Danesh 4 , Lalji Singh* 3,5 ,  Kumarasamy Thangaraj* 3 , David Reich* 1,2 1 Harvard University, Boston, Massachusetts, USA,  2 Broad Institute, Cambridge, Massachusetts, USA,  3 Centre for  Cellular and Molecular Biology, Hyderabad, Andhra Pradesh, India,  4 Dept of Public Health and Care, University of  Cambridge, Cambridge, UK,  5 Genome Foundation, Hyderabad, Andhra Pradesh, India Linguistic and genetic studies have demonstrated that almost all groups in South Asia today descend from a mixture of  two highly divergent populations: Ancestral North Indians (ANI) related to Central Asians, Middle Easterners and  Europeans, and Ancestral South Indians (ASI) not related to any populations outside the Indian subcontinent. ANI and  ASI have been estimated to have diverged from a common ancestor as much as 60,000 years ago, but the date of the  ANI-ASI mixture is unknown. Here we analyze data from about 60 South Asian groups to estimate that major ANI-ASI  mixture occurred 1,200-4,000 years ago. Some mixture may also be older—beyond the time we can query using  admixture linkage disequilibrium—since it is universal throughout the subcontinent: present in every group speaking  Indo-European or Dravidian languages, in all caste levels, and in primitive tribes. After the ANI-ASI mixture that  occurred within the last four thousand years, a cultural shift led to widespread endogamy, decreasing the rate of  additional mixture.   
Long IBD in Europeans and recent population history 
Peter Ralph, Graham Coop  UC Davis, Davis, CA, USA  
Numbers of common ancestors shared at various points in time across populations  can tell us about recent demography, migration, and population movements.  These rates of shared ancestry over tens of generations can be inferred from  genomic data, thereby dramatically increasing our ability to infer population  history much more recent than was previously possible with population genetic  techniques.  We have analyzed patterns of IBD in a dataset of thousands of  Europeans from across the continent, which provide a window into recent  European geographic structure and migration.   
Gene flow between human populations during the exodus from Africa, and the timeline of recent human  evolution  
Aylwyn Scally, Richard Durbin  Wellcome Trust Sanger Institute, Hinxton, Cambridgeshire, UK 
We present a novel test for historical gene flow between populations using unphased genotypes in present-day  individuals, based on the sharing of derived alleles and making a minimal set of assumptions about their demographic  history. We apply this test to data for three human individuals of African, European and Asian ancestry. We find that the  joint distribution of European and Asian genotypes is compatible with these populations having separated cleanly at  some time in the past without subsequent genetic exchange. However the same is not true of the European-African and  Asian-African distributions, which instead suggest an extended period of continued exchange between African and nonAfrican populations after their initial separation. 
We discuss this in comparison with recent models and estimates of separation time between these populations. We  also consider the impact of recent direct experimental studies of the human mutation rate, which suggest rates of  around 0.5 × 10 -9  bp -1  y -1 , substantially lower than prior estimates of 1 × 10 -9  bp -1  y -1  obtained from calibration against  the primate fossil record. We show that in several places the lower rate, implying older dates, yields better agreement  between genetic and non-genetic (paleoanthropological and archaeological) evidence for events surrounding the  exodus of modern humans from Africa and their dispersion worldwide.
Long-term presence versus recent admixture: Bayesian and approximate-Bayesian analyses of genetic  diversity of human populations in Central Asia 
Friso Palstra, Evelyne Heyer, Frederic Austerlitz  Eco-anthropologie et Ethnobiologie UMR 7206 CNRS, Equipe Genetique des Populations Humaines, Museum National  d'Histoire Naturelle, Paris, France 
A long-standing goal in population genetics is to unravel the relative importance of evolutionary forces that shape  genetic diversity. Here we focus on human populations in Central Asia, a region that has long been known to contain  the highest genetic diversity on the Eurasian continent. However, whether this variation principally reflects long-term  presence, or rather the result of admixture associated with repeated migrations into this region in more recent historical  times, remains unclear. Here we investigate the underlying demographic history of Central Asian populations in explicit  relation to Western Europe, Eastern Asia and the Middle East. For this purpose we employ both full Bayesian and  approximate-Bayesian analyses of nuclear genetic diversity in 20 unlinked non-coding resequenced DNA regions,  known to be at least 200 kb apart from any known gene, mRNA or spliced EST (total length of 24 kb), and 22 unlinked  microsatellite loci.   
Using an approximate Bayesian framework, we find that present patterns of genetic diversity in Central Asia may be  best explained by a demographic history which combines long-term presence of some ethnic groups (Indo-Iranians)  with a more recent admixed origin of other groups (Turco-Mongols). Interestingly, the results also provide indications  that this region might have genetically influenced Western European populations, rather than vice versa. A further  evaluation in MCMC-based Bayesian analyses of isolation-with-migration models confirms the different times of  establishment of ethnic groups, and suggests gene flow into Central Asia from the east. The results from the  approximate Bayesian and full Bayesian analyses are thus largely congruent. In conclusion, these analyses illustrate  the power of Bayesian inference on genetic data and suggest that the high genetic diversity in Central Asia reflects both  long-term presence and admixture in more recent historical times. 
Population structure and evidence of selection in the Khoe-San and Coloured populations from southern Africa 
Carina Schlebusch 1 , Pontus Skoglund 1 , Per Sjödin 1 , Lucie Gattepaille 1 , Sen Li 1 , Flora Jay 2 , Dena Hernandez 3 , Andrew  Singleton 3 , Michael Blum 2 , Himla Soodyall 4,5 , Mattias Jakobsson 1 1 Uppsala University, Uppsala, Sweden,  2 Université Joseph Fourier, Grenoble, France,  3 National Institute on Aging (NIH),  Bethesda, USA,  4 University of the Witwatersrand, Johannesburg, South Africa,  5 National Health Laboratory Service,  Johannesburg, South Africa  

The San and Khoe people currently represent remnant groups of a much larger and widely distributed population of  hunter-gatherers and pastoralists who had exclusive occupation of southern Africa before the arrival of Bantu-speaking  groups in the past 1,200 years and sea-borne immigrants within the last 350 years. Mitochondrial DNA, Y-chromosome  and autosomal studies conducted on a few San groups revealed that they harbour some of the most divergent lineages  found in living peoples throughout the world.   

We used autosomal data to characterize patterns of genetic variation among southern African individuals in order to  understand human evolutionary history, in particular the demographic history of Africa. To this end, we successfully  genotyped ~ 2.3 million genome wide SNP markers in 220 individuals, comprising seven Khoe-San, two Coloured and  two Bantu-speaking groups from southern Africa. After quality filtering, the data were combined with publicly available  SNP data from other African populations to investigate stratification and demography of African populations.  

We also  applied a newly developed method of estimating population topology and divergence times. Genotypes and inferred  haplotypes were used to assess genetic diversity, patterns of haplotype variation and linkage disequilibrium in different  populations.  We found that six of the seven Khoe-San populations form a common population lineage basal to all other modern  human populations. The studied Khoe-San populations are genetically distinct, with diverse histories of gene flow with  surrounding populations. A clear geographic structuring among Khoe-San groups was observed, the northern and  southern Khoe-San groups were most distinct from each other with the central Khoe-San group being intermediate. The  Khwe group contained variation that distinguished it from other Khoe-San groups. Population divergence within the  Khoe-San group is approximately 1/3 as ancient as the divergence of the Khoe-San as a whole to other human  populations (on the same order as the time of divergence between West Africans and Eurasians). Genetic diversity in  some, but not all, Khoe-San populations is among the highest worldwide, but it is influenced by recent admixture. We  furthermore find evidence of a Nilo-Saharan ancestral component in certain Khoe-San groups, possibly related to the  introduction of pastoralism to southern Africa.   

We searched for signatures of selection in the different population groups by scanning for differentiated genome-regions  between populations and scanning for extended runs of haplotype homozygosity within populations. By means of the  selection scans, we found evidence for diverse adaptations in groups with different demographic histories and modes of  subsistence. 
Impacts of life-style on human evolutionary history: A genome-wide comparison of herder and farmer  populations in Central Asia 
Michael C. Fontaine 1,2 , Laure Segurel 2,3 , Christine Lonjou 4 , Tatiana Hegay 5 , Almaz Aldashev 6 , Evelyne Heyer 2 , Frederic  Austerlitz 1,2 1 Ecology, Systematics & Evolution. UMR8079 Univ. Paris Sud - CNRS - AgroParisTech, Orsay, France,  2 EcoAnthropologie et Ethnobiologie, UMR 7206 CNRS, MNHN, Univ Paris Diderot, Sorbonne Paris Cite, Paris, France, 3 Department of Human Genetics, University of Chicago, Chicago, USA,  4 C2BiG (Centre de  Bioinformatique/Biostatistique Genomique d’Ile de France), Plateforme Post-genomique P3S, Hopital Pitie Salpetriere,  Paris, France,  5 Uzbek Academy of Sciences, Institute of Immunology, Tashkent, Uzbekistan,  6 Institute of Molecular  Biology and Medicine, National Center of Cardiology and Internal Medicine, Bishkek,  

Kyrgyzstan Human populations use a variety of subsistence strategies to exploit an exceptionally broad range of habitats and  dietary components. These aspects of human environments have changed dramatically during human evolution, giving  rise to new selective pressures. Here we focused on two populations in Central Asia with long-term contrasted lifestyles:  Kyrgyz’s that are traditionally nomadic herders, with a traditional diet based on meat and milk products, and Tajiks that  are traditionally agriculturalists, with a traditional diet based mostly on cereals. We genotyped 93 individuals for more  than 600,000 SNP markers (Human-660W-Quad-V1.0 from Illumina) spread across the genome. We first analysed the  population structure of these two populations in the world-wide context by combining our results with other available  genome-wide data. Principal component and Bayesian clustering analyses revealed that Tajiks and Kirgiz’s are both  admixed populations which differed however from each other with respect to their ancestry proportions: Tajiks display a  much larger proportion of common ancestry with European populations while Kirgiz’s share a larger common ancestry  with Asiatic populations. We then examined the region of the genome displaying unusual population differentiation  between these two populations to detect natural selection and checked whether they were specific to Central Asia or  not. We complemented these analyses with haplotype-based analyses of selection. 
Bayesian inference of the demographic history of Niger-Congo speaking populations 
Isabel Alves 1,2 , Lounès Chikhi 2,3 , Laurent Excoffier 1,4 1 CMPG, Institute of Ecology and Evolution, Berne, Switzerland,  2 Population and Conservation Genetics Group, Instituto  Gulbenkian de Ciência, Oeiras, Portugal,  3 CNRS, Université Paul Sabatier, ENFA, Toulouse, France,  4 Swiss Institute of  Bioinformatics, Lausanne, Switzerland  
The Niger-Congo phylum encompasses more than 1500 languages spread over sub-Saharan Africa. This current wide  range is mostly due to the spread of Bantu-speaking people across sub-equatorial regions in the last 4000-5000 years.  Although several genetic studies have focused on the evolutionary history of Bantu-speaking groups, much less effort  has been put into the relationship between Bantu and non-Bantu Niger-Congo groups. Additionally, archaeological and  linguistic evidence suggest that the spread of these populations occurred in distinct directions from the core region  located in what is now the border between Nigeria and Cameroon towards West and South Africa, respectively. We  have performed coalescent simulations within an approximate Bayesian computation (ABC) framework in order to  statistically evaluate the relative probability of alternative models of the spread of Niger-Congo speakers and to infer  demographic parameters underlying these important migration events. We have analysed 61 high-quality microsatellite  markers, genotyped in 130 individuals from three Bantu and three non Bantu-speaking populations, representing a  "Southern wave" or the Bantu expansion, and a "Western wave", respectively. Preliminary results suggest that models  inspired by a spatial spread of the populations are better supported than classical isolation with migration (IM) models.  We also find that Niger-Congo populations currently maintain high levels of gene flow with their neighbours, and that  they expanded from a single source between 200 and 600 generations, even though available genetic data do not  provide enough information to accurately infer these demographic parameters.

A genetic study of skin pigmentation variation in India  
Mircea Iliescu1 , Chandana Basu Mallick 2,3 , Niraj Rai 4 , Anshuman Mishra 4 , Gyaneshwer Chaubey 2 , Rakesh Tamang 4 ,  Märt Möls 3 , Rie Goto 1 , Georgi Hudjashov 2,3 , Srilakshmi Raj 1 , Ramasamy Pitchappan 5 , CG Nicholas Mascie-Taylor 1 , Lalji  Singh 4,6 , Marta Mirazon-Lahr 7 , Mait Metspalu 2,3 , Kumarasamy Thangaraj 4 , Toomas Kivisild 1,3 1 Division of Biological Anthropology, University of Cambridge, Cambridge, UK,  2 Evolutionary Biology Group, Estonian  Biocentre, Tartu, Estonia,  3 Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia,  4 Centre for Cellular  and Molecular Biology, Hyderabad, India,  5 Chettinad Academy of Research and Education, Chettinad Health City,  Chennai, India,  6 Banaras Hindu University, Varanasi, India,  7 Leverhulme Centre for Human Evolutionary Studies,  Division of Biological Anthropology, University of Cambridge, Cambridge, UK  

Human skin colour is a polygenic trait that is primarily determined by the amount and type of melanin produced in the  skin. The pigmentation variation between human populations across the world is highly correlated with geographic  latitude and the amount of UV radiation. Association studies together with research involving different model organisms  and coat colour variation have largely contributed to the identification of more than 378 pigmentation candidate genes.  These include TYR OCA2, that are known to cause albinism, MC1R responsible for the red hair phenotype, and genes  such as MATP, SLC24A5 and ASIP that are involved in normal pigmentation variation. In particular, SLC24A5 has been  shown to explain one third of the pigmentation difference between Europeans and Africans. However, the same gene  cannot explain the lighter East Asian phenotype; therefore, light pigmentation could be the result of convergent  evolution. A study on UK residents of Pakistani, Indian and Bangladeshi descent found significant association of  SLC24A5, SLC45A2 and TYR genes with skin colour. While these genes may explain a significant proportion of  interethnic differences in skin colour, it is not clear how much variation such genes explain within Indian populations  who are known for their high level of diversity of pigmentation. We have tested 15 candidate SNPs for association with  melanin index in a large sample of 1300 individuals, from three related castes native to South India. Using logistic  regression model we found that SLC24A5 functional SNP, rs1426654, is strongly associated with pigmentation in our  sample and explains alone more than half of the skin colour difference between the light and the dark group of  individuals. Conversely, the other tested SNPs fail to show any significance; this strongly argues in favour of one gene  having a major effect on skin pigmentation within ethnic groups of South India, with other genes having small additional  effects on this trait. We genotyped the SLC24A5 variant in over 40 populations across India and found that latitudinal  differences alone cannot explain its frequency patterns in the subcontinent. Key questions arising from this research are  when and where did the light skin variant enter South Asia and the manner and reason for it spreading across the Indian  sub-continent. Hence, a comprehensive view of skin colour evolution requires that in depth sequence information be  corroborated with population (genetic) history and with ancient DNA data of past populations of Eurasia