May 23, 2008

Human population history with a copying model

The movies of the authors' scenario can be found at the bottom of the link. This bit should be emphasized:
One potential avenue for extending the current approach to achieve this goal would be to impute chromosomes from “ancestral populations,” which would both represent populations that existed in the past and also act as efficient donors for the modern haplotypes.
Rather than seeing human variation as a network of gene exchange between the sampled populations, the goal should be to create a rich history of mankind where the named identifiable present-day populations are seen as leaves of a hidden web of interconnections between inferred, but statistically sound past populations, which in turn would be mapped to historical-archaeological entities.

PLoS Genet 4(5): e1000078. doi:10.1371/journal.pgen.1000078

Inferring Human Colonization History Using a Copying Model

Garrett Hellenthal et al.


Genome-wide scans of genetic variation can potentially provide detailed information on how modern humans colonized the world but require new methods of analysis. We introduce a statistical approach that uses Single Nucleotide Polymorphism (SNP) data to identify sharing of chromosomal segments between populations and uses the pattern of sharing to reconstruct a detailed colonization scenario. We apply our model to the SNP data for the 53 populations of the Human Genome Diversity Project described in Conrad et al. (Nature Genetics 38,1251-60, 2006). Our results are consistent with the consensus view of a single “Out-of-Africa” bottleneck and serial dilution of diversity during global colonization, including a prominent East Asian bottleneck. They also suggest novel details including: (1) the most northerly East Asian population in the sample (Yakut) has received a significant genetic contribution from the ancestors of the most northerly European one (Orcadian). (2) Native South Americans have received ancestry from a source closely related to modern North-East Asians (Mongolians and Oroquen) that is distinct from the sources for native North Americans, implying multiple waves of migration into the Americas. A detailed depiction of the peopling of the world is available in animated form.



Maju said...

Interesting excersise but the choice of populations seems a little capricious in fact. I specially miss South Asians (what actually explains the relative lack of connection between Western and Eastern Eurasians) and also think that Mozabites look a bad choice for the Africa-Eurasia connection as well (rather a mixed population).

The ignorance of South Asians in any study that attempts to research the spread of humankind out of Africa is not justified in any case, knowing as we do now that this region surely played a major role in the Eurasian dispersal.

South Central Haplo said...

Maju, You are little generous on your comments.

Lot of studies like this with cherry picking and then generalize.

How can academia let these kind of articles to come through.

J. said...

Rather interesting point the influence of Orcadians (picts?) on Yakuts and North Chinese populations. Orkney has been peopled for more than 12000 years.

Polak said...

Buddy, you need to stop ad think carefully here.

Obviously this study doesn't mean Orcadians influenced the Yakuts drectly.

What it says is that a population very similar to the Orcadians did - in other words, Northern Europeans.

These scientists should really be more careful when making such statements. Online forums are now going to be full of people suggesting tha the ancient Tocharians are Scots. LOL

n/a said...

It's worth noting Yakuts carry Tat C Y chromosomes at high frequencies. The findings of this team are consistent with the belief that N3 (Tat C) Y chromosomes diffused from Northeastern Europeans to Siberians rather than the reverse.

Dienekes said...

Yakuts have some European mtDNA which is sufficient to explain their partial European ancestry. Haplogroup N3 is not Western Eurasian, but rather Proto-Uralic, i.e., non-Caucasoid and originated in present-day China. The link provided by n/a quotes Prof. Villems from 2001; however, Villems is the co-author of the 2006 study which ascribes its origin in China.

n/a said...

Yakut N3 shows very low STR variance.

Finnish N3 shows much higher STR variance.

Those are the facts--now, in 2006, or in 2001.

Much of the reasoning in the "counter-clockwise northern route" paper is awful, but here are some relevant excerpts:
notable, however, is the fact that the spatial dynamics of
the whole N and O haplogroups greatly differ from each
other. The split between N* and O is dated to 34.674.7
thousand years (ky)
. The age of STR variation of hg O in
Southeast Asia probably exceeds 26 ky,10

[. . .]
In this regard, the age of accumulated STR variation in hg
N, estimated on all combined data from N1, N2 and N3 at
15 loci (Supplementary Table 2), yields an estimate of
19.474.8 ky. However, as will be argued below, the
European subcluster of N2 and the Yakutian N3 might
have descended from single founders with multiple jumps
at several loci, thus causing a possible shift in statistical
estimates that assume a step-wise mutation model. When
those chromosomes are excluded, the age of hg N STR
diversity is somewhat younger, 14.274.0 ky.

[. . .]
Europe can be considered as a place of secondary
expansion of N3
. Indeed, hg N3 occurs at high frequencies
in the Volga-Ural Ugric groups and related Finns, Saami
and Estonians. One may notice that while STR variation is
relatively low in the Volga-Ural group, some north-European populations have high STR variance (eg, 0.32 in
: data from,36 without DYS385ab).

[. . .]
Phylogenetic analysis of STR variation (Figure 3) shows
two overlapping subclusters of N3, one of them encompassing
predominantly Volga-Ural region, Finnic- as well as
Turkic-speaking populations together with Altaian, and the
other one both Baltic-Finnic (Estonians, Karelians and
Vepsa) and east Slavs (Russians, Ukrainians), as well as
West-Slavonic Slovak N3 chromosomes. The Yakut Y
chromosomes form their specific branch; they are almost
identical to each other
, consistent with earlier studies.

From an even more recent paper co-authored by Villems:

The presence of hg N (shared by many East European
and Northeast Asian populations) in Russians is in
contrast with the very limited and spotty presence of other
typically East Asian NRY variants, such as Q and C in the
Russian Y chromosomal pool (Table 2). In this sense, our
results, encompassing the historical Russian area, are in
a good agreement with the earlier results on central-southern
Russians.14 More generally, it appears that there has
been only limited general east-to-west flow of Y chromosomes
alongside the steppe belt over long prehistoric and
historic times, possibly even since the peopling of the temperate
zone of Eurasia by anatomically modern humans
That leaves hg N which probably reached East Europe
already around the beginning of Holocene
the only
prominent Y chromosomal ‘‘common denominator’’ for
the North-East Asian and East European paternal heritage.

dienekes claims: "Yakuts have some European mtDNA which is sufficient to explain their partial European ancestry."

The Li paper suggests Yakuts are autosomally about 10% European. As far as I know, Yakuts have much less than 20% European mtDNA.

dienekes claims: "Haplogroup N3 is not Western Eurasian, but rather Proto-Uralic,"

The attempt to identify N3 with a supposed skull type makes no sense, of course. N3 originated over 10,000 years ago. We have no idea of the racial type of the individual in whom N3 originated. Moving forward to the present, millions of Caucasoid NE Europeans carry N3, as do some number of Mongoloid and racially-mixed North and East Asians.

Nor does the study dienekes links support his "non-Caucasoid" claim.
Even if a distinct "Proto-Uralic" type ancestral to modern Uralics (in varying proportions) existed at one time, the question of where the ancestors of this group came from and which groups they were most closely affiliated with racially would remain. E.g., were they cold-adapted Caucasoids?

Dienekes said...

The Li paper suggests Yakuts are autosomally about 10% European. As far as I know, Yakuts have much less than 20% European mtDNA.

The Yakuts originated in Lake Baykal, which is the easternmost boundary of Caucasoid expansions according to Russian anthropologists. They could easily have picked up some Caucasoid DNA from there.

Also, both Russians and Finns show admixture with Asian populations, both have very little Asian mtDNA, or Asian Y chromosomes other than N3 (in the case of the Russians, they also have N2, but this is 0-3.3% in the two recent Y chromosome studies in the region of Vologda).

Ebizur said...

I'd like to see a comparison of the STR diversity between the East Asian subclades of haplogroup N (i.e. N1*-LLY22g(xN1a, N1b, N1c) and N1a) and the Northeast European/Siberian subclades of haplogroup N (i.e. N1b and N1c). This might help to elucidate the region of origin of haplogroup N. In terms of the raw numbers, there should be at least as many haplogroup N individuals in East/Southeast Asia as there are in Europe and Siberia.

We also need to determine more precisely the branching order within haplogroup N, preferably through new SNPs.

Maju said...

N as a whole is clearly Eastern Eurasian. NO* is only found in East Asia and that's also the case of N* (Shanghai, Cambodia). Now N3 may (and probably did) have a more western or at least northern origin. Certainly the main area of distribution of N is Northern Eurasia, rather than its ancestral East Asia, where it's quite rare nowadays.

In any case haplogroups are not races. Just paternal (or maternal in the case of mtDNA) lineages. The overall genetics defining phenotype may perfectly be almost totally unrelated.

dienekesp said...

Yakuts have 6.4% Y-haplogroup R1a1, and 6% of Caucasoid mtDNA (Puzyrev et al. 2003, Russian Journal of Genetics, Vol. 39, No. 7, 2003, pp. 816–822) which is consistent with the low levels of European genomic admixture that they exhibit.

dienekesp said...

The genetic diversity of N3 chromosomes has been studied by Derenko et al. (2007):

"Based on STR variance analysis we observed that hg N3a is more diverse in Eastern Europe than in south Siberia. However, analysis of median networks showed that there are two STR subclusters of hg N3a, N3a1 and N3a2, that are characterized by different genetic histories. Age calculation of STR variation within subcluster N3a1 indicated that its first expansion occurred in south Siberia [approximately 10,000 years (ky)] and then this subcluster spread into Eastern Europe where its age is around 8 ky ago. Meanwhile, younger subcluster N3a2 originated in south Siberia (probably in the Baikal region) approximately 4 ky ago."

"The data obtained allow us to suggest Siberian origin of haplogroups N3 and N2 that are currently widespread in some populations of Eastern Europe."

Ebizur said...

Xue et al. (2006) found Haplogroup N1*-LLY22g(xN1a-M128, N1b-P43, N1c-TAT) in 11.8% (4/34) of a sample of Hani people, a Tibeto-Burman group living mostly in southwestern China. This is the only case I know of in which proven N1*-LLY22g Y-DNA has been found in more than 10% of a sample.

Hammer et al. (2005) found haplogroup N1-LLY22g(xN1a-M128, N1b-P43, N1c1-M178) in 15.0% (6/40) of a sample of southern Han Chinese and 30.2% (13/43) of a sample of Yi people, another Tibeto-Burman-speaking group from southwestern China. Since N1c-TAT(xN1c1-M178) has been found only in one Mongolian as far as I know, all of the N1(xN1a, N1b, N1c1) Y-DNA found among southern Han Chinese and Yi people is probably actually N1*(xN1a, N1b, N1c), so it looks like the populations that have significant amounts of N1*-LLY22g are Yi people, Hani people (who themselves claim to be derived from Yi people), and Han Chinese (especially those from the southern parts of the country).

However, then another problem arises: Yi and Hani people are supposed to be closely related to one another, and Han Chinese may also share some common ancestry with these Lolo-Burmese ethnic groups, so there is a fairly great probability that the large numbers of N1*-LLY22g Y-chromosomes found in these populations belongs to a single yet-undefined subclade of N1-LLY22g, such as "N1d" or "pre-N1a" or something of that sort.