July 03, 2010

Y chromosomes of Arbereshe from Calabria

From the paper:
The Arbereshe are one of the largest linguistic minorities in Italy. They are the result of complicated movements of Albanians around the end of the 15th and beginning of the 16th century, often linked to the invasion of the Balkans by the Ottoman Empire. Despite that, it is generally agreed that most of the immigrants started moving from the south of Albania (Toskeria), with, very often, intermediate steps in Greece, particularly in the Peloponnese (Zangari 1941). Further evidence is provided by linguistic research, according to which Arberisht, the language spoken by Arbereshe, is part of the Tosk dialect group of Albanian, a language originally spoken in Toskeria (Babiniotis 1998).
On the sample:
The Arbereshe Y-chromosome variation was investigated by sampling individuals from different villages of the Pollino area (Calabria) who bear one of the founding surnames of the population. The genotyping was performed using 12 microsatellites (STRs) and 31 unique event polymorphisms (UEPs), defining, respectively, haplotypes and haplogroups. The Italian and Balkan genetic backgrounds were explored using the large amount of data provided by recent Y-chromosome studies in the two peninsulas and by literature data on STRs from forensic research.
Comparison of Y-haplogroup frequency and diversity between Albanians from Tirana and Arbereshe from Calabria (from Table III):

The presence of F*(xG,I,J,K) in Albanians is interesting as this occurs in Romania and Bosnia Herzegovina (all groups), and in South Apulia, It could potentially be haplogroup H and may reflect a Gypsy element that was not present when the Arbereshe moved to Italy from the Balkans.

Haplogroup I shows similar frequencies, but:
I-M170 is the most common Balkan haplogroup (Pericic et al. 2005a,b) and the second most frequent Arbereshe clade. Nevertheless, analysis of its network reveals unexpected results: most of the Arbereshe I-M170 haplotypes are not included in the Balkan cluster (Figure 3), but are located in the long branches containing mainly Italian chromosomes. Comparisons with literature data (Semino et al. 2000; Barac et al. 2003, Rootsi et al. 2004) show that the core haplotype of the Balkan cluster (16-14-15-13-31-24-11-11-13; locus order as above) is consistent with the almost Balkan exclusive I2a (formerly I1b) clade. The proposed interpretation of the Arbereshe as a proxy of the founder Albanian population leads us to hypothesize that the I2a clade was less common in the southern Balkans 500 years ago than nowadays. The very tight shape of the I2a cluster in the network suggests a very recent expansion of this haplogroup in the southern Balkans. Furthermore, I2a is still rare in
mountain populations such as the Albanians of Kosovo (Pericic et al. 2005a,b) and in a randomly selected Arbereshe sample from Rootsi et al. (2004).
This is an interesting finding in the light of recent evidence for selection in Y-haplogroup I.

The situation with J2 is also quite interesting as this is rarer in Arbereshe (3%) than Albanians (17%):
The scarcity of J2 chromosomes in the Arbereshe sample (1/40) is very difficult to explain, given that they are very common in both the Italian peninsula and the southern Balkans. Literature data on J2 indicate that most of the haplotypes included in the Balkan (B) cluster of the network (Figure 3) have an STR configuration consistent with the J2-M12 sub-clade (Di Giacomo et al. 2004; Semino et al. 2004; Cruciani et al. 2007). In contrast, most of the haplotypes in the other clusters agree with the STR configuration given for the J2-M67 clade, with its sub-clade J2-M92 (Di Giacomo et al. 2004). It is unconvincing to attribute the rarity of J2 in the Arbereshe to random sampling or to the effect of genetic drift. Furthermore, the Arbereshe sample analysed by Semino et al. (2004) also completely lacks the typically Balkan J2-M12 chromosomes. If we interpret our Arbereshe sample as representative of the founding Albanian population, we may hypothesize that the J2 haplogroup was considerably less diffuse in the southern Balkans five centuries ago than today.
What we can conclude from this study is that the founding Albanian population was J2- and I2a- lite compared to modern Albanians. The source for the I2a seems to be either the Albanization of people from the West Balkans and/or selection, although it would be difficult to see a massive increase in frequency in only five centuries. The I2a-deficiency of the Arbereshe also gives support to the theory that the Albanians are relatively recent arrivals from the northeast; this theory has been upheld in the past on the basis of the (i) their historical obscurity until the last millennium, and (ii) the paucity of native sea terms and Greek loanwords in Albanian, which is difficult to explain if Albanians always occupied their current location on the Adriatic.

The source of J2 is less clear, and could be either the Albanization of Greeks (the only Balkan population with a sizeable J2 frequency) or remnants of Muslim Anatolians from Ottoman times. However, modern Albanians belong mainly to clade J2b, while Anatolians belong to J2a. Thus, I tend to dismiss the Anatolian connection.

The low frequency of R1*x(R1a1) in the Arbereshe, together with the high E1b1b1a frequency are quite convincing of the Balkan origins of this population.

Ann Hum Biol.
2010 Jun 22. [Epub ahead of print]

Linking Italy and the Balkans. A Y-chromosome perspective from the Arbereshe of Calabria.

Boattini A, Luiselli D, Sazzini M, Useli A, Tagarelli G, Pettener D.


Background: The Arbereshe are an Albanian-speaking ethno-linguistic minority who settled in Calabria (southern Italy) about five centuries ago. Aim: This study aims to clarify the genetic relationships between Italy and the Balkans through analysis of Y-chromosome variability in a peculiar case study, the Arbereshe. Subject and methods: Founder surnames were used as a means to identify a sample of individuals that might trace back to the Albanians at the time of their establishment in Italy. These results were compared with data of more than 1000 individuals from Italy and the Balkans. Results: The distributions of haplogroups (defined using 31 UEPs) and haplotypes (12 STRs) show that the Italian and Balkan populations are clearly divergent from each other. Within this genetic landscape, the Arbereshe are characterized by two peculiarities: (a) they are a clear outlier in the Italian genetic background, showing a strong genetic affinity with southern Balkans populations; and (b) they retain a high degree of genetic diversity. Conclusion: These results support the hypothesis that the surname-chosen Arbereshe are representative of the Y-chromosome genetic variability of the Albanian founder population. Accordingly, the Arbereshe genetic structure can contribute to the interpretation of the recent biological history of the southern Balkans. Intra-haplogroup analyses suggest that this area may have experienced important changes in the last five centuries, resulting in a marked increase in the frequency of haplogroups I2a and J2.



  1. The Haplogroup I recent spread to the southern Balkans theory is interesting. This probably excludes it being brought by Slavic speakers (at its current frequencies) during the migrations of 1,300 years ago.

    Which group or groups, then, are responsible for bringing the haplogroup to the southern Balkans? Were Vlach migrations that substantial to bring it to its current levels? There are places in Greece where the haplogroup is frequent, like in Thrace and the city of Agrinion. If the Slavs didn't bring it there (at modern levels) and it wasn't indigenous, who brought it there? Haplogroup I is relatively common even in Crete, especially away from the Lasithi plateau.

    Are the Arbereshe that large of a group to conclude that the Balkan cluster was less frequent in southern Balkan populations over 500 years ago?

  2. I really consider this study flawed. There is all the probability that the Italian Albanians are composed of a limited number of founders from Albania with an usual array of haplogroups. It is well known that there is bleeding in minority groups, inter ethnic mixing taking in to the Italian Albanian community, and a loss of members who decide to assimilate into the greater Italian nation i.e pass as Italians and drop any connection with Albanian culture. An easier way to work out the infiltration of Italians into the Albanian minority is by studying their language. The retention of archaic words, the ingress of Italian Romance words will show the effects of a minority living among a majority of different language and ethnic group.

  3. If I had written this I would have been taken for a Nationalist, but what Ponto
    has written is my thinking from many time, knowing I the Arbereshe communities,
    the surnames they have and speaking also a little bit their language.
    The most interesting case of an Arbereshe is that of Ciulla, from Hora i Arbereshevet
    (Palermo, Sicily), that Argiedude defined an R1b1b2a with DYS385=11,11 like a
    Balkan cluster of R1b1b2.

    On Worldfamilies I wrote this:

    "Topic: An Albanian or an Italian R1b1b2a cluster? Argiedude found an Albanian cluster that Ciulla, an Italian Arberesh who belongs to it, has characterized so: "My Y-DNA line is Arberesh, part of the R1b Albanian/Balkans cluster ( key values at FTDNA`s 25 marker level are: DYS385a/b= 11-11, DYS459a= 8 and DYS464a/b/c/d similar to 14-16-16-17 )".

    I have found an Italian sample who could belong to the same cluster, but with more ancientness: the Albanian cluster is in fact very young. See Risso from Genova (Italy) (Ysearch EK3WY) with DYS393=14, DYS390=23, DYS385=11-11, DYS464=15-16-16-17, DYS461=10.

    Between Risso and Ciulla there is a TMRCA at about 2125-2550 YBP".

    No answer so far.

  4. "There is all the probability that the Italian Albanians are composed of a limited number of founders from Albania with an usual array of haplogroups".

    That was my first thoughts when I read the post. However the idea of the Albanians originally coming from somewhere like Dacia seems to ring true.

  5. Albanians have a strong element of nordic heretage. Since I am not a geneticist I don't know the name of the genome. This was made known to a European Genetics conference by Albanian institute of genetics. This claim was not disputed or accepted officialy by the conference. It was a genome that is very frequent in Sweeden. I remember the Albanian press ridiculed the claim of our Institute but they sticked in their claim. Since we are a obvoius mediterranian race I personally beleive that Nordic component comes from remnants of Viking conquests. Propably any Viking unit lost their way and settled there. But this is a fact. British anthropologists noticed that too. I am an Albanian by the way and not literate in genetics. If anyone can in simple words explain me what this study really means.


Stay on topic. Be polite. Use facts and arguments. Be Brief. Do not post back to back comments in the same thread, unless you absolutely have to. Don't quote excessively. Google before you ask.