To do this, I created an individual possessing the ancestral allele using hgdpGeo as a reference. According to the reference for this table:
Samples collected by the HGDP-CEPH from 1,043 individuals from around the world were genotyped for 657,000 SNPs at Stanford. Ancestral states for all SNPs were estimated using whole genome human-chimpanzee alignments from the UCSC database. For each SNP in the human genome (NCBI Build 35, UCSC database hg17), the allele at the corresponding position in the chimp genome (Build 2 version 1, UCSC database pantro2) was used as ancestral.
My new globe13anc calculator is simply a version of the latest globe13 one, but with an extra "Ancestral" component, so it has 13+1 = 14 ancestral components in total.
You can of course use globe13anc as any other calculator designed for DIYDodecad, and hopefully no one will get anything other than 0% for the "Ancestral" component :)
But, the main point of building this is to help you infer D-statistics with no suspicion that gene flow within the human species may affect the results; while the Khoesan of South Africa (where the Palaeo_African component is modal) are an approximate outgroup to the rest of mankind, there is evidence that even their most isolated groups have some external gene flow. So, using this "Ancestral" outgroup instead of Palaeo_African ought to make things cleaner for everyone.