Introduction to qpAdm#
qpAdm is a statistical method used in population genetics to model the ancestry of a target population as a mixture of several source populations. It is particularly useful for analyzing ancient DNA data, where researchers often want to understand the genetic contributions of different ancestral groups to a given population.
Development of qpAdm#
qpAdm was developed by David Reich and his colleagues as part of the ADMIXTOOLS software package. It builds on earlier methods for modeling admixture, such as f-statistics, but offers greater flexibility and accuracy in estimating ancestry proportions.
P Value and Model Fit#
The P value in qpAdm is used to assess the fit of the admixture model. A high P value indicates that the model has a good fit to the data, while a P value lower than 0.05 suggests that the model may not be valid. Researchers use the P value to compare different models and select the one that best represents the genetic history of the target population.
Standard Errors#
Standard errors in qpAdm provide a measure of uncertainty around the estimated admixture proportions. They help researchers understand the confidence they can have in the results and are crucial for interpreting the significance of the findings.
Why qpAdm Produces Reliable Models#
qpAdm is considered one of the most accurate methods for modeling ancient genetic admixture because it incorporates a robust statistical framework that accounts for various sources of error and uncertainty. It allows researchers to test multiple models and compare their fit to the data, ensuring that the conclusions drawn are based on solid evidence.
Requirements for a Valid qpAdm Model#
To ensure that a qpAdm model is valid, several conditions must be met:
- The source populations must be well-defined and representative of the ancestral groups being modeled.
- The target population must have sufficient SNP coverage to provide reliable estimates of ancestry proportions.
- The model must include appropriate right populations to help distinguish between different sources of ancestry.
- The P value must indicate a good fit for the model (e.g., above 0.05) to ensure that the model is statistically valid.
- The SE values should be lower than 8% / 0.0800 especially for a target population which has enough SNP coverage, to ensure that the estimates are not overly uncertain. While on the other hand, if the target population has low SNP coverage, such as under 300k, SE values up to 10% / 0.1000 may be acceptable, but caution should be exercised when interpreting the results.
- The Z scores for the admixture proportions should be sufficiently high (e.g., above 3.00) to indicate that the estimates are statistically significant and not due to random chance. While Z scores from 2.50 to 3.00 may be acceptable.
- The model should be tested against alternative models to confirm its robustness and reliability.
- The results should be consistent with known historical and archaeological evidence to support the inferred admixture events.
Left and Right Population Sets#
In qpAdm, the left populations are the source populations from which the target population is assumed to have inherited genetic material. The right population set includes outgroup populations that are used to help distinguish between different sources of ancestry. The choice of right populations is crucial for the accuracy of the model, as they provide a baseline for comparison and help to control for shared ancestry among the source populations.
Admixture Proportions#
Admixture proportions in qpAdm represent the estimated contributions of each source population to the target population. These proportions are calculated based on the genetic data and the specified model, and they provide insights into the ancestral composition of the target population. Accurate estimation of admixture proportions is essential for understanding the genetic history and demographic events that shaped the population under study.
qpAdm in practice#
For worked examples showing what these conditions look like applied to real populations, see our studies of the Picenes of Iron Age Italy, present-day Balkan populations, modern Anatolian Turks, and the Deep Maniots of southern Greece.




