Population genetics: Genetic diversity and its importance

Population genetics studies how genetic variation changes in populations. It’s key to understanding evolution and how species survive. By looking at genetic diversity, scientists learn about a population’s history and health risks.
Genetic variation is crucial to study. It helps us understand a population’s past, how it adapts, and its health. This knowledge is vital for many fields.
Population genetics is important for many studies. It helps us understand genetic differences in communities and diseases. This field offers deep insights into biology, medicine, and conservation.
Introduction to Population Genetics
Capturing genetic variations in a population is key in population genetics. Genotyping arrays help find known genetic variants across the genome. But, they have limits like not working well with diverse populations and covering only a few genomic sites.
This can miss important genetic variations. It makes it hard to find signs of natural selection or disease links.
To fix these issues, imputation is used. It’s a method that guesses unobserved genetic variants by matching haplotypes in a study with a big reference panel. This panel usually has sequencing data from many samples. The quality of the reference panel greatly affects imputation’s accuracy.
Capturing Genetic Variations in Populations
Genotyping arrays are a common tool in population genetics. They help spot known genetic variations across the genome. But, they only cover a few genomic sites and don’t work well with diverse populations.
This can lead to missing important genetic variations. It makes it tough to find signs of natural selection or disease links.
Imputation is a powerful solution. It uses stats to guess unobserved genetic variants by comparing haplotypes in a study with a big reference panel. This panel has sequencing data from many samples, showing a wide range of genetic diversity. The quality of the reference panel is crucial for imputation’s success.
Using imputation and big reference panels helps researchers understand genetic variations better. This is key for meaningful studies, finding natural selection signs, and linking genetic variations to traits or diseases.
Challenges in Capturing Genetic Variations
Accurately capturing genetic variations across diverse populations is a big challenge in population genetics. The main reference panels, like the 1000 Genomes Project, are mostly from European and East Asian ancestries. This leaves out many other populations, especially those from Southeast Asia.
These panels’ bias can make imputation less accurate, especially for rare variants. Rare variants are crucial in disease studies because they often have big effects. They help us understand disease risks in specific populations.
To fix this, we need more diverse and comprehensive reference panels. These should reflect the global genetic diversity. This will help us impute more accurately and find genetic links specific to certain populations. It will make genetic research and healthcare more effective and inclusive.

The lack of diversity in current reference panels is a big hurdle in using population genetics to tackle health disparities. By adding more diverse panels, researchers can gain insights into disease causes and treatment responses in underserved groups. This is key for improving precision medicine.
Imputation to Address Limitations
In population genetics, imputation is a key tool. It helps by using reference panels to guess the genotypes of SNPs not seen in studies. These panels have lots of genetic data.
Statistical Inference of Ungenotyped SNPs
Imputation finds similar DNA sequences in study data and reference panels. It guesses the genotypes of unseen SNPs. The quality of the reference panel is crucial for this.
Imputation is vital in genetics research. It lets researchers study more genetic traits than before. This is because it guesses unseen genetic markers.

But, imputation’s success varies by population. It works best when the study and reference panels are genetically similar. This is why making reference panels for different populations is important.
Reference Panel Sensitivity
The choice of reference panel greatly affects genetic imputation’s performance. Larger reference panels usually mean better imputation accuracy. They are more likely to have haplotypes that match the study dataset. Yet, mainstream reference panels like the 1000 Genomes Project and Haplotype Reference Consortium mainly focus on European and East Asian populations.
Populations from Southeast Asia are greatly underrepresented. This imbalance can reduce imputation accuracy, especially for rare variants. Rare variants show different genetic diversity across different populations. Having the right reference panels is key for accurate genotype imputation. They help infer unobserved genetic variants in study datasets.

Hormones: Chemical regulators in organisms
To tackle this issue, researchers are working on region-specific reference panels. These panels aim to capture the genetic diversity of underrepresented populations. By including a wider range of genetic variations, they can boost imputation accuracy for populations not well-covered by mainstream panels.
Population genetics: Genetic diversity and its importance
Understanding population genetics is key. It helps us see how genetic variations spread and evolve in groups. This knowledge is at the heart of understanding evolution, natural selection, and how species survive.
Genetic diversity is the base of population genetics. It lets us grasp how a population’s history, adaptation, and disease risks are linked. By studying genetic diversity, scientists can learn about a population’s past, how it adapts, and its health risks.
Population genetics looks at the genetic makeup of groups. It explores how gene flow, mutation, and natural selection shape genetic variations over time. This is crucial for seeing how species evolve and adapt to new environments.

Genetic diversity is key for species survival and adaptation. It’s also vital for studying diseases. By finding genetic links to health issues, researchers can find new treatments or ways to prevent diseases.
In short, population genetics is vital. It helps us understand genetic variations, evolution, natural selection, and how they relate to disease. This knowledge is essential for improving health and treatments.
Orang Asli: A Unique Population
The Orang Asli (OA) in Southeast Asia are fascinating in genetics. They have a long history and unique genes. These genes help them live in the tough rainforest.
They look different from most people in the area. This is because of their special genes.
But, we know very little about their genes. This is because there’s not much research on them. Recently, scientists made a “Southeast Asian Specific (SEA-specific) Reference Panel“. This helps us understand the OA’s genetic diversity better.
Long History and Distinct Genetic Characteristics
The OA have lived in Southeast Asia for thousands of years. They’ve adapted to the local environment in unique ways. This is shown in their genes.
Physical chemists have found special genes in the OA. These genes make them look and function differently.

Despite their importance, we don’t have much genetic data on the OA. Scientists are working to change this. They aim to create more ancestry-matched reference panels. These will help us understand the OA and other groups in Southeast Asia better.
Developing a Southeast Asia-Specific Reference Panel
To address the lack of Southeast Asian representation in reference panels, researchers created a “Southeast Asia-Specific (SEA-specific) Reference Panel”. They used a “Cross-panel Imputation” method. This panel was made by combining the GA100K and SG10K datasets, which included 2,550 samples and 113,851,450 genetic variants.
The SEA-specific panel showed more high-confidence variants than the 1000 Genomes Project when used on Orang Asli and Singapore Genome Variation Project datasets. This shows how crucial ancestry-matched reference panels are for capturing the genetic diversity of underrepresented groups.
The cross-imputation method added up to 3 million more imputed variants than non-cross imputed scenarios. It also found more imputed SNPs on the Orang Asli dataset than the “Meta-imputation” approach. This new method proves the worth of using diverse datasets to make a detailed reference panel. It better represents the Southeast Asia-specific reference panel and cross-panel imputation for studying the genetic diversity of these populations.

Cross-Panel Imputation Approach
Researchers used a new method called “Cross-panel Imputation” to create the Southeast Asia-Specific (SEA-specific) Reference Panel. This method mixed the GA100K and SG10K datasets. It added 68 million variants to the GA100K and 43 million variants to the SG10K.
After removing low-quality variants, the SEA-specific Reference Panel had 113,851,450 variants from 2,550 samples. This method was better than just using the overlapping variants from the three datasets. It showed how it can improve imputation accuracy for groups that are not well-represented.
Increasing Imputation Accuracy
The SEA-specific panel did a better job of imputing high-confidence variants than the 1000 Genomes Project (1KGP) in the Orang Asli (OA) and Singapore Genome Variation Project (SGVP) genotyping datasets. It also imputed SNPs with better quality scores (INFO, DR2, and R2) on the OA genotyping dataset compared to TOPMED and Human Genome Diversity Project.
The cross-imputation method added up to 3 million more imputed variants than non-cross-imputed variants. It also found more imputed SNPs on the OA datasets than the Meta-imputation approach.

| Metric | SEA-specific Panel | 1KGP | TOPMED | HGDP |
|---|---|---|---|---|
| High-confidence Variants Imputed | More | Less | Less | Less |
| Imputation Quality Scores | Better | Poorer | Poorer | Poorer |
Assessing Imputation Accuracy
It’s key to check how well reference panels work in genetics studies. Researchers compared the SEA-specific Reference Panel to others like the 1000 Genomes Project (1KGP), TOPMED, and the 1KGP-Human Genome Diversity Project (HGDP) [1]. They looked at how well it worked in Orang Asli and Singapore Genome Variation Project datasets.
Human genome: What we have Learned from DNA sequencing
When they made the sample sizes the same, the SEA-specific panel did better. It had more accurate imputed variants than the 1KGP panel for both datasets. This shows why using ancestry-matched reference panels is key for better imputation accuracy, especially for groups that are not well-represented.
| Reference Panel | High-Confidence Imputed Variants |
|---|---|
| SEA-specific Panel | More |
| 1KGP Panel | Less |

The study’s results show the importance of making reference panels for specific populations. This helps get better imputation accuracy. It also lets us learn more about the genetic diversity of groups like the Orang Asli and the Singapore Genome Variation Project.
Importance of Ancestry-Matched Reference Panels
In population genetics studies, the right reference panel is key. It must reflect the genetic diversity of the population. However, mainstream panels like the 1000 Genomes Project mainly focus on European and East Asian ancestries. Populations from Southeast Asia are often left out.
This imbalance can make imputation less accurate, especially for rare variants. These variants differ greatly in frequency across different populations.
Improving Imputation Accuracy for Rare Variants
Rare variants are vital in disease studies. They often have big effects and offer deep insights into disease risks. Using ancestry-matched panels, like the Southeast Asia-Specific (SEA-specific) Reference Panel, can greatly boost imputation accuracy for underrepresented groups.
This allows researchers to uncover valuable genetic diversity and health information.

The study in [https://www.nature.com/articles/s41525-024-00435-7] shows the benefits of using a SEA-specific panel. It found more accurate imputation and higher-confidence variants compared to usual panels. This highlights the need for reference panels that match the ancestry of the population being studied.
Impact on Disease Association Studies
Getting genetic variants right, especially rare ones, is key for disease studies. These studies need a wide dataset to spot natural selection signs and find disease links. Using panels like the Southeast Asia-Specific (SEA-specific) Reference Panel boosts imputation accuracy for groups often left out.
With a better view of genetic diversity, studies can understand disease better. This helps find new disease-associated variants. It also deepens our knowledge of complex diseases’ genetic roots.
The SEA-specific reference panel had 2,550 samples and 113,851,450 variants. It found more variants than the 1000 Genomes Project did for Orang Asli and Singapore Genome Variation Project datasets. It found 8.9 million SEA-specific variants for Orang Asli, and 1KGP found 8.1 million. For Singapore Genome Variation Project, it found 12.5 million SEA-specific variants, while 1KGP found 11.8 million.
The SEA-specific panel showed better recall and non-reference discordance rates. This shows how close the reference panel is to the ancestry matters for imputation accuracy. The cross-imputation method found more SNPs than Meta-imputation on Orang Asli datasets. But, it had slightly lower results on the Singapore Genome Variation Project dataset.

Thanks to these population genetics and imputation accuracy advances, disease studies can uncover new genetic insights. This is especially true for groups often overlooked. This knowledge can help create better treatments, improving health for diverse communities.
Unlocking Population Genetics Information
It’s key to understand the genetic diversity of groups often overlooked. These groups, like the Orang Asli in Southeast Asia, hold secrets about human evolution and health. But, we’ve not studied them enough, which limits our knowledge.
Creating special reference panels can help. For example, the Southeast Asia-Specific (SEA-specific) Reference Panel improves study accuracy. It helps us learn more about the genetics and health of these groups.
The SEA-specific panel has 2,550 samples and over 113 million genetic variants. It’s better at finding genetic information in Southeast Asian groups than older panels. For example, it found 8.9 million unique variants in Orang Asli data, more than the 1000 Genomes Project.
This panel also imputes data more accurately. It has higher quality scores than other panels when used on Orang Asli data. This shows how important it is to use panels that match the ancestry of the groups being studied.
Studying these groups can open up new research paths. It helps us understand human genetics better and could lead to new health discoveries. By focusing on these groups, scientists can make big strides in genetics and healthcare.

Future Directions
The Southeast Asia-Specific (SEA-specific) Reference Panel shows how vital it is to make ancestry-matched reference panels for diverse populations. We need to keep working on these panels to make our research more accurate. This will help us understand diseases better and lead to better healthcare for everyone.
Looking into groups like the Orang Asli can reveal new genetic information. This helps us understand human genetics better. It’s important for studying how humans adapt and stay healthy.
By making better reference panels, we can do more precise research. This research will help us understand health and disease in different groups. It will lead to more tailored and effective healthcare for all.
Extinction of species: Causes and consequences





