Comprehensive Resource for the Arab Pangenome Reference (APR)
Delve into the intricate genetic tapestry that defines the Arab people. A Comprehensive Resource for the Arab World built with 106 Assemblies (53 Individuals).
The Arab Pangenome Project
Comprehensive Resource for the Arab World Delve into the intricate genetic tapestry that defines the Arab people. A Comprehensive Resource for the Arab World built with 106 Assemblies (53 Individuals).
Key Features
- 86 Assemblies: Double the insight with assemblies from 43 individuals, ensuring a broad spectrum of genetic representation.
- High-Definition Data: Utilizing 30x HiFi coverage and 50x ONT Ultra-long data, we offer unparalleled depth and accuracy in our genome sequences
- Full-Length RNA Sequencing: Delve deeper into the transcriptomic landscape with full-length RNA sequencing performed on Oxford Nanopore Technologies (ONT).
Why Explore The Arab Pangenome?
The Arab world is a region steeped in history, culture, and genetic diversity. By understanding the genetic composition of its people, we are not only unraveling the stories of our ancestors but are also equipping ourselves with valuable insights that can aid in medical research, personalized medicine, and population genetics.
How to Explore The Arab Pangenome
Venturing into the vastness of The Arab Pangenome is an exciting journey that offers a plethora of insights into the genetic diversity and richness of the Arab population. For those keen on actively engaging with our data, here’s a quick guide to help you navigate and harness the potential of our comprehensive dataset.
1. Aligning Reads to the Pangenome
Long Reads: Utilize the power of vg giraffe and GraphAligner to ensure precision and accuracy when aligning long-read sequences. These tools have been optimized for aligning longer sequences against a pangenome graph, capturing the comprehensive genetic variation within our dataset.
Short Reads: When working with short-read sequences, vg giraffe is the preferred method, ensuring seamless alignment and optimal results.
2. Calling Variants
For variant calling, use the vg call tool. It’s tailored to work in conjunction with the pangenome graph, providing a robust and accurate mechanism to call variants. This tool captures the intricate nuances of genetic variation, allowing researchers to unearth unique genetic markers and variations with precision.
Data
To work with the Arab Pangenome Reference, use the following files:
All the data can be downloaded from the following links:
All the code and workflows can be found on the following Github repo: