NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Series GSE166483 Query DataSets for GSE166483
Status Public on Oct 26, 2021
Title Cataloging human PRDM9 variability utilizing long-read sequencing technologies reveals PRDM9 population-specificity and two distinct groupings of related alleles
Organism Homo sapiens
Experiment type Genome binding/occupancy profiling by high throughput sequencing
Other
Summary The PRDM9 protein determines sites of meiotic recombination in humans by directing meiotic DNA double strand breaks to specific loci. Targeting specificity is encoded by a long array of C2H2 zinc fingers that binds to DNA. This zinc finger array is hypervariable, resulting in innumerable alleles, each with a potentially different DNA binding preference. Assessment of PRDM9 diversity is important for understanding the complexity of human population genetics, inheritance linkage patterns, and the predisposition to genetic disease. Due to the repetitive nature of the PRDM9 zinc finger array, large-scale sequencing of human PRDM9 is challenging. We therefore developed a long-read sequencing strategy to infer the diploid PRDM9 zinc finger array genotype in a high-throughput manner. From an unbiased study of PRDM9 allelic diversity in 716 individuals from 7 human populations, we detected 70 high-confidence PRDM9 alleles. Several alleles differ in frequency among human populations and 33 alleles had not been identified by previous studies, which were heavily biased to European populations. PRDM9 alleles are distinguished by their DNA binding site preferences and fall into two major categories related to the most common PRDM9-A and PRDM9-C alleles. We also find that inter-conversion between allele types is likely rare. By mapping meiotic DSBs in testis, we find that small variations in PRDM9 can substantially alter the meiotic recombination landscape, demonstrating that minor PRDM9 variants may play an under-appreciated role in shaping patterns of human recombination. In summary, our data greatly expands our knowledge of PRDM9 diversity in humans.
 
Overall design Genotyping of PRDM9 in 7 human populations using amplicon enrichment coupled to long-read sequencing. DSB hotspot mapping in human males with a variant of the most common PRDM9-A allele.
**Please note that as the raw data need to be processed through a compute-intensive, specialized alignment pipeline, the (extensively processed) BAM files are included as processed data with special exception.
 
Contributor(s) Alleva B, Brick K, Pratto F, Huang M, Camerini-Otero RD
Citation(s) 34805134
Submission date Feb 09, 2021
Last update date Nov 30, 2021
Contact name Kevin Brick
E-mail(s) brickkm@mail.nih.gov, kevbrick@gmail.com, brickkm@niddk.nih.gov
Organization name NIDDK
Department GBB
Street address 5/205 Memorial Drive
City Bethesda
State/province MD
ZIP/Postal code 20892
Country USA
 
Platforms (3)
GPL16791 Illumina HiSeq 2500 (Homo sapiens)
GPL24106 MinION (Homo sapiens)
GPL28352 Sequel II (Homo sapiens)
Samples (18)
GSM5072244 AA1_dmc1_SSDS [re-analysis]
GSM5072245 AA1_input_SSDS [re-analysis]
GSM5072246 AA2_dmc1_SSDS [re-analysis]
Relations
BioProject PRJNA701043
SRA SRP305549

Download family Format
SOFT formatted family file(s) SOFTHelp
MINiML formatted family file(s) MINiMLHelp
Series Matrix File(s) TXTHelp

Supplementary file Size Download File type/resource
GSE166483_File_S1_PRDM9_genotypes.txt.gz 29.0 Kb (ftp)(http) TXT
GSE166483_RAW.tar 168.5 Gb (http)(custom) TAR (of BAM, BEDGRAPH, BIGWIG)
GSE166483_hotspotsData.tab.gz 1.3 Mb (ftp)(http) TAB
SRA Run SelectorHelp
Raw data are available in SRA
Processed data provided as supplementary file

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap