A curated catalogue of human genomic structural variation




Variant Details

Variant: essv26819



Internal ID12620984
Landmark
Location Information
TypeCoordinatesAssemblyOther Links
Outerchr2:132271803..133848958hg38UCSC Ensembl
Outerchr2:133029376..134606529hg19UCSC Ensembl
Outerchr2:132745846..134322999hg18UCSC Ensembl
Cytoband2q21.2
Allele length
AssemblyAllele length
hg381577156
hg191577154
hg181577154
Variant TypeOTHER complex
Copy Number
Allele State
Allele Origin
Probe Count
Validation Flag
Merged StatusS
Merged Variantsesv4378
Supporting Variants
SamplesYH
Known GenesGPR39, LYPD1, MIR7853, NCKAP5
MethodSequencing
AnalysisWe defined a read pair as a diagnostic paired-end (PE) if the two ends of a read pair 1) could both be aligned but 2) could not meet the pair-end insert size and/or orientation requirement. We grouped abnormally mapped paired-end reads with coordinate distances smaller than the maximum insert size on both ends into diagnostic PE clusters. In order to avoid misalignment, PE clusters with greater than 4 pairs were discarded. Common structural variations like deletions, translocations, duplications, inversions etc. were examined and summarized into alignment models. This paired-end method (PEM) is substantially biased to deletion events. In addition to searching for SVs using paired-end methods (PEM), we also identified copy number variations (CNV) based on read depth. By modeling sequencing depth distribution on different levels of GC content, we found 1701 CNV regions that had a lower copy number (1-47 kb in length, median at 1 kb) and 1299 that had a higher copy number (1-105 kb in length, median at 1 kb) than NCBI36. Approximately 82% of the CNV regions that had a lower copy number and 61% CNVs that had a higher copy number had more than a 50% overlap with the annotated repeats, which was not surprising. However, these regions may be somewhat inaccurate because sequencing depth can be affected by many factors, including GC content and alignment difficulties in repetitive region.
PlatformIllumina Genome Analyzer
Comments
ReferenceWang_et_al_2008
Pubmed ID18987735
Accession Number(s)essv26819
Frequency
Sample Size1
Observed Gain0
Observed Loss0
Observed Complex1
Frequencyn/a


Hosted by The Centre for Applied Genomics
Grant support for DGV
Please read the usage disclaimer