A curated catalogue of human genomic structural variation




Variant Details

Variant: esv3555



Internal ID5936408
Landmark
Location Information
TypeCoordinatesAssemblyOther Links
Outerchr5:148082523..148082968hg38UCSC Ensembl
Outerchr5:147462086..147462531hg19UCSC Ensembl
Outerchr5:147442279..147442724hg18UCSC Ensembl
Cytoband5q32
Allele length
AssemblyAllele length
hg38446
hg19446
hg18446
Variant TypeCNV loss
Copy Number
Allele State
Allele Origin
Probe Count
Validation Flag
Merged StatusM
Merged Variants
Supporting Variantsessv25996
SamplesYH
Known GenesSPINK5
MethodSequencing
AnalysisWe defined a read pair as a diagnostic paired-end (PE) if the two ends of a read pair 1) could both be aligned but 2) could not meet the pair-end insert size and/or orientation requirement. We grouped abnormally mapped paired-end reads with coordinate distances smaller than the maximum insert size on both ends into diagnostic PE clusters. In order to avoid misalignment, PE clusters with greater than 4 pairs were discarded. Common structural variations like deletions, translocations, duplications, inversions etc. were examined and summarized into alignment models. This paired-end method (PEM) is substantially biased to deletion events. In addition to searching for SVs using paired-end methods (PEM), we also identified copy number variations (CNV) based on read depth. By modeling sequencing depth distribution on different levels of GC content, we found 1701 CNV regions that had a lower copy number (1-47 kb in length, median at 1 kb) and 1299 that had a higher copy number (1-105 kb in length, median at 1 kb) than NCBI36. Approximately 82% of the CNV regions that had a lower copy number and 61% CNVs that had a higher copy number had more than a 50% overlap with the annotated repeats, which was not surprising. However, these regions may be somewhat inaccurate because sequencing depth can be affected by many factors, including GC content and alignment difficulties in repetitive region.
PlatformIllumina Genome Analyzer
Comments
ReferenceWang_et_al_2008
Pubmed ID18987735
Accession Number(s)esv3555
Frequency
Sample Size1
Observed Gain0
Observed Loss1
Observed Complex0
Frequencyn/a


Hosted by The Centre for Applied Genomics
Grant support for DGV
Please read the usage disclaimer