A curated catalogue of human genomic structural variation




Variant Details

Variant: dgv4e221



Internal ID22815703
Landmark
Location Information
TypeCoordinatesAssemblyOther Links
chr8:39375220..39520881hg38UCSC Ensembl
chr8:39232739..39378400hg19UCSC Ensembl
chr8:39351896..39497557hg18UCSC Ensembl
Cytoband8p11.22
Allele length
AssemblyAllele length
hg38145662
hg19145662
hg18145662
Variant TypeCNV loss
Copy Number
Allele State
Allele Origin
Probe Count
Validation Flag
Merged StatusM
Merged Variants
Supporting Variantsesv3693376, esv3692977
SamplesC010018, C010084, C010094, C010038, NA19192, C010080, C010093, C010139, C010078, NA19207, C010143, C010039, C010044, C010017, C010095, C010041, C010037, NA19208, C010087, C010147, C010085, C010142, C010040, C010016, NA19206, C010144, C010036, C010146, C010042, C010019
Known GenesADAM3A, ADAM5
MethodSNP array
AnalysisNormalised microarray signal intensity data for both cohorts was analysed with PennCNV (2009Aug27 v.) and QuantiSNP (v.2) programs to call putative CNVs in each individual. Settings and parameters suggested by authors were used for both algorithms together with the 'genomic wave' adjustment for the signal intensity data. Additionally, with PennCNV we used separate B allele-frequency files (PFB-file) - for HapMap YRI we used PennCNV's default PFB-file based on HapMap YRI samples. As a quality control measure, we checked that all samples met the following quality criteria calculated by the PennCNV program: LRR_SD <= 0.25, BAF_SD <= 0.05, BAF_DRIFT <= 0.002 and GCWF <= |0.04|. Raw CNV calls from PennCNV and QuantiSNP were then merged (as intersection, for each individual separately) with custom PERL script and only CNVs that were similarly called (same type of overlapping copy number change - gain or loss) were considered. From the resulting list of CNVs we filtered out CNVs i) called on X/Y chromosomes; ii) shorter than 1000 bp in length; iii) with QuantiSNP log Bayes Factor (LBF) less than 5. To ensure high-quality of the EGCUT CNV dataset, CNVs detected by PennCNV and QuantiSNP algorithms were further visually confirmed with Illumina Genome Viewer. For each CNV locus, signal intensity data for all corresponding family members was loaded simultaneously and visually inspected to confirm CNV calls and family members with no CNV call. CNV regions containing no visually detectable CNVs (or CNVs not called but visually distinguishable) were excluded. Throughout this study we used the NCBI Build 36/hg18 assembly coordinates of the human reference sequence.
Normalised microarray signal intensity data for both cohorts was analysed with PennCNV (2009Aug27 v.) and QuantiSNP (v.2) programs to call putative CNVs in each individual. Settings and parameters suggested by authors were used for both algorithms together with the 'genomic wave' adjustment for the signal intensity data. Additionally, with PennCNV we used separate B allele-frequency files (PFB-file) - for the EGCUT dataset we used general Estonian population-based dataset as the reference (n=1000). As a quality control measure, we checked that all samples met the following quality criteria calculated by the PennCNV program: LRR_SD <= 0.25, BAF_SD <= 0.05, BAF_DRIFT <= 0.002 and GCWF <= |0.04|. Raw CNV calls from PennCNV and QuantiSNP were then merged (as intersection, for each individual separately) with custom PERL script and only CNVs that were similarly called (same type of overlapping copy number change - gain or loss) were considered. From the resulting list of CNVs we filtered out CNVs i) called on X/Y chromosomes; ii) shorter than 1000 bp in length; iii) with QuantiSNP log Bayes Factor (LBF) less than 5. In order to achieve high-quality CNV dataset, we confirmed our HapMap YRI CNV calls with an independent set of validated CNV calls for the same HapMap YRI individuals. CNV calls generated and confirmed with custom Affymetrix (Emeryville, CA, USA) high-resolution microarrays (with 32 million unique oligonucleotide probes for CNV discovery and 800,000 unique probes for CNV confirmation) by Matsuzaki et al. [Matsuzaki H, Wang PH, Hu J, Rava R, Fu GK (2009) High resolution discovery and confirmation of copy number variants in 90 Yoruba Nigerians. Genome Biol 10: R125.] were downloaded from http://genomebiology.com/2009/10/11/R125/additional/. CNVs called by us and CNVs called and confirmed by Matsuzaki et al. were compared for each individual separately (custom PERL script) and only CNVs that were called in both datasets were considered. Throughout this study we used the NCBI Build 36/hg18 assembly coordinates of the human reference sequence.
PlatformIllumina Infinium Human1M-Duo DNA Analysis BeadChip (Illumina 1M)
Illumina Infinium Human370CNV-Quad DNA Analysis BeadChip (Illumina 370K)
Comments
ReferencePalta_et_al_2015
Pubmed ID25853576
Accession Number(s)dgv4e221
Frequency
Sample Size199
Observed Gain0
Observed Loss30
Observed Complex0
Frequencyn/a


Hosted by The Centre for Applied Genomics
Grant support for DGV
Please read the usage disclaimer