Analysis | The algorithm used to call CNVs using the 500K EA platform was developed to accurately define CNV regions using a large set of reference samples and is described in detail in a separate publication (Komura 2006). The algorithm contains three major parts: 1) Intensity pre-processing using an improved version of Genomic Imbalance Map (GIM) (Ishikawa et al. 2005), including probe selection, noise reduction, normalization, and intensity ratio adjustment based on affinity differences between alleles of a SNP, 2) CNV extraction, which identifies CNVs from all pair-wise comparisons using a modified SW-ARRAY, and 3) A copy number inference step which utilizes signal ratios and SNP information to more precisely define CNV boundaries and the copy number within each region. |