Database of Genomic Variants

About the Project

The objective of the Database of Genomic Variants is to provide a comprehensive summary of structural variation in the human genome. We define structural variation as genomic alterations that involve segments of DNA that are larger than >1kb. Now we also annotate InDels in 100bp-1kb range. The content of the database is only representing structural variation identified in healthy control samples.

The Database of Genomic Variants provides a useful catalog of control data for studies aiming to correlate genomic variation with phenotypic data. The database is continuously updated with new data from peer reviewed research studies. We always welcome suggestions and comments regarding the database from the research community.

If you want to submit variation data to the database, information about the submission process can be found here.

For data sets where the variation calls are reported at a sample by sample level, we merge calls with similar boundaries across the sample set. Only variants of the same type (i.e. CNVs, inversions, InDels) are merged, and gains and losses are merged separately. In addition, if several different platforms/approaches are used within the same study, these datasets are merged separately. Sample level calls that overlap by >= 70% are merged in this process.

Click here for answers to commonly asked questions and an overview about how to interpret the data in the database.

Database first described in Iafrate AJ, Feuk L, Rivera MN, Listewnik ML, Donahoe PK, Qi Y, Scherer SW, Lee C. Detection of large-scale variation in the human genome. Nat Genet. 2004 Sep;36(9):949-51.

For a more detailed description of the database structure and content: Zhang J, Feuk L, Duggan GE, Khaja R, Scherer SW. Development of bioinformatics resources for display and analysis of copy number and other structural variants in the human genome. Cytogenet Genome Res. 2006;115(3-4):205-14

An Advisory Board for the Database of Genomic Variants was formed in 2008. The board has the following members:

Dr. Nigel Carter - The Wellcome Trust Sanger Institute
Dr. Deanna Church - National Center for Biotechnology Information (NCBI)
Dr. Lars Feuk - The Hospital for Sick Children
Dr. Paul Flicek - European Bioinformatics Institute (EBI)
Dr. David Ledbetter - Emory University
Dr. Stephen Scherer - The Hospital for Sick Children

This database is supported through grants from Genome Canada/Ontario Genomics Institute, the McLaughlin Centre for Molecular Medicine, the Wellcome Trust and the Canadian Institutes for Health Research.