DeZign for Databases v5.2

Logical model

Attribute details

Analysis_Criteria
Analysis_Description
Analysis_ID
Analysis_ID
Analysis_Number
Analysis_Tool
Analysis_Tool_Description
Analysis_Tool_Type
Analysis_Tool_Version
Analysis_Type
ArrayExpress_Accession_Num
Biological_Source
Calculated_Frequency
Calc_Frequency_Qualifier
Case_or_Control
Chromosome
Chromosome_Length
Chromosome_Number
Chromosome_Number
Chromosome_Random_Num
Chromosome_Random_Num
Cohort_Description
Cohort_ID
Cohort_ID
Cohort_Name
Cohort_Name
Cohort_Name
Comments
Comments
Copy_Chromosome
Copy_Description
Copy_End
Copy_Number
Copy_Number_Count
Copy_Number_ID
Copy_Start
Count
Curation_Comments
Current_Record_Flag
Current_Record_Flag
Current_Record_Flag
Current_Record_Flag
Current_Record_Flag
Current_Record_Flag
Current_Record_Flag
Current_Record_Flag
Current_Record_Flag
Current_Record_Flag
Current_Record_Flag
Current_Record_Flag
Current_Record_Flag
Current_Record_Flag
Current_Record_Flag
Current_Record_Flag
Current_Record_Flag
Current_Record_Flag
Current_Record_Flag
Current_Record_Flag
Current_Record_Flag
Current_Record_Flag
Current_Record_Flag
Current_Record_Flag
Current_Record_Flag
Cytoband
Dataset_Analysis_ID
Dataset_Analysis_ID
Dataset_Analysis_Number
Dataset_Description
Dataset_ID
Dataset_ID
Dataset_ID
Dataset_ID
Dataset_ID
Dataset_Name
Dataset_Number
Dataset_Xref_ID
Description
DGVA_Inferred_Flag
DGV_Calculated_Size
DGV_Calculated_Size
DGV_End
DGV_End
DGV_Merged_Flag
DGV_Start
DGV_Start
Display_Name
End_A
End_B
End_Random
End_Random
Estimated_Size
Estimated_Size
Ethnicity
Ethnicity
External_Dataset_ID
External_ID_Source
External_ID_Source
External_ID_Source
External_ID_Type
External_ID_Type
External_Sample_ID
External_Variant_ID
Family_ID
Family_Member
Family_Type
Father_ID
Feature
Filter_ID
Filter_Reason
Filter_Step
Gender
Gender
Gene
Genotype
GEO_Accession_Num
Hold_Until_Publication_Flag
ID
ID
ID
ID
Individual_Sample_ID
Inheritance
Inner_End_A
Inner_End_B
Inner_Start_A
Inner_Start_B
Karyotype
Landmark
Load_Date
Load_Date
Load_Date
Load_Date
Load_Date
Load_Date
Load_Date
Load_Date
Load_Date
Load_Date
Load_Date
Load_Date
Load_Date
Load_Date
Load_Date
Load_Date
Load_Date
Load_Date
Load_Date
Load_Date
Load_Date
Load_Date
Load_Date
Load_Date
Load_Date
Load_Date
Load_Date
Load_Date
Load_Date
Load_Date
Merged_Analysis_ID
Merged_ID
Merging_Criteria
Method_Description
Method_Name
Method_Number
Method_Platform_Sample_ID
Method_Study_ID
Method_Study_ID
Method_Type
Mother_ID
NCBI_Tax_ID
Nucleotides_Covered
Number_of_CNVs_Overlapped
Number_of_Features
Number_of_Features_Overlapped
Number_of_Gains
Number_of_Gains_and_Losses
Number_of_Losses
OneK_Genomes_Project
Original_Method
Originators_of_Data
Outer_End_A
Outer_End_B
Outer_Start_A
Outer_Start_B
Percent_Coverage
Percent_of_CNVs_Overlapped
Percent_of_Features_Overlapped
Placement_Method
Placement_Method
Platform_Description
Platform_Name
Platform_Name
Platform_Number
Platform_Study_ID
Platform_Study_ID
Platform_Type
Platform_Version
Pooled_Sample_ID
Primary_Author_ID
Primary_Author_Name
Primary_Data_Flag
Principal_Investigator
Probe_Count
Project_ID
Publish_Date
PubMed_ID
Recipient
Recipient
Reference_Assembly
Reference_Assembly
Reference_Assembly
Reference_Assembly
Reference_Assembly
Reference_Assembly
Reference_Assembly_ID
Reference_Assembly_Name
Reference_Description
Reference_ID
Reference_ID
Reference_Number
Reference_Sample_ID
Reference_Sequence
Reference_Type
Remap_Status
Remap_Status
Reported_Frequency
Run_ID
Sample_Description
Sample_ID
Sample_ID
Sample_ID
Sample_Merged_Flag
Sample_Number
Sample_Pooled_ID
Sample_Size
Sample_Study_ID
Sample_Study_ID
Sample_Study_ID
Sample_Xref_ID
Sequence
Site
Size
Size
Source_System
Source_System
Source_System
Source_System
Source_System
Source_System
Source_System
Source_System
Source_System
Source_System
Source_System
Source_System
Source_System
Source_System
Source_System
Source_System
Source_System
Source_System
Source_System
Source_System
Source_System
Source_System
Source_System
Source_System
Source_Type
Span
Span
Start_A
Start_B
Start_Random
Start_Random
Study_Accession
Study_File_Prefix
Study_File_Prefix_ID
Study_ID
Study_ID
Study_ID
Study_ID
Study_ID
Study_ID
Study_ID
Study_ID
Study_ID
Study_ID
Study_ID
Study_ID
Study_ID
Study_ID
Study_ID
Study_ID
Study_ID
Study_ID
Study_ID
Study_ID
Study_ID
Study_ID
Study_ID
Study_Mapping
Study_Mapping
Subject_ID
Sub_Analysis_ID
Super_Analysis_ID
Supporting_Merged_Variant_ID
Supporting_Variant_ID
Table_ID
Table_Name
Total_Reporting
Translocation_Mapping_ID
Unmapped_Flag
Unmapped_Flag
Validation_Flag
Validation_Method
Validation_Platform_Name
Variant_Analysis_ID
Variant_Count
Variant_ID
Variant_ID
Variant_ID
Variant_ID
Variant_ID
Variant_ID
Variant_ID
Variant_ID
Variant_Mapping_ID
Variant_Number
Variant_Number
Variant_Number
Variant_Sub_Type
Variant_Type
Variant_Type_Description
Variant_Type_ID
Variant_Type_ID
Variant_Xref_ID
Zygosity

Attribute: Analysis_Criteria

Attribute details:

Entity name	Analysis
Description	If the criteria to perform an Analysis are described in a Study, they are captured here. E.g. parameters set in an Analysis Tool.
Primary key	No
Refers to
Data type	CHARACTER VARYING
Domain

Constraints:

Not null	No
Check
Default

Attribute: Analysis_Description

Attribute details:

Entity name	Analysis
Description	This field captures a brief description of the Analysis employed in a Study. This information is not always provided in a Study. Note: specific criteria used during the Analysis should be captured in the 'Analysis_Criteria' field and a general description captured in this field.
Primary key	No
Refers to
Data type	CHARACTER VARYING
Domain

Constraints:

Not null	No
Check
Default

Attribute: Analysis_ID

Attribute details:

Entity name	Dataset_Analysis
Description	This is the unique identifer for one Analysis performed within a Study. This can be used to cross-reference one record in the 'Analysis' table.
Primary key	No
Refers to	Analysis_ID
Data type	INTEGER
Domain

Constraints:

Not null	Yes
Check
Default

Attribute: Analysis_ID

Attribute details:

Entity name	Analysis
Description	This is the unique database identifer for one Analysis performed within a Study.
Primary key	Yes
Refers to
Data type	SERIAL
Domain

Constraints:

Not null	Yes
Check
Default

Attribute: Analysis_Number

Attribute details:

Entity name	Analysis
Description	This is a sequential number assigned to each Analysis employed in a Study. This value is not unique in this table or across the database. For example, if five Analyses were employed in a Study, each Analysis (before being loaded to the database) will be assigned a sequential number from "1" to "5" inclusive. This allows records from the input file to be easily cross-referened in the database, especially for quality checks.
Primary key	No
Refers to
Data type	INTEGER
Domain

Constraints:

Not null	No
Check
Default

Attribute: Analysis_Tool

Attribute details:

Entity name	Analysis
Description	This field captures the name of the Analysis Tool employed in a given Analysis. Examples of Analysis Tools are "Birdsuite", "CNAG", "Genemapper".
Primary key	No
Refers to
Data type	CHARACTER VARYING
Domain

Constraints:

Not null	No
Check
Default

Attribute: Analysis_Tool_Description

Attribute details:

Entity name	Analysis
Description	This is a description of the Analysis Tool that was used in a given Study. This information is not always provided for an Analysis Tool.
Primary key	No
Refers to
Data type	CHARACTER VARYING
Domain

Constraints:

Not null	No
Check
Default

Attribute: Analysis_Tool_Type

Attribute details:

Entity name	Analysis
Description	This field captures whether an Analysis Tool is an Algorithm or a Software Suite. This information is not always provided for a given Analysis Tool in a Study; DGVa and dbVar currently do not capture this information.
Primary key	No
Refers to
Data type	CHARACTER VARYING
Domain

Constraints:

Not null	No
Check
Default

Attribute: Analysis_Tool_Version

Attribute details:

Entity name	Analysis
Description	This is a version number for the Analysis Tool that was used in a given Study. This information is not always provided for an Analysis Tool.
Primary key	No
Refers to
Data type	CHARACTER VARYING
Domain

Constraints:

Not null	No
Check
Default

Attribute: Analysis_Type

Attribute details:

Entity name	Analysis
Description	If this information is provided, the Analysis Type will be captured in this field; this information is not always provided in a Study. An example of an Analysis Type is "split-read mapping".
Primary key	No
Refers to
Data type	CHARACTER VARYING
Domain

Constraints:

Not null	No
Check
Default

Attribute: ArrayExpress_Accession_Num

Attribute details:

Entity name	Platform_Study
Description	This field will capture the identifier/accession number that represents a given Platform within the ArrayExpress data repository. This information is not always provided in a Study. e.g. ArrayExpress accession number, A-AFFY-65
Primary key	No
Refers to
Data type	CHARACTER VARYING
Domain

Constraints:

Not null	No
Check
Default

Attribute: Biological_Source

Attribute details:

Entity name	Sample
Description	This field captures a brief description of the Biological Source of a Sample. For example, the 'Biological_Source' could be "cell line", "blood", "tissue" etc. Please note how this field relates to the 'Source_Type' field.
Primary key	No
Refers to
Data type	CHARACTER VARYING
Domain

Constraints:

Not null	No
Check
Default

Attribute: Calculated_Frequency

Attribute details:

Entity name	Variant_Type
Description	This field will serve as a placeholder in the case that DGV develops a frequency calculation step in the automated DGV-curation pipeline. This is a frequency that would be calculated during the DGV data curation process. In cases where no 'Reported_Frequency' is available, a Calculated Frequency may be derived from other information that was reported in a Study. The 'Calculated_Frequency' has an additional descriptor ('Calc_Frequency_Qualifier') to distinguish between the frequency as being "type" (number of Gains or Losses within a Study or Cohort) or "allele" (frequency of the allele in a Variant set).
Primary key	No
Refers to
Data type	INTEGER
Domain

Constraints:

Not null	No
Check
Default

Attribute: Calc_Frequency_Qualifier

Attribute details:

Entity name	Variant_Type
Description	This field will serve as a placeholder in the case that DGV develops a frequency calculation step in the automated DGV-curation pipeline. The 'Calc_Frequency_Qualifier' is an additional descriptor of 'Calculated_Frequency' to distinguish between the frequency as being either: "type" = number of Gains or Losses within a Study or Cohort OR "allele" = frequency of the allele in a Variant set This is a value that is assigned during the DGV data curation process.
Primary key	No
Refers to
Data type	CHARACTER VARYING
Domain

Constraints:

Not null	No
Check
Default

Attribute: Case_or_Control

Attribute details:

Entity name	Sample_Study
Description	This field will capture whether a Sample is a Case or a Control in a Study. This field will not always be populated. Note that for the current DGV only Controls, or healthy individuals, are part of the database.
Primary key	No
Refers to
Data type	CHARACTER VARYING
Domain

Constraints:

Not null	No
Check
Default

Attribute: Chromosome

Attribute details:

Entity name	Coverage
Description
Primary key	No
Refers to
Data type	CHARACTER VARYING
Domain

Constraints:

Not null	No
Check
Default

Attribute: Chromosome_Length

Attribute details:

Entity name	Coverage
Description
Primary key	No
Refers to
Data type	BIGINT
Domain

Constraints:

Not null	No
Check
Default

Attribute: Chromosome_Number

Attribute details:

Entity name	Translocation_Mapping
Description	This is the primary Chromosome on which a Translocation Variant has been discovered.
Primary key	No
Refers to
Data type	CHARACTER VARYING
Domain

Constraints:

Not null	No
Check
Default

Attribute: Chromosome_Number

Attribute details:

Entity name	Variant_Mapping
Description	This is the primary Chromosome on which a Variant has been mapped. The 'Chromosome_B_Number' captures the secondary Chromosome on which a Variant has been discoverd, for example, in the case of a translocation.
Primary key	No
Refers to
Data type	CHARACTER VARYING
Domain

Constraints:

Not null	No
Check
Default

Attribute: Chromosome_Random_Num

Attribute details:

Entity name	Translocation_Mapping
Description	This field is to capture coordinates for variants that cannot be mapped on a chromosome but for which we know to what chromosome the variant belongs.
Primary key	No
Refers to
Data type	CHARACTER VARYING
Domain

Constraints:

Not null	No
Check
Default

Attribute: Chromosome_Random_Num

Attribute details:

Entity name	Variant_Mapping
Description	This field is to capture coordinates for variants that cannot be mapped on a chromosome but for which we know to what chromosome the variant belongs.
Primary key	No
Refers to
Data type	CHARACTER VARYING
Domain

Constraints:

Not null	No
Check
Default

Attribute: Cohort_Description

Attribute details:

Entity name	Cohort
Description	This is a short description of the Cohort (within a Study). For example, the description can include details about what is common among the individuals within the Cohort, such as details about gender, age, etc.
Primary key	No
Refers to
Data type	CHARACTER VARYING
Domain

Constraints:

Not null	No
Check
Default

Attribute: Cohort_ID

Attribute details:

Entity name	Sample
Description	This is the Cohort to which a Sample belongs. This field may not be populated. A Sample could belong to a commonly referenced Cohort such as HapMap or the Human Genome Diversity Panel. A Sample could also belong to a Cohort within a specific Study.
Primary key	No
Refers to	Cohort_ID
Data type	INTEGER
Domain

Constraints:

Not null	No
Check
Default

Attribute: Cohort_ID

Attribute details:

Entity name	Cohort
Description	This is the unique identifier for one Cohort that is in the DGV.
Primary key	Yes
Refers to
Data type	SERIAL
Domain

Constraints:

Not null	Yes
Check
Default

Attribute: Cohort_Name

Attribute details:

Entity name	Variant
Description	This is the Cohort to which a Variant belongs; this information may or may not be provided in a Study. This information is more commonly found in the Sample related tables ('Sample', 'Sample_Xref') but could also be captured here in cases when a Variant is reported with no associated Sample. Examples of Cohorts are "HapMap" or "Human Genome Diversity Project".
Primary key	No
Refers to
Data type	CHARACTER VARYING
Domain

Constraints:

Not null	No
Check
Default

Attribute: Cohort_Name

Attribute details:

Entity name	Sample_Xref
Description	This is the Cohort to which a Sample belongs; this information may or may not be provided in a Study. Examples of Cohorts are "HapMap" or "Human Genome Diversity Panel/Project".
Primary key	No
Refers to
Data type	CHARACTER VARYING
Domain

Constraints:

Not null	No
Check
Default

Attribute: Cohort_Name

Attribute details:

Entity name	Cohort
Description	This is the name given to a Cohort within a given Study. Examples of Cohort Names are: "Human Genome Diversity Panel" or "HapMap".
Primary key	No
Refers to
Data type	CHARACTER VARYING
Domain

Constraints:

Not null	No
Check
Default

Attribute: Comments

Attribute details:

Entity name	Variant
Description	If there is any supplemental information about a Variant that cannot be populated into any of the other fields in the 'Variant' table, that information will go here. e.g. "does not validate experimentally" may need to be appended to/annotated for a Variant
Primary key	No
Refers to
Data type	CHARACTER VARYING
Domain

Constraints:

Not null	No
Check
Default

Attribute: Comments

Attribute details:

Entity name	Variant_Type
Description	If further details about the Variant Type are provided, those details could be captured in this field.
Primary key	No
Refers to
Data type	CHARACTER VARYING
Domain

Constraints:

Not null	No
Check
Default

Attribute: Copy_Chromosome

Attribute details:

Entity name	Copy_Number
Description	This is the Chromosome on which the Copy of the CNV is found.
Primary key	No
Refers to
Data type	CHARACTER VARYING
Domain

Constraints:

Not null	No
Check
Default

Attribute: Copy_Description

Attribute details:

Entity name	Copy_Number
Description	This is a desription of the structure, or composition of a given Copy of a CNV. An example of a description is "inverted copy" or "copy with SNP".
Primary key	No
Refers to
Data type	CHARACTER VARYING
Domain

Constraints:

Not null	No
Check
Default

Attribute: Copy_End

Attribute details:

Entity name	Copy_Number
Description	This is the end coordinate for the Copy.
Primary key	No
Refers to
Data type	CHARACTER VARYING
Domain

Constraints:

Not null	No
Check
Default

Attribute: Copy_Number

Attribute details:

Entity name	Copy_Number
Description	This is the sequence number of the Copy in a given CNV. For example, if a CNV has 5 copies, there will be five records in this table, each representing one of the Copies of that CNV. For the five records in this example , 'Copy_Number' will have the values "1", "2", "3", "4", and "5". To find the total number of copies in one CNV, perform a query on the 'Variant' table (can use the 'Variant_ID' from this table) and get the 'Copy_Number_Count'.
Primary key	No
Refers to
Data type	CHARACTER VARYING
Domain

Constraints:

Not null	Yes
Check
Default

Attribute: Copy_Number_Count

Attribute details:

Entity name	Variant_Type
Description	For Variants that are of type "CNV", this field indicates the number of copies of a Variant sequence. If information about each copy of a CNV is available, that information will be captured in the 'Copy_Number' table (1:M relationship with the 'Variant_Type' table). This information may not be provided in a published dataset. If this information is available, the value in this field willl typically be an absolute copy number state.
Primary key	No
Refers to
Data type	INTEGER
Domain

Constraints:

Not null	No
Check
Default

Attribute: Copy_Number_ID

Attribute details:

Entity name	Copy_Number
Description	This uniquely identifies one Copy of a Sample Level Copy Number Variant (CNV) in the DGV.
Primary key	Yes
Refers to
Data type	SERIAL
Domain

Constraints:

Not null	Yes
Check
Default

Attribute: Copy_Start

Attribute details:

Entity name	Copy_Number
Description	This is the start coordinate for the Copy.
Primary key	No
Refers to
Data type	CHARACTER VARYING
Domain

Constraints:

Not null	No
Check
Default

Attribute: Count

Attribute details:

Entity name	Variant_Type
Description	This is a value related to the Frequency of a Variant. This value is captured during DGVa/dbVar archiving and is described by DGVa/dbVar as: "Number of samples that reported the structural variant".
Primary key	No
Refers to
Data type	INTEGER
Domain

Constraints:

Not null	No
Check
Default

Attribute: Curation_Comments

Attribute details:

Entity name	Study
Description	This field will capture a summary of DGV curation applied to a given Study. This information will be a summary of the DGV Filtering and Merging process. This information will be made available on the Downloads page of the DGV website under the links to the original data ('Comments' field of the Studies' listing).
Primary key	No
Refers to
Data type	CHARACTER VARYING
Domain

Constraints:

Not null	No
Check
Default