Advanced Report Columns¶

Table view columns in the Variant listing of the Advanced report are broadly categorized by the following variant attribute categories:

Default
Diagnosis (DIAG)
Clinical Genomic Database (CGD)
Variant Effect Predictor (VEP)
Gene
CASEs
Comments (COMM)
EuroGeneTest
Gene Ontology (GO)
Genotype (GT)
Known
OMIM
Mother/Father
Other

These categories represent annotations derived from external databases, data quality scores calculated internally for the variant, study- and/or project-specific information, and reference information. Columns can be customized to show, hide, or order selected attributes (see Filtering Results).

Categories and columns are described in the tables below:

Default columns¶

Default columns display automatically when a tab is opened. These columns provide basic sequence variant call information, as well as clinically relevant annotations, links to external reference sources, and quality measures.

Default columns¶
Category	Column	Description
Default	het or hom	Heterozygous or homozygous variant at the designated position
	Sequence Variant	Condensed summary description of the variant combining data from other columns including variant location, gene, reference variant, called variant, coding sequence and protein position, and associated disease information (see also Sequence Variant)
	CGD CONDITION	Conditions resulting from variations in the same gene that may otherwise be placed in the “General” intervention category
	CGD INHERITANCE	Patterns of inheritance, including: AD - Autosomnal dominant AR - Autosomnal recessive BG - Blood group Digenic - A condition resulting from simultaneous mutations in different genes Maternal - Maternal mitochondrial inheritance XL - X-linked (because X-linked conditions can frequently have manifestations in both genetic sexes, X-linked conditions are not designated as dominant or recessive)
	VEP Max Impact	The classification of the variant based on the severity of the consequence type on the transcript (“HIGH”, “MODERATE”, “LOW”, “LOWEST”); for a given variant, the maximum observed impact across the predicted transcripts from this position is reported (see also Variant Effect Predictor (VEP) Settings)
	VEP Max Consequence	The consequence type reported for this variant that has the greatest biological impact; for a given variant, the maximum observed consequence across the predicted transcripts from this position is reported (see also Variant Effect Predictor (VEP) Settings)
	VEP Max Af	Maximum reported population allele frequency derived from the 1000 Genomes Project phase 3 (1000G3), Exome Variant Server (EVS), Exome Aggregation Consortium (ExAC), Genomes of the Netherlands (GONL), and Kyoto population surveys (see also popAlleleFreq)
	VEP Max Score	Maximum score for the variant as observed in dbNSFP (Score=max of (1-Sift_score), Polyphen2_HDIV_score, or Polyphen2_HVAR_score)
	Gene cov	Fraction of genes (exome) with depths of less than 5 (red), ranging from 5-9 (orange), and greater than 9 (green); note that the scale ranges from 0 to 50% to make the red/orange bars longer, emphasizing low coverages
	GT CallRatio	Proportion of reads containing the variant call (expect a value of approximately 0.5 for heterozygous calls and approximately 1 for homozygous calls)
	GT Depth	Number of reads used for evaluating the corresponding call
	KNOWN Gene diseases	Diseases known to be associated with variants in the gene, as annotated by ClinVar, HGMD, and OMIM
	Known pmid	Pubmed ID of the reference from which the information was obtained
	KNOWN var diseases	Diseases known to be associated with the variant as annotated by ClinVar, HGMD, and OMIM
	OMIM IDs	Online Mendelian Inheritance in Man (OMIM) IDs
	rsIDs	Reference SNP identifiers assigned by NCBI to a group of SNPs that map to an identical location

Diagnostic (DIAG) columns¶

DIAG columns provide diagnositic information for categorizing variants in a clinical setting, based on American College of Medical Genetics and Genomics (ACMG) guidelines. Note that “diagnosis” is abbreviated as “dx” in some cases.

Diagnostic (DIAG) columns¶
Category	Column	Description
Diagnostic (DIAG)	HomRecess	“Homozygous recessive variant” is reported when both alleles present at the genomic position are identical and none of the controls are homozygous at that position
	CHZ	Compound heterozygous (CHZ) variant (see GENE_CHZinGene) where none of the controls (including parents if non-affected) are homozygous and the cases might also potentially be CHZ. Analysis of the index case takes into account the variant phase if the parents are also provided; however, if the parents are not provided, the phase of the variants is assumed to be different. Compound heterozygosity in the same gene in controls does not prevent DIAG_CHZ from being true because without phased variants, the CHZ status is very sensitive (i.e., only two variants in the gene are required). Below is an example how DIAG_CHZ is calculated when both parents are unaffected: calc DIAG_CHZ = IF ((GENE_CHZinGene = 1) and (CASEs_subjCompHeterInGene >= 1-caseDelta )and (CTRLs_withHomoVar = ctrlDelta) and (FATHER_withHomoVar = 0) and (MOTHER_withHomoVar = 0) , ‘true’, ‘false’) The variables CaseDelta and CtrlDelta can be used to relax the condition, i.e., how many of the cases match and how many of the controls are in disagreement with the observations in the index case.
	Dominant	This field is “true” if the variant is not present in unaffected individuals but is present in affected individuals; for example, with unaffected parents, dominance is calculated as follows: calc DIAG_Dominant = IF((CASEs_withVar >= 1-caseDelta) and (CTRLs_withVar = ctrlDelta ) and (FATHER_withVar = 0) and (MOTHER_withVar = 0) , ‘true’, ‘false’) The variables CaseDelta and CtrlDelta can be used to relax the condition, i.e., how many of the cases match and how many of the controls are in disagreement with the observations in the index case. In cases in which there are no additional affected individuals (cases), novel variants classified as dominant.
	recessiveCat	Recessive and compound heterozygosity, based on the ACMG category for two alleles from any two variants observed in a gene, or from a homozygous variant. For an allele combination in the index case to be considered a potential recessive or compound heterozygous candidate, it may not be present in any of the controls (unless ctrlDelta is greater than zero). If the alleles in the index can be phased, they are considered only if they are not known to be from the same parent. In controls, the phase of alleles is assumed to be different (see recessiveCat)
	otherPos	The base pair position of other variants that form compound heterozygosity with a variant
	otherACMGcat	The ACMG category of other variants (see DIAG_otherPos and DIAG_compPhases) that form compound heterozygosity with the variant
	model	Each variant that lies in chromosomes 1-22 is analyzed as “Autosomal”; variants on the X chromosome are analyzed as “X-linked”
	rank recessive	An empirical ranking of variants; low-ranking variants should be better candidates for recessive variants than those with a high ranking; ranking takes into consideration homozygosity and compound heterozygosity in the affected vs. unaffected individuals; the smaller the DIAG_recessiveCat value (e.g., ‘1:DxConsistent’, ‘2:DxLikely’, ‘3:DxPossible’, etc.), the larger the weight of the variant
	rank dominant	An empirical ranking of variants; low-ranking variants should be better candidates for dominant variants than those with a high ranking; ranking takes into consideration how many affected individuals share the same variant or any variants in the same gene, and how few controls have the variant or any variant in the same gene (see also DIAG_Dominant)
	deNovo	A Boolean column indicating whether a variant is de novo (i.e., not present in the parents)
	compPhases	The phase of the variant and the other variant in the gene (see DIAG_otherPOS and DIAG_otherACMGcat); used when the index case is analyzed for compound heterzygosity Example values include: “F,M” - The variant is inherited from the father and the other variant is inherited from the mother “U” - The phase of the variant is unknown “F/M” - The variant is homozygous All variants within a gene are filtered according to the following condition: where hetORhom = ‘hom’ and pos = posx or pos != posx and (GT_phasex = ‘F/M’ or GT_Phasex = ‘U’ or GT_Phase != GT_Phasex) Here pos represents the position of the first variant, and posx represents the position of the second variant. Likewise for GT_Phase and GT_Phasex.

Clinical Genomic Database (CGD) columns¶

CGD columns provide information that is based on the manually curated database of variants associated with known medically significant conditions and available interventions. For more information, visit https://research.nhgri.nih.gov/CGD/.

Clinical Genomic Database (CGD) columns¶
Category	Column	Description
Clinical Genomic Database (CGD)	AGE GROUP	Pediatric: less than 18 years of age; Adult: at least 18 years of age
	COMMENTS	Conditions also resulting from mutations in the same gene but may otherwise be placed in the “General” intervention category
	INHERITANCE	The pattern of inheritance the variant is known to follow: “AD” - Autosomal dominant “AR” - Autosomal recessive “BG” - Blood group “Digenic” - A condition resulting from simultaneous mutations in different genes “Maternal” - Maternal mitochondrial inheritance “XL” - X-linked (because X-linked conditions can frequently have manifestations in both genetic sexes, X-linked conditions are not designated as dominant or recessive)
	INTERVENTION CATEG or IES	Includes organ systems for which specific and additional inteventions may be beneficial
	INTERVENTION RATIONALE	A description of the intervention and its benefit
	MANIFESTATION CATEG or IES	Includes organ systems affected by mutations in corresponding genes; recognition of involved organ systems may help guide supportive care

Variant Effect Predictor (VEP) columns¶

VEP columns provide functional annotation for variants based on the Ensembl SNP Effect Predictor database. For more information, visit http://www.ensembl.org/info/docs/tools/vep/index.html.

Variant Effect Predictor (VEP) columns¶
Category	Column	Description
Variant Effect Predictor (VEP)	Max Impact	The classification of the variant based on the severity of the consequence type on the transcript (“HIGH”, “MODERATE”, “LOW”, “LOWEST”); for a given variant, the maximum observed impact across the predicted transcripts from this position is reported (see also Variant Effect Predictor (VEP) Settings)
	Max consequence	The consequence type reported for this variant that has the greatest biological impact; for a given variant, the maximum observed consequence across the predicted transcripts from this position is reported (see also Variant Effect Predictor (VEP) Settings)
	Max Af	The maximum reported population allele frequency, derived from the 1000 Genomes Project phase 3 (1000G3), Exome Variant Server (EVS), Exome Aggregation Consortium (ExAC), Genomes of the Netherlands (GONL), and Kyoto population surveys (see also popAlleleFreq)
	Consequence	Annotation of the consequence type in the canonical transcript of the annotated variant
	Max Score	The maximum score for the variant, as observed in dbNSFP (Score=max of (1-Sift_score), Polyphen2_HDIV_score, or Polyphen2_HVAR_score)
	Amino Acids	Protein sequence change(s) provided for variants affecting the protein-coding sequence; a value is given for each transcipt
	Protein Position	The position of the amino acid in the protein sequence (only if the variant falls within a coding sequence); a value is given for each corresponding transcipt specified in the CDS position column
	Refgene	Accession number from NCBI for the Refseq transcript using lookup into Ensembl 73 using feature
	CDS position	The position of the base pair in the coding sequence; a value is given for each transcript
	Transcript count	The number of different transcripts in which the variant is found

Gene columns¶

Gene columns provide information based on the candidate gene in which a variant is found. Where possible, the HUGO Gene Nomenclature Committee (HGNC) gene symbol is provided (for more information, visit https://www.genenames.org).

Gene columns¶
Category	Column	Description
Gene	symbol	The gene identified by viewing the variant in Ensembl; output is the HGNC gene symbol for the identified gene (clone name provided when HGNC symbol is unavailable)
	Aliases	Gene aliases, including the main symbol
	avg depth	The average sequence read depth in the exome for a given gene
	candidate paralogs	Paralogs of a gene that is included in the Candidate genes list
	candidate Phenotypes	Phenotypes and gene lists that match the variant gene as a paralog (orange) or a candidate gene (green)
	cand or paralog	Value is “c” if the gene is a candidate gene and “p” if it is a paralog of one of the candidate genes
	Cand PhenoScore	The phenotype significance score over candidate genes
	DmaxAf	The maximum allowed allele frequency for a dominant variant
	exomeSize	The sum of the base pair size of all of the exons in the gene
	exontype	Coding or non-coding region
	homozVarsInGene	The number of homozygous variants the index case has in a given gene
	knownVarsInGene	The number of known (e.g., in HGMD) variants the index case has in a given gene
	lt10	Fraction of the exome with sequence read coverage less than 10
	lt15	Fraction of the exome with sequence read coverage less than 15
	lt20	Fraction of the exome with sequence read coverage less than 20
	lt25	Fraction of the exome with sequence read coverage less than 25
	lt30	Fraction of the exome with sequence read coverage less than 30
	lt5	Fraction of the exome with sequence read coverage less than 5
	MOI	Mode of inheritance determined for the gene based on the alleles present
	Paralogs	The paralog (gene duplication) of a given gene
	RmaxAf	The maximum allowed allele frequency for a recessive variant
	RmaxGf	The maximum allowed genotype frequency for a compound heterozygous variant
	varsInGene	The total number of variants in a gene for the index case
	cov	Fraction of genes (exome) with depths of less than 5 (red), ranging from 5-9 (orange), and greater than 9 (green), scaled from 0 to 50% of the actual coverage to emphasize lower-coverage regions (red/orange)

CASEs columns¶

CASEs columns provide variant counts for affected individuals in a project.

CASEs columns¶
Category	Column	Description
CASEs	homozVarsInGene	Equal to 1 if subjects in the affected case group have a homozygous variant in the given gene
	knownVarsInGene	The number of known variants (e.g., those listed in HGMD) that the affected case group has in a given gene
	subjCompHeterInGene	Equal to 1 if the affected case group is potentially compound heterozygous for a given gene, otherwise it is equal to 0; based on a loose definition for compound heterozygosity, i.e., it requires only the case to have a single homozygous variant or two heterozygous variants in a given gene
	subjWithVarInGene	Equal to 1 if the affected case group has a variant in the given gene, otherwise it is equal to 0
	withHomoVar	Equal to 1 if the affected case group is homozygous for a given variant
	withVar	Equal to 1 if the affected case group has the variant, otherwise it is equal to 0

Comments (COMM) columns¶

COMM columns provide variant annotations that have been added by users in the project.

Comments (COMM) columns¶
Category	Column	Description
Comments (COMM)	Approval months	Counter for months since the date of last approved clinical significance
	Approved clinical significance	The approved clinical significance by the user
	BAM confirmed	User confirmation that the BAM file was reviewed to confirm the presence of the variant
	Clinical Significance	The clinical significance of the variant as annotated (commented) by users (e.g., pathogenic, benign, unknown significance, drug-response, risk factor, etc.)
	Created at	The date and time that the comment was created
	Disease	User-added disease with which the variant is associated
	Gene symbol	The gene symbol from the varcomments table
	HGVScp	User-added Human Genome Variant Society (HGVS), transcript, cDNA nomenclature, and protein nomenclature; can be populated by automatically selecting a transcript in the COMM_transcript_raw column and editing the field as needed
	ID	Comment ID
	Internal comment	User-added internal comment on a variant
	Interpretation	User-added interpretation of a variant
	Is active	Boolean value: 1=active
	Last approved at	The date that the variant was last approved
	Mode of inheritance	User-annotated (commented) mode of inheritance of the variant
	Onset	User-curated onset of the disease with which the variant is associated
	Orthogonal sequencing status	User-added orthogonal sequencing confirmation status
	OTHER active annotations	Annotations on this variant exist in a separate study
	OTHER allele	User-added other allele with which this variant is in compound
	OTHER clinical significance	Clinical significance of the variant available from another study
	OTHER interpretation	Interpretation of the variant available from another study
	OTHER last approved at	The date of the last approved and current status of a variant available from another study
	OTHER variant review status	Variant review status of a variant available from another study
	Owner	The user who entered the curation
	Parental origin	User-selected parental origin of the variant (e.g., father, mother, both, de novo, unknown, etc.)
	PN	Patient number
	Reported at	The date of the variant annotation
	Reporting category	User-selected category, related to client workflow
	Report section	User-selected report section where the selected variant should be placed
	Report status	Related to client workflow to indicate what to report
	Report type	Related to client workflow to indicate the type of report
	Severity	User-added severity of the disease with which the variant is associated
	Starred	Indicates whether the variant has been starred (flagged) by a user
	Text	The description (comment) component for the user’s annotation of the variant
	Transcript raw	All possible transcripts with HGVS nomenclature for the variant for user to select.
	User id	Internal CSA Web user ID
	Variant review status	User-added variant review status

EuroGeneTest columns¶

EuroGenetest columns are derived from a European Commission project database that contains European genetic testing information for particular genes, variants, and diseases. For more information, visit http://www.eurogentest.org/index.php?id=160.

EuroGenetest columns¶
Category	Column	Description
EuroGenetest	Diseases	Diseases associated with a variant
	EuroGenetest NoOfDiseases	The number of diseases associated with a variant
	EuroGenetest NoOfpanels	The number of gene panels associated with a variant
	EuroGenetest panels	EuroGentest gene panels associated with a variant

Gene Ontology (GO) columns¶

GO columns provide a functional annotation of the gene product in which the variant is found. For more information, visit http://geneontology.org.

Gene Ontology (GO) columns¶
Category	Column	Description
Gene Ontology (GO)	Descriptions	Gene ontology category descriptions
	IDs	Gene ontology identifier codes

Geneotype (GT) columns¶

GT columns provide quality control information for the variant call, based on the sequence read depth and quality. These scores are based on the Genome Analysis Toolkit (GATK) measures.

Genotype (GT) columns¶
Category	Column	Description
Genotype (GT)	CallCopies	Variations from the reference are reported by default; CallCopies refers to how many copies of the variation exist in a subject; a CallCopies value of “2” therefore corresponds to a homozygous variant, whereas a CallCopies value of “1” corresponds to a heterozygous variation
	CallRatio	Proportion of reads containing the variant call; this number is expected to be close to 0.5 for heterozygous calls and close to 1 for homozygous calls
	Depth	The number of reads used for evaluating the corresponding call
	FILTER	Quality parameter that uses the ratio between the genotype quality and depth, showing whether the call is considered “LowQual” quality (not useable) or “PASS”; this remains a crude quality measure
	GL Call	A statistical measure indicating the likelihood that a call is wrong; the scale has been converted to using only integers - the higher the number, the less likely that the call is wrong
	IHEstatus	Inheritance error status (1 = error, 0 = ok)

Known columns¶

Known columns provide publicly available information about the candidate gene and/or variant based on annotations listed in the HGMD, ClinVar, and OMIM databases.

Known columns¶
Category	Column	Description
Known	ClinVarAcc	ClinVar accession(s)
	distance	The distance between a known variant and the identified variant
	GeneLists	The gene list where the gene in which the variant is found is a member
	Gene diseases	Diseases known to be associated with variants in the gene
	Gene par diseases	Diseases known to be associated with the paralog genes of the given gene
	HGMDacc	The HGMD accession of the known variant
	InACMG	A Boolean column (1/0) indicating whether the gene is in the ACMG recommended list for incidental findings
	PMID	Pubmed ID of the reference from which the information was obtained
	Refgene	mRNA accession number from NCBI using lookup into Ensembl using feature
	Source	Source database of the known variant
	var diseases	Diseases known to be associated with the variant
	variantType	Variant type in the source database. For HGMD: “DM” - Disease-causing mutations; pathological mutations reported to be disease-causing in the original literature report. The other three tags are used for polymorphisms. “DP” - Disease-associated polymorphisms; reported to be in significant association with disease (p less than 0.05) and are assumed to be functional (e.g., as a consequence of location, evolutionary conservation, replication studies etc), although there may be no direct evidence of function as yet (e.g., from an expression study). “DFP” - Disease-associated polymorphisms with additional supporting functional evidence; reported to be in significant association with disease (p less than 0.05) and to have evidence for being of direct functional importance (e.g., as a consequence of altered expression, mRNA studies etc). “FP” - In vitro/laboratory or in vivo functional polymorphisms; reported to affect the structure, function, or expression of the gene (or gene product), but with no disease association reported as yet.
	varType	The type of variant: “sub” (substitution), “del” (deletion), “ins” (insertion), “indel”, etc.
	Variant review status	The currently selected variant review status value from the variant annotation

Online Mendelian Inheritance in Man (OMIM) columns¶

OMIM columns provide the OMIM-designated identification for a particular gene and related disease description. For more information, visit https://www.ncbi.nlm.nih.gov/omim.

Online Mendelian Inheritance in Man (OMIM) columns¶
Category	Column	Description
OMIM	Descriptions	OMIM disease descriptions for the gene
	IDs	OMIM IDs

MOTHER/FATHER columns¶

MOTHER/FATHER columns provide the variant information of the parents in a project.

MOTHER/FATHER columns¶
Category	Column	Description
MOTHER/FATHER	apprCovDepth	Read depth for the mother/father in the location of the variant
	Call	The variant call of the mother/father; if the mother/father does not have the allele of the index variant, the value is “Var_Absent”
	het or hom	“hom” for homozygosity, “het” for heterozygosity, or “NA” if the value in the Call column is “Var_absent”
	homozVarsInGene	Equal to 1 if the mother/father has a homozygous variant in the given gene
	knownVarsInGene	The number of known variants (e.g., in HGMD) the mother/father has in the given gene
	readsWithVar	Number of reads that carry the variant in the mother/father in the location of the variant
	subjCompHeterInGene	Equal to 1 if the mother/father is potentially compound heterozygous in the given gene, otherwise 0; based on a loose definition of compound heterozygosity, i.e., it requires only the case to have a single homozygous variant or two heterozygous variants in the given gene
	subjWithVarInGene	Equal to 1 if the mother/father has the variant in the given gene, otherwise 0
	withHomoVar	Equal to 1 if the mother/father is homozygous for the given variant, otherwise 0
	withVar	Equal to 1 if the mother/father has the variant, otherwise 0

Other columns¶

Other columns provide information not included in the previously described categories.

Other columns¶
Category	Column	Description
Other	Chrom	Variant chromosome location
	Pos	Nucleotide from the reference build at the base pair position in the Pos column
	Call	Sequence (variant) called based on the reference sequence at the designated position
	ID	Reference SNP identifiers assigned by NCBI to a group of SNPs that map to an identical location
	CLNACC	ClinVar assigned variant accessions and versions
	PN	Patient number (identifier)
	Carrier	A Boolean column (1/0) indicating if the individual carries the variant
	FS	Fisher’s exact test of read strand; if the reference reads are balanced between forward and reverse strands, then the alternate reads should be as well
	formatZip	VCF genotype fields
	HGVSc	The HGVS coding sequence name
	HGVSp	The HGVS protein sequence name
	Biotype	Additional annotation using Ensembl lookup based on feature (e.g., miRNA_PUTATIVE, misc_RNA_PUTATIVE, protein_coding_PUTATIVE, pseudogene_PUTATIVE, snoRNA_PUTATIVE)
	CHROMO	The chromosome of the variant represented as chr1, chr2, …, chr22, chrX, chrY, or chrM
	POSx	The base pair position of a known variant within the same codon of the identified variant
	Source	For trio cases, the participant in the study sharing the same variant
	Sourcex	For trio cases, additional participants in the study sharing the same variant
	comment	User-entered comments for the variant
	starred	A Boolean column (1/0) indicating if the variant has been starred (flagged) for inclusion in the generated report