Understanding the Variant Listing¶

Advanced report findings are displayed in the Variant listing, which provides information about each variant to help determine whether the variant might be causative of the condition being analyzed in the selected study. Findings are organized in a series of tabs, and can be sorted, filtered, and annotated further using a selection of tools and options.

Variant listing in the Advanced report¶

Toggling views¶

The Variant listing opens in Table view by default, but findings on select tabs (e.g., Variants and Missing) can be viewed in either Table view or List view by toggling the Table/List icon at the top of the page. Both views display similar attributes, but Table view columns can be further customized.

To toggle between views, click the Table/List view icon at the top of the page.

Variant listing in Table view¶

Variant listing in List view¶

Table view¶

Because there are too many annotations to fit into a single view, the Default columns displayed in Table view contain only a subset of the total available columns. Columns are broadly categorized by variant attribute categories and can be customized by clicking the Choose columns icon at the top of the page to show or hide selected columns (see also Filtering Results).

Default columns provide basic sequence variant call information, as well as clinically relevant annotations, links to external reference sources, and quality measures. Default columns are described in the table below. For a full list of column descriptions, see Advanced Report Columns.

Default columns in Table view¶
Column	Description
Default
het or hom	Heterozygous or homozygous variant at the designated position
Sequence Variant	Condensed summary description of the variant combining data from other columns including variant location, gene, reference variant, called variant, coding sequence and protein position, and associated disease information (see also Sequence Variant)
CGD CONDITION	Conditions resulting from variations in the same gene that may otherwise be placed in the “General” intervention category
CGD INHERITANCE	Patterns of inheritance, including: AD - Autosomnal dominant AR - Autosomnal recessive BG - Blood group Digenic - A condition resulting from simultaneous mutations in different genes Maternal - Maternal mitochondrial inheritance XL - X-linked (because X-linked conditions can frequently have manifestations in both genetic sexes, X-linked conditions are not designated as dominant or recessive)
VEP Max Impact	The classification of the variant based on the severity of the consequence type on the transcript (“HIGH”, “MODERATE”, “LOW”, “LOWEST”); for a given variant, the maximum observed impact across the predicted transcripts from this position is reported (see also Variant Effect Predictor (VEP) Settings)
VEP Max Consequence	The consequence type reported for this variant that has the greatest biological impact; for a given variant, the maximum observed consequence across the predicted transcripts from this position is reported (see also Variant Effect Predictor (VEP) Settings)
VEP Max Af	Maximum reported population allele frequency derived from the 1000 Genomes Project phase 3 (1000G3), Exome Variant Server (EVS), Exome Aggregation Consortium (ExAC), Genomes of the Netherlands (GONL), and Kyoto population surveys (see also popAlleleFreq)
VEP Max Score	Maximum score for the variant as observed in dbNSFP (Score=max of (1-Sift_score), Polyphen2_HDIV_score, or Polyphen2_HVAR_score)
Gene cov	Fraction of genes (exome) with depths <5 (red), ranging from 5-9 (orange), and greater than 9 (green); note that the scale ranges from 0 to 50% to make the red/orange bars longer, emphasizing low coverages
GT CallRatio	Proportion of reads containing the variant call (expect a value of approximately 0.5 for heterozygous calls and approximately 1 for homozygous calls)
GT Depth	Number of reads used for evaluating the corresponding call
KNOWN Gene diseases	Diseases known to be associated with variants in the gene, as annotated by ClinVar, HGMD, and OMIM
Known pmid	Pubmed ID of the reference from which the information was obtained
KNOWN var diseases	Diseases known to be associated with the variant as annotated by ClinVar, HGMD, and OMIM
OMIM IDs	Online Mendelian Inheritance in Man (OMIM) IDs
rsIDs	Reference SNP identifiers assigned by NCBI to a group of SNPs that map to an identical location

Tip

Clicking the Info icon next to each column header opens a tooltip with the column description.

List view¶

Select tabs (e.g., Variants and Missing) can also be viewed in List view, which provides an abbreviated summary of the variant type along with key annotations. Annotations are numbered for reference in the figure below, followed by a table containing corresponding descriptions.

Variant annotations in List view¶
Figure reference	Annotation	Description
1	Sequence Variant	Variant location, reference variant, called variant, gene, coding sequence, protein position, and protein effect information (see Sequence Variant below)
2	Inheritance Error	If the identified variant does not appear to be transmitted in a Mendelian fashion based on the parental genotypes, the variant is flagged as an Inheritance error. Following the error message, the coverage (cov) is reported for the index, father, and mother to describe the read coverage in each participant. An inheritance error message can be caused by a number of factors, for example: The variant may be de novo The variant may be a false positive The variant may be a false negative variant call in one or more family members due to lack of coverage or poor alignment
3	VEP impact (VEP max impact)	Classification of the variant based on the severity of the consequence type on the transcript (“HIGH”, “MODERATE”, “LOW”, “LOWEST”); for a given variant, the maximum observed impact across the predicted transcripts from this position is reported (see also Variant Effect Predictor (VEP) Settings)
	AF (VEP max AF)	Maximum reported population allele frequency derived from the 1000 Genomes Project phase 3 (1000G3), Exome Variant Server (EVS), Exome Aggregation Consortium (ExAC), Genomes of the Netherlands (GONL), and Kyoto population surveys (see also popAlleleFreq)
	Score (VEP max score)	Each missense variant is assigned a score for the predicted impact of the amino acid change on protein function: maximum score for the variant as observed in dbNSFP (Score=max of (1-Sift_score), Polyphen2_HDIV_score, or Polyphen2_HVAR_score); scores run from 0 - 1; for other variant types, the VEP score is either displayed as 0.0 or omitted
	rsIDs	Reference SNP identifiers assigned by NCBI to a group of SNPs that map to an identical location
4	Inheritance	Flags that indicate which inheritance model(s) the variant fits: Dominant, Recessive, or De Novo
5	Candidate genes	The Candidate genes ideogram displays color-coded boxes that correspond to the user-defined phenotypes, and that highlight which phenotypes or genelists match the candidate gene or a paralog of the candidate gene. Clicking the ideogram opens a pop-up window that lists the phenotypes in the order in which they appear in the ideogram. Color-codes are as follows: Green - The gene is annotated to the phenotype class Orange - The gene is annotated to a paralogy of a gene in the phenotype class
6	Paralogs in candidate genes	Known paralogs of the variant-containing gene also found in the candidate gene list; if the variant is in a gene that is a paralog of one or more Candidate genes, the candidate gene(s) are listed.
7	Variant disease	If the variant is known to be pathogenic, this field lists the diseases known to be caused by the variant; annotations are derived from HGMD, ClinVar, and OMIM databases
8	OMIM and/or PubMed IDs	OMIM and/or PubMed IDs associated with the variant
9	Gene diseases	If the variant gene is known to be pathogenic, this field lists the diseases known to be caused by the gene; annotations are derived from HGMD, ClinVar, and OMIM databases
10	Known source
11	Known variant type
12	Coverage	The gene coverage ideogram provides a graphical representation of the degree of read coverage across the gene that contains the variant. Coverage is represented as follows: Green - The proportion of the gene covered by at least 10 reads Orange - The proportion of the gene covered by 5-9 reads Red - The proportion of reads covered by fewer than 5 reads For emphasis, orange and red segments are displayed at double their proportionate size. In cases where <50% of the gene is covered by 10 or more reads, no green is displayed. The coverage calculation can be based on either the coding sequence or the entire gene including the 5’ and 3’ UTR, and can be based on either Ensembl or Refseq transcripts (these options are selected the Advanced report settings tab of the Study Overview).
13	recessiveCat	recessiveCat is a system for assessing recessive and compound heterozygous variants, based on the ACMG category for the two alleles (see recessiveCat below).

Note

If no annotation is available for a particular field, the field is omitted from the Variant listing.

Sequence Variant and recessiveCat annotations are described in more detail below.

Sequence Variant¶

The Sequence Variant annotation provides positional, genotype, and protein effect information about the selected variant. Listed first is the variant’s genomic position based on the current reference assembly, followed by the gene symbol for the gene containing the variant.

Next, a description of the cDNA and protein position of the variant is given using a notation similar to the HGVS notation. In instances where a variant may affect one or more annotated transcripts and occur in different positions in the transcript, the position is given in parentheses followed by the DNA or protein change (e.g., “”c.(145,850)_C>T p.(284,49)_E>K,”). The DNA or protein change is given as “reference>sample”, and the altered nucleic acid sequence of the variant will follow the reference sequence. If the variant affects codons within a transcript, the position of the protein amino acid is also provided for the corresponding transcript listed. cDNA and HGVS notations are not used in the case of a splice variant.

Following the cDNA and protein position are the ACMG variant categorization (e.g., Cat1, Cat2), the zygosity and type of variant (e.g. het_sub, hom_ins, het_del), and the maximal VEP consequence (e.g., missense, splice_donor_variant).

Sequence Variant field descriptions¶
Field header	Description
Chrom	Variant chromosome location
Pos	Variant start bp position
GENE Symbol	HGNC gene symbol for gene identified (clone name provided when HGNC unavailable)
CDS position	Position of the base pair in the coding sequence; a value is given for each transcipt
Ref	Reference allele in VCF format at the designated position
Call	Sequence (variant) called based on the reference sequence at the designated position
Protein Position	Position of the amino acid in the protein sequence (only if the variant falls within a coding sequence); a value is given for each corresponding transcipt specified in the CDS position field
ACMGcat	Categorization of the sequence variants according to the ACMG scheme
Nucleotide Change	Description of the nucleotide change in the Human Genome Variation Society (HGVS) nomenclature (e.g., deletion (del), substitution (sub), insertion (ins))
Consequence	Impact of the variation on the sequence as calculated per transcript by NCBI (e.g., frameshift variant, truncating variant, missense variant, splice site variant)

recessiveCat¶

To assess multiple variants in a single gene, Genuity Science has adapted the approach taken by Kingsmore, et al., 2011¹.

When a gene has two or more variants, each variant is given a category classification using the combined analysis of the variants in the gene.

The recessive and compound heterozygous category is based on the ACMG category for two alleles from any two variants observed in a gene or from a homozygous variant. For an allele combination in the index to be considered a potential recessive or compound heterozygous candidate, it may not be present in any of the controls (unless ctrlDelta is greater than zero). If the alleles in the index can be phased, they are considered only if they are not known to be from the same parent. In controls, the phase of alleles is assumed to be different.

1=DxConsistent, 2=DxLikely, 3=DxPossible, 4=DxLessLikely, 5=DxIndeterminate, 6=DxUnlikely, 7=DxNegative¶
Allele 1/Allele 2	Cat1	Cat2	Cat3a	Cat3b	Cat4
Cat1	1	1	2	3	5
Cat2	1	1	2	3	5
Cat3a	2	2	3	3	6
Cat3b	3	3	3	4	6
Cat4	5	5	6	6	7

“Consistent”, “Likely”, and “Unlikely” classifications are derived from the ACMG guidelines for variant categorization. In cases where, for example, one variant is known to be pathogenic (Cat1), and a variant in the second allele is a truncating variant (Cat2), the combined variants are considered “Consistent” with diagnosis. For a Cat2 variant and a Cat3a variant, the combined variant genotype would be classified as “DxLikely” to cause the phenotype. Two Cat1 variants would score the highest with a “DxConsistent” classification.

References

Kingsmore SF, Dinwiddie DL, Miller NA, Soden SE, Saunders CJ, The Children’s Mercy Genomic Medicine Team. Adopting orphans: comprehensive genetic testing of Mendelian diseases of childhood by next-generation sequencing. Expert review of molecular diagnostics. 2011;11(8):855-868. doi:10.1586/erm.11.70.

Starred variants¶

“Starring” a variant makes it easily retrievable for later review without the need for a comment.

To star a variant, click the Star icon in the leftmost column of a selected row in the Variant listing.

clinicalWorkflow/images/variantListing/VariantListing_3.png

When a variant is starred, the star icon turns orange. To “unstar” the variant, click the Star icon again. Starred variants are also indicated in the Starred column of the Variant listing (see Advanced Report Columns).

Approved variants in other studies¶

If a variant has been annotated with the Variant review status “Approved and current” in another study in the project, a green circle appears below the Info icon in the leftmost column of the Variant listing.

Viewing study information¶

Clicking Info at the top of the page opens the Study information pop-up window.

Saving the Variant listing¶

You can save the Variant listing by doing any of the following:

Click Open query in Sequence Miner at the top of the page. This loads the same set of variants and annotations in Sequence Miner. From Sequence Miner, you can cut and paste table rows into an application on your computer (limited to 10,000 rows).
Save the Variant listing to the user_data directory in the Sequence Miner File Explorer, and then copy and paste selected rows into an application on your computer.
Generate a tab-delimited file by adding .gor to the URL of the CSA page.