Skip to content

hbctraining/variant_analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Variant Analysis Workshop

Audience Computational skills required Duration
Biologists Introduction to the Command-line 3 - 2.5 hour sessions

Description

This Introduction to Variant Analysis workshop is aimed at providing best practices for calling variants for paired normal/tumor datasets. Importantly, while this workshop series does focus on calling variants in the context of paired tumor/normal samples, much of this workshop's pipeline and discussion is adaptable to other types of variant calling applications. This workshop series should provide participants with an ability to take raw sequence reads and process them into a VCF file with annotated variants. Furthermore, the workshop series ends with a tutorial for visualizing called variants within the Integrative Genomics Viewer (IGV).

These materials were developed for a trainer-led workshop, but are also amenable to self-guided learning.

Pre-Requisite

This workshop series is designed for users with a background in the fundamentals of working in the shell environment on an high-performance computing cluster (HPCC). If you don't have this background or are unsure if you are ready for this workshop series, you should first complete our Introduction to the Command Line workshop series.

Learning Objectives

  • Evaluate QC metrics for variant calling
  • Call variants using GATK
  • Filter variants to retain only high-quality variant calls
  • Annotate variants using SnpEff and dbSNP
  • Prioritize variants by their impact
  • Visualize variants in IGV
  • Implement cBioPortal to analyze variants

Lessons

Installation Requirements

All:

Mac users: No additional installation requirements.

Windows users: GitBash


These materials have been developed by members of the teaching team at the Harvard Chan Bioinformatics Core (HBC). These are open access materials distributed under the terms of the Creative Commons Attribution license (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.