Cancer Genomics: NGS (Whole Exome Sequencing) Variant Calling Using Linux

Categories: Cancer Genomics, Linux, NGS, WES
Wishlist Share
Share Course
Page Link
Share On Social Media

About Course

Cancer genomics is a rapidly evolving field that utilizes next-generation sequencing (NGS) technologies to better understand the genetic basis of cancer. One of the most powerful tools in cancer genomics is whole exome sequencing, which enables the sequencing of all protein-coding genes in a genome. However, analyzing the massive amounts of data generated by NGS requires sophisticated computational tools, such as the Linux operating system, which provides a highly flexible and customizable environment for bioinformatics analysis.

BioCode is offering Cancer Genomics: NGS (Whole Exome Sequencing) Variant Calling Using Linux course which is designed to provide students with the knowledge and skills necessary to perform whole exome sequencing (WES) variant calling using Linux. Whole exome sequencing is a powerful technique that can be used to identify genetic mutations and variations that contribute to the development and progression of cancer. Variant calling is an essential step in the analysis of WES data, which involves identifying genetic variants that are different from the reference genome. The course will consist of both lectures and hands-on practical sessions. The course will cover the entire process of NGS variant calling, from initial data pre-processing and alignment to the identification and annotation of genetic variants. 

This course is designed to provide a comprehensive understanding of cancer genomics and how NGS  technology is used to identify cancer-related genetic mutations. The course is primarily focused on the analysis of cancer datasets using the Linux command line, which is an essential tool in the field of bioinformatics. Students will have the opportunity to work with real-world cancer genomics datasets and gain a deep understanding of the bioinformatics pipelines used in cancer genomics research.

This course is divided into three categories covering an in-depth introduction to NGS and variant calling, hands-on whole-exome variant calling, and additional downstream analysis. Students will gain hands-on experience using bioinformatics tools such as SAMtools and GATK  to perform variant calling on NGS data. They will also learn how to visualize and interpret the results of their analyses using popular tools such as IGV and UCSC Genome Browser. No prior knowledge or experience to enroll in this course.

Throughout the course, students will learn how to analyze NGS data for identifying genetic mutations and variations, including substitutions (SNPs) and insertions/deletions (indels) in a raw dataset. Students will learn how to obtain raw whole exome datasets for any organism or disease, perform quality control, trimming, mapping against a reference genome, post-alignment quality control, recalibration of the reads, and finally, how to identify variants in the disease/patient samples to obtain VCF files. Students will also learn how to use various tools to filter and annotate these VCF files, ensuring the accuracy and reliability of their results in the context of cancer genomics research. The course will focus on the identification and analysis of genetic mutations and variations specific to cancer, including somatic mutations and driver mutations.

By the end of this course, students will gain hands-on experience working with real NGS data, specifically in the context of cancer genomics research. They will learn how to troubleshoot common issues and optimize their analysis pipeline for different types of cancer datasets. Students will have a thorough understanding of NGS and whole exome data analysis and will be well-equipped to apply their skills to cancer genomics research and other related fields.

This course will include the following sections:

Section 1: In-depth Introduction to NGS and Whole-Exome Sequencing for Cancer Genomics

Description:  Students will be introduced to the fundamentals of NGS, variant calling, and the role of whole-exome sequencing in cancer research. Students will learn about haploid vs. diploid organisms, ploidy and its significance in disease research, and germline vs. somatic mutations. Students will also be introduced to single nucleotide polymorphisms (SNPs) and their types, as well as large-scale variations in genomes and the role of copy number variations (CNVs) in diseases.

Learning Outcomes:  Upon completion of this section, students will be able to:

  1. Describe Next-Generation Sequencing and Variant Calling.
  2. Explain the Fundamentals of Whole Exome Sequencing.
  3. Discuss Cancer Genomics and the Role of WES in Cancer Research.
  4. Explain Haploid vs. Diploid Organisms for Variant Calling.
  5. Describe Ploidy and Its Significance in Disease Research.
  6. Describe a Bioinformatics Approach Towards Germline vs. Somatic Mutations.
  7. Explain Single Nucleotide Polymorphisms and Their Types.
  8. Discuss Structural Variations.
  9. Explain CNVs and the role of CNVs in Diseases.

 

Section 2: Hands-on Whole-Exome Variant Calling

Description: Students will gain hands-on experience with the GATK pipeline for variant calling. They will learn how to retrieve raw whole-exome cancer data, perform quality control, and filter low-quality reads. Students will also learn how to map reads against a reference genome, using multiple tools such as Freebayes, and Mutect2  for variant calling, and filter out low-quality variants using VCFtools. Finally, students will learn how to perform variant effect prediction using various tools such as VEP to identify novel pathogenic variants. Students will also learn how to identify and analyze genetic mutations and variations specific to cancer. They will learn how to identify germline and somatic variants involved in the diagnosis and prognosis of cancer.

Learning Outcomes:  Upon completion of this section, students will be able to:

  1. Explain GATK Pipeline for Variant Calling.
  2. Retrieve Raw WES Cancer Data.
  3. Perform Quality Control.
  4. Trim Out the Bad Quality Reads.
  5. Perform Mapping of Reads Against a Reference Genome.
  6. Perform Alignment of Trimmed Reads Against a Reference Genome Using BWA/Bowtie2.
  7. Perform Post-Alignment Processing of Aligned (SAM/BAM) Files.
  8. Perform Recalibration and Deduplication of Aligned Reads.
  9. Utilize Freebayes for Variant Calling.
  10. Perform Variant Calling Using Freebayes.
  11. Explain DRAGEN: A Germline Variant Detector.
  12. Identify Germline Variants Using DRAGEN.
  13. Explain Mutect2: A Somatic Variant Caller.
  14. Identify Somatic Variants Using Mutect2.
  15. Explain HaplotypeCaller: A Germline Variant Caller.
  16. Identify Germline Variants Using HaplotypeCaller.
  17. Describe VCFtools: A Toolkit for Variant File Formats.
  18. Perform Variant Calling Using Bcftools.
  19. Perform Variant Recalibration of Raw Variants.
  20. Perform Variant Filtration.
  21. Filter Out the Low-Quality Variants Using SnpSift.
  22. Perform Variant Annotation.
  23. Annotate Variants Using SnpEff for Prediction of Variant Effects.
  24. Utilize Variant Effect Predictor (VEP) for Elucidation of Novel Variants.

 

Section 3: Additional Downstream Analysis

Description: Students will learn about the downstream functional enrichment analysis. They will learn how to perform gene ontology analysis using topGO and enrichR, as well as pathways analysis using KEGG, PANTHER, and Reactome.

Learning Outcomes:  Upon completion of this section, students will be able to:

  1. Explain the Downstream Functional Enrichment Analysis.
  2. Perform Gene Ontology Analysis Using topGO & enrichR
  3. Perform Pathways Analysis Using KEGG, PANTHER, Reactome.

 

Section 4: Additional Lectures

Description: Students will have the access to additional lectures which will help them learn about various databases, and file formats so that they can easily retrieve the required biological data and perform whole-exome analysis.

Learning Outcomes:  Upon completion of this section, students will be able to:

  1. Discuss ArrayExpress.
  2. Describe Gene Expression Omnibus Database.
  3. Explain NCBI Genomes & NCBI Assembly.
  4. Discuss Genome Reference Consortium.
  5. Retrieve an Entire Genome & Retrieval of SARS-CoV-2 Viral Genome.
  6. Perform Genome Assembly Retrieval and Analysis
  7. Explain BED.
  8. Describe GTF/GFF.
  9. Explain SAM/BAM.
Show More

What Will You Learn?

  • In-depth Introduction to NGS
  • Hands-on Analysis of Whole Exome Sequence Data
  • Variant Calling (SNPs/Indels, SVs, CNVs)
  • From Raw Reads to Variant Identification
  • Quality Control
  • Mapping
  • Post-Alignment Processing and Re-calibration
  • Variant Filtration and Annotation
  • Variant Effect Prediction
  • Linux-Based Command-Line Tools
  • Cancer Genomics
  • Somatic Variant Calling
  • Germline Variant Calling

Course Content

In-depth Introduction to NGS and Whole-Exome Sequencing for Cancer Genomics

  • In-depth Introduction to NGS and Variant Calling
    20:12
  • Fundamentals of Whole Exome Sequencing: A Comprehensive Sequencing Approach
    26:16
  • Cancer Genomics: Role of WES in Cancer Research
    22:18
  • Haploid vs. Diploid Organisms for Variant Calling
    07:26
  • Ploidy and Its Significance in Disease Research
    14:58
  • A Bioinformatics Approach Towards Germline vs. Somatic Mutations
    14:30
  • Single Nucleotide Polymorphisms: An Introduction to SNPs and Their Types
    31:22
  • Structural Variations: Large Scale Variations in Genomes
    08:55
  • Copy Number Variations: Role of CNVs in Diseases
    10:29

Hands-on Whole-Exome Variant Calling

Additional Downstream Analysis

Additional Lectures

Earn a certificate

Add this certificate to your resume to demonstrate your skills & increase your chances of getting noticed.

selected template

Student Ratings & Reviews

No Review Yet
No Review Yet

Want to receive push notifications for all major on-site activities?

Select your currency
| BioCode
Hurry up! Sale ends in:
Days
Hours
Minutes
Seconds