Advanced Bioinformatics Scripting in Python, BioPython, R & BioConductor

Wishlist Share
Share Course
Page Link
Share On Social Media

About Course

Writing short scripts, programs and developing softwares for various biological data analysis such as Sequences Alignment and Analysis, Genome Analysis, Proteome Analysis, Phylogenetic Analysis, Biological data visualization, MicroArray gene expression analysis, etc, requires a great deal of understanding of biological programming languages and the knowledge of how to utilize such programming languages to write the scripts.

BioCode is offering an Advanced Bioinformatics Scripting in Python, BioPython, R & BioConductor course so that you’ll learn from the very basics of biological programming in Python, BioPython & R to an advanced level understanding of Bioinformatics Scripting, even if you lack prior knowledge. You will understand various concepts related to how to write programs for MicroArray Gene Expression Analysis, ggplot2 biological data visualization & sequence retrieval, alignment, BLAST database searching & phylogenetic analysis in BioPython. You’ll also be learning complete end-to-end Linux (BASH) for Bioinformatics.

This course is for absolute beginners in bioinformatics scripting and you don’t require any prior knowledge of scripting or even bioinformatics to get started with this course. Everyday Bioinformatics analysis involves the extensive study and analysis of huge biological datasets. Linux and programming languages like Python and R have made it easy to perform analysis on huge biological data sets.

This course will include the following sections:

Section 1: Python

Description: This section will focus on making sure that the students gain an understanding of scripting in Python language and the basic functions that can be used to manipulate biological data.

Learning Outcomes:  Upon completion of this section, students will be able to:

  1. Learn the Importance of Python in Bioinformatics.
  2. Understand Python Programming Language.
  3. Install Python Language.
  4. Discuss Comments in Programming Language.
  5. Perform Basic Input and Output Functions.
  6. Perform Mathematical Operations.
  7. Explain Strings Data Structure.
  8. Explain Dictionaries.
  9. Discuss Lists in Python.
  10. Describe Tuples.
  11. Explain Sets.
  12. Execute If-Else Conditions in Scripts.
  13. Execute While Loop and Perform Biological Data Analysis.
  14. Explain CSV Files.
  15. Read Files.
  16. Write Files.
  17. Consolidate (merge) Multiple DNA and Protein Sequences into one FASTA File.
  18. Describe OS Module.
  19. Explain Functions in Python.
  20. Use the “With” Statement in Python.
  21. Perform Error Handling.
Section 2: BioPython

Description: This section will ensure that the students will learn about the various functions  that help in our biological data analysis in BioPython module provided by Python programming language.

Learning Outcomes:  Upon completion of this section, students will be able to:

  1. Understand the BioPython module.
  2. Install BioPython.
  3. Create a Sequence Object Using Bio.Seq Class.
  4. Explain How a Sequence Object Behaves like a String.
  5. Perform Central Dogma in BioPython.
  6. Import UnknownSeq and MutableSeq Objects from Bio.Seq Class.
  7. Understand the Alphabets of Biology Using Bio.Alphabet Class.
  8. Explain IUPAC Module and Types of Sequence Representation.
  9. Concatenate Multiple Sequence Records Using Generic Alphabets.
  10. Create Sequence Records Using SeqRecord Module.
  11. Utilize the SeqRecord Module to Demonstrate the Representation of FASTA File Within BioPython.
  12. Utilize the SeqRecord Module to Demonstrate the Representation of GenBank File Within BioPython.
  13. Utilize the Formatting Feature of the SeqRecord Module.
  14. Compare and Read Multiple FASTA Files from Directory Using SeqRecord Module in BioPython.
  15. Read a Sequence File Using SeqIO class.
  16. Parse a Sequence File Using SeqIO class.
  17. Parse a Compressed Sequence File and Create a Dictionary of Sequences.
  18. Write Sequences and SeqRecords into Files.
  19. Extract Annotations and Perform Pattern-wise Sequence Data Extraction Using SeqIO module.
  20. Read and Parse a Multiple Sequence Alignment File using AlignIO Module.
  21. Write Alignment and Multiple Sequence Alignment Records using AlignIO Module.
  22. Convert Alignment Formats.
  23. Manipulate Alignments.
  24. Align Multiple Sequences Using the ClustalW Python Wrapper.
  25. Align Two Sequences Using the paiwise2 Function in BioPython.
  26. Read Multiple Sequence Alignment Files of a Particular Format and Map Information of Alignments.
  27. Format Alignments.
  28. Truncate the Specific Regions from the Entire Alignment (Slice Alignments).
  29. Query NCBI BLAST Through Python.
  30. Access ENTREZ Using Python.
  31. Parse the BLAST Results using the Bio.Blast module.
  32. Get the Summary of Accessions Using Esummary Function of Entrez module in BioPython.
  33. Download Complete Records Using EFetch Function.
  34. Use EGQuery Function to do Global Queries for Search Count.
  35. Search for Database Links of Records Using Elinks.
  36. Search the Entrez Database Using ESearch Function.
  37. Use ESpell Function to Get the Correct Spellings for your Search Terms.
  38. Download GenBank and Entrez Records.
  39. Search Taxonomy Database.
  40. Download PubMed Articles.
  41. Read a PDB (3D Structure) File Using Bio.PDB Module.
  42. Calculate the Distance Matrix Between Sequences for Phylogenetic Analysis.
  43. Convert Phylogenetic Tree Data Formats.
  44. Print Out the Phylogenetic Tree in ASCII.
  45. Read Phylogenetic Trees.
  46. Visualize and Manipulate Phylogenetic Trees.
  47. Create a Web Logo of Motifs.
  48. Perform MEME Analysis.
  49. Write Out Phylogenetic Data.
Section 3: R Language

Description: This section will focus on making sure that the students will learn about R language, various biological functions that are performed using R, and how R is used to visualize biological data using the ggplot2 package.

Learning Outcomes:  Upon completion of this section, students will be able to:

  1. Discuss R Language.
  2. Install R.
  3. Explain Comments.
  4. Declare Variables and Objects.
  5. Use Built-in Functions and ARGS.
  6. Explain Samples and Replacement.
  7. Write their own Functions and Arguments.
  8. Create Customized Scripts.
  9. Discuss Packages in R.
  10. Install Bioinformatics Packages in R.
  11. Initialize Library to Perform R Functions.
  12. Get Help from Help Packages.
  13. Explain Atomic Vectors in R.
  14. Explain Integers, Doubles, Logicals, and Factors in R.
  15. Discuss Dim and Dimensions in R.
  16. Explain Attributes and Names.
  17. Describe Matrix and Matrices.
  18. Explain Arrays and Lists.
  19. Describe Coercion.
  20. Explain Data Frames.
  21. Load Biological Data.
  22. Save Biological Data.
  23. Perform R Notation and Select Values from Biological Datasets.
  24. Discuss Positive Integers for Subsetting Biological Datasets.
  25. Discuss Negative Integers for Subsetting Biological Datasets.
  26. Describe Zero Notation for Subsetting Biological Datasets.
  27. Explain Blank Spaces for Subsetting Biological Datasets.
  28. Explain Dollar Signs for Subsetting Biological Datasets.
  29. Modify Values in Existing Datasets.
  30. Explain NA (Not Available) Values in Biological Datasets.
  31. Figure out NA Values in Biological Datasets.
  32. Perform Logical Subsetting in Biological Datasets.
  33. Use If Else Statement in Code.
  34. Use For Loops and Perform Biological Data Binding.
  35. Use While Loops and Read Multiple Biological Datasets.
  36. Explain ggplot2 and its Use in Biological Data Representation.
  37. Describe Key Components in ggplot2.
  38. Visualize Human Mitochondrial Proteome.
  39. Facet the Human Chromosome Dataset.
  40. Smooth Out the Biological Data.
  41. Create Box Plot for Human Mitochondrial Proteome.
  42. Create Histograms for Human Mitochondrial Pattern Finding.
  43. Create Frequency Plots for Human Mitochondrial Information Frequency Mining.
  44. Create Bar Charts for Human Mitochondrial Knowledge Mining.
  45. Scale and Limit Data Visualization.
  46. Visualize Phylogenetic Tree.
  47. Save Visualizations in High Resolution.
Section 4: Linux

Description: This section will focus on making sure that the students will learn about R language

Learning Outcomes:  Upon completion of this section, students will be able to:

  1. Discuss Linux Operating System.
  2. Print Working Directory in Linux.
  3. Make Directories in Linux.
  4. Change Directories in Linux.
  5. Move Files, Directories, and Data.
  6. Delete Files and Directories in Linux.
  7. Find the Programs Installed by the User.
  8. Find the Files Created by the User.
  9. List Files and Directories on Linux.
  10. Pipe and Redirect Data.
  11. Visualize and Inspect Text Data.
  12. Read the Specified Number of Lines from the Bottom
  13. Modify File Statistics and Create Files.
  14. See the Statistics of Files & Directories.
  15. Retrieve Genome Assemblies.
  16. Retrieve Bioinformatics Files.
  17. Create and Edit Text Files.
  18. Find Sequence Differences in Files.
  19. Compress and Archive Files Efficiently.
  20. Extract Compressed Content.
  21. Create Archives of Genome Data.
  22. Find Uncharacterized Proteins in the Human Genome.
  23. Subset Required Textual Data from Text Files.
  24. Sort Data.
  25. Find Unique Data Items.
  26. See the Statistics of Data Within the File.
  27. Copy Files and File Contents.
  28. Properly Visualize Delimited Datasets.

 

Show More

What Will You Learn?

  • Introduction
  • Iterable Objects
  • Control Flow
  • File Handling
  • Functions & Modules
  • Error Handling
  • Sequence Analysis
  • Sequence Data Parsing
  • Sequence Data Extraction
  • Alignment Parsing and Analysis
  • BLAST Database Searching
  • Parsing BLAST results
  • Biological Data Retrieval
  • Parsing a PDB Structure file
  • Phylogenetic Analysis
  • Protein Sequence Analysis
  • Variables & Functions
  • Packages
  • Vectors & Data Types
  • Biological Data Analysis
  • Data Visualization: ggplot2
  • Getting Familiar With Linux
  • Piping and Control Data Flow
  • Pre-processing Biological Datasets
  • Processing and Analysis of Biological Datasets
  • BioConductor
  • Sequence Retrieval
  • Bioinformatics File Parsing and Writing
  • Sequence Alignment
  • Database Searching
  • Gene Enrichment Analysis
  • Data Transformation with dplyr
  • Tidy Data with tidyr
  • MicroArray Analysis: BioConductor

Course Content

Python

  • Why Python in Bioinformatics
    09:16
  • Introduction to Python and it’s Installation
    08:25
  • Comments
    05:43
  • Basic Input and output
    15:38
  • Mathematical Operations
    07:20
  • Strings
    21:51
  • Dictionaries
    10:57
  • Lists
    28:48
  • Lists(pt 2) and Tuples
    10:38
  • Sets
    07:36
  • If-Else
    09:19
  • For Loop and Calculation of Molecular Weight of Proteins
    10:56
  • While Loop and Biological Data Analysis
    09:37
  • CSV (A special kind of file in Bioinformatics)
    08:42
  • Reading Files
    13:45
  • Writing Files
    07:18
  • Consolidate (merge) multiple DNA and Protein Sequences into one FASTA file
    09:25
  • OS Module
    31:47
  • Function
    26:41
  • With
    08:50
  • Error Handling
    00:00

BioPython

R

Linux

Exercise

Earn a certificate

Add this certificate to your resume to demonstrate your skills & increase your chances of getting noticed.

selected template

Student Ratings & Reviews

No Review Yet
No Review Yet

Want to receive push notifications for all major on-site activities?

Select your currency
Hurry up! Sale ends in:
Days
Hours
Minutes
Seconds