Introductory Python & BioPython for Bioinformatics

Wishlist Share
Share Course
Page Link
Share On Social Media

About Course

Introductory Python & BioPython for Bioinformatics

The major part of bioinformatics is connecting together different processing steps into a single pipeline and then applying that pipeline to many other files repeatedly, which often involves massive and tedious data processing. The simple syntax and high-level data structures of Python, make it easier for nonprofessional programmers such as computational biologists to develop programming skills, enabling them to interact with data programmatically and eventually develop code on their own. Biopython is a set of freely available tools for biological computation written in Python language.


BioCode is offering an Introductory Python & BioPython for Bioinformatics course in which you’ll learn various concepts related to how to write your first script, and Python data structures such as lists, strings, dictionaries, and more. You’ll learn how to use BioPython for reading/writing bioinformatics files, biological data retrieval from various bioinformatics databases, sequence analysis and alignment, genome analysis, proteome analysis, phylogenetic analysis, and much more. BioPython provides various modules and functions for the study and analysis of huge biological datasets. 


This course is for absolute beginners in bioinformatics scripting and you don’t require any prior knowledge of Python programming or even bioinformatics to get started with this course. In this course you will see that by using the modules available within the BioPython package in Python, we don’t have to write long codes to perform a specific task on our biological data, rather we can just call in the built-in functions of BioPython and perform the required task.


Introductory Python BioPython for Bioinformatics | BioCode


This course will include the following sections:

Section 1: Introduction to Python

Description: This section will focus on making sure that the students gain an understanding of scripting in Python language and the basic functions that can be used to manipulate biological data.

Learning Outcomes:  Upon completion of this section, students will be able to:

  1. Learn the Importance of Python in Bioinformatics.
  2. Understand Python Programming Language.
  3. Install Python Language.
  4. Discuss Comments in Programming Language.
  5. Perform Basic Input and Output Functions.
  6. Perform Mathematical Operations.
  7. Explain Strings Data Structure.
  8. Explain Dictionaries.
  9. Discuss Lists in Python.
  10. Describe Tuples.
  11. Explain Sets.
  12. Execute If-Else Conditions in Scripts.
  13. Execute While Loop and Perform Biological Data Analysis.
  14. Read Files.
  15. Write Files.
  16. Consolidate (merge) Multiple DNA and Protein Sequences into one FASTA File.
  17. Describe the OS Module.
  18. Explain CSV Files.
  19. Explain Functions in Python.
  20. Use the “With” Statement in Python.
  21. Perform Error Handling.


Section 2: In-Depth Fundamentals of BioPython

Description: This section will ensure that the students will learn about the various functions that help in our biological data analysis in the BioPython module provided by the Python programming language.

Learning Outcomes:  Upon completion of this section, students will be able to:

  1. Understand the BioPython module.
  2. Install BioPython.
  3. Create a Sequence Object Using Bio.Seq Class.
  4. Explain How a Sequence Object Behaves like a String.
  5. Perform Central Dogma in BioPython.
  6. Import UnknownSeq and MutableSeq Objects from Bio.Seq Class.
  7. Understand the Alphabets of Biology Using Bio.Alphabet Class.
  8. Explain the IUPAC Module and Types of Sequence Representation.
  9. Concatenate Multiple Sequence Records Using Generic Alphabets.
  10. Create Sequence Records Using SeqRecord Module.
  11. Utilize the SeqRecord Module to Demonstrate the Representation of the FASTA File Within BioPython.
  12. Utilize the SeqRecord Module to Demonstrate the Representation of GenBank File Within BioPython.
  13. Utilize the Formatting Feature of the SeqRecord Module.
  14. Compare and Read Multiple FASTA Files from the Directory Using SeqRecord Module in BioPython.
  15. Read a Sequence File Using SeqIO class.
  16. Parse a Sequence File Using SeqIO class.
  17. Parse a Compressed Sequence File and Create a Dictionary of Sequences.
  18. Write Sequences and SeqRecords into Files.
  19. Extract Annotations and Perform Pattern-wise Sequence Data Extraction Using SeqIO module.
  20. Write Alignment and Multiple Sequence Alignment Records using AlignIO Module.
  21. Read and Parse a Multiple Sequence Alignment File using AlignIO Module.
  22. Convert Alignment Formats.
  23. Manipulate Alignments.
  24. Align Multiple Sequences Using the ClustalW Python Wrapper.
  25. Align Two Sequences Using the paiwise2 Function in BioPython.
  26. Read Multiple Sequence Alignment Files of a Particular Format and Map Information of Alignments.
  27. Format Alignments.
  28. Truncate the Specific Regions from the Entire Alignment (Slice Alignments).
  29. Query NCBI BLAST Through Python.
  30. Parse the BLAST Results using the Bio.Blast module.
  31. Access ENTREZ Using Python.
  32. Get the Summary of Accessions Using Esummary Function of Entrez module in BioPython.
  33. Download Complete Records Using EFetch Function.
  34. Use EGQuery Function to do Global Queries for Search Count.
  35. Search for Database Links of Records Using Elinks.
  36. Search the Entrez Database Using ESearch Function.
  37. Use ESpell Function to Get the Correct Spellings for your Search Terms.
  38. Download GenBank and Entrez Records.
  39. Search Taxonomy Database.
  40. Download PubMed Articles.
  41. Read a PDB (3D Structure) File Using Bio.PDB Module.
  42. Calculate the Distance Matrix Between Sequences for Phylogenetic Analysis.
  43. Convert Phylogenetic Tree Data Formats.
  44. Print Out the Phylogenetic Tree in ASCII.
  45. Read Phylogenetic Trees.
  46. Visualize and Manipulate Phylogenetic Trees.
  47. Create a Web Logo of Motifs.
  48. Write Out Phylogenetic Data.
  49. Perform MEME Analysis.


Show More

What Will You Learn?

  • Introduction
  • Iterable Objects
  • Control Flow
  • File Handling
  • Functions & Modules
  • Error Handling
  • Sequence Analysis
  • Sequence Data Parsing
  • Sequence Data Extraction
  • Alignment Parsing and Analysis
  • BLAST Database Searching
  • Parsing BLAST results
  • Biological Data Retrieval
  • Parsing a PDB Structure file
  • Phylogenetic Analysis
  • Protein Sequence Analysis

Course Content


  • Why Python in Bioinformatics
  • Introduction to Python and it’s Installation
  • Comments
  • Basic Input and output
  • Mathematical Operations
  • Strings
  • Dictionaries
  • Lists
  • Lists (pt 2) and Tuples
  • Sets
  • If-Else
  • For Loop and Calculation of Molecular Weight of Proteins
  • While Loop and Biological Data Analysis
  • Reading Files
  • Writing Files
  • Consolidate (merge) multiple DNA and Protein Sequences into one FASTA file
  • OS Module
  • CSV (A special kind of file in Bioinformatics)
  • Functions
  • With
  • Error Handling



Earn a certificate

Add this certificate to your resume to demonstrate your skills & increase your chances of getting noticed.

selected template

Student Ratings & Reviews

No Review Yet
No Review Yet

Want to receive push notifications for all major on-site activities?

Select your currency
| BioCode
Hurry up! Sale ends in: