Comparing Genes, Proteins, and Genomes (Bioinformatics III) from Coursera

Once we have sequenced genomes in the previous course, we would like to compare them to determine how species have evolved and what makes them different.

In the first half of the course, we will compare two short biological sequences, such as genes (i.e., short sequences of DNA) or proteins. We will encounter a powerful algorithmic tool called dynamic programming that will help us determine the number of mutations that have separated the two genes/proteins.

In the second half of the course, we will "zoom out" to compare entire genomes, where we see large scale mutations called genome rearrangements, seismic events that have heaved around large blocks of DNA over millions of years of evolution. Looking at the human and mouse genomes, we will ask ourselves: just as earthquakes are much more likely to occur along fault lines, are there locations in our genome that are "fragile" and more susceptible to be broken as part of genome rearrangements? We will see how combinatorial algorithms will help us answer this question.

Finally, you will learn how to apply popular bioinformatics software tools to solve problems in sequence alignment, including BLAST.

What's inside

Syllabus

Week 1: Introduction to Sequence Alignment

Welcome to class!

If you joined us in the previous course in this Specialization, then you became an expert at assembling genomes and sequencing antibiotics. The next natural question to ask is how to compare DNA and amino acid sequences. This question will motivate this week's discussion of sequence alignment, which is the first of two questions that we will ask in this class (the algorithmic methods used to answer them are shown in parentheses):

How Do We Compare DNA Sequences? (Dynamic Programming)
Are There Fragile Regions in the Human Genome? (Combinatorial Algorithms)

As in previous courses, each of these two chapters is accompanied by a Bioinformatics Cartoon created by talented artist Randall Christopher and serving as a chapter header in the Specialization's bestselling print companion. You can find the first chapter's cartoon at the bottom of this message. Why have taxis suddenly become free of charge in Manhattan? Where did Pavel get so much spare change? And how should you get dressed in the morning so that you aren't late to your job as a crime-stopping superhero? Answers to these questions, and many more, in this week's installment of the course.

Week 2: From Finding a Longest Path to Aligning DNA Strings

Welcome to Week 2 of the class!

Last week, we saw how touring around Manhattan and making change in a Roman shop help us find a longest common subsequence of two DNA or protein strings.

This week, we will study how to find a highest scoring alignment of two strings. We will see that regardless of the underlying assumptions that we make regarding how the strings should be aligned, we will be able to phrase our alignment problem as an instance of finding the longest path in a directed acyclic graph.

Week 3: Advanced Topics in Sequence Alignment

Welcome to Week 3 of the class!

Last week, we saw how a variety of different applications of sequence alignment can all be reduced to finding the longest path in a Manhattan-like graph.

This week, we will conclude the current chapter by considering a few advanced topics in sequence alignment. For example, if we need to align long strings, our current algorithm will consume a huge amount of memory. Can we find a more memory-efficient approach? And what should we do when we move from aligning just two strings at a time to aligning many strings?

Week 4: Genome Rearrangements and Fragility

Welcome to Week 4 of the class!

You now know how to compare two DNA (or protein) strings. But what if we wanted to compare entire genomes? When we "zoom out" to the genome level, we find that substitutions, insertions, and deletions don't tell the whole story of evolution: we need to model more dramatic evolutionary events known as genome rearrangements, which wrench apart chromosomes and put them back together in a new order. A natural question to ask is whether there are "fragile regions" hidden in your genome where chromosome breakage has occurred more often over millions of years. This week, we will begin addressing this question by asking how we can compute the number of rearrangements on the evolutionary path connecting two species.

You can find this week's Bioinformatics Cartoon from Randall Christopher at the bottom of this E-mail. What do earthquakes and a stack of pancakes have to do with species evolution? Keep learning to find out!

Week 5: Applying Genome Rearrangement Analysis to Find Genome Fragility

Last week, we asked whether there are fragile regions in the human genome. Then, we took a lengthy detour to see how to compute a distance between species genomes, a discussion that we will continue this week.

It is probably unclear how computing the distance between two genomes can help us understand whether fragile regions exist. If so, please stay tuned -- we will see that the connection between these two concepts will yield a surprising conclusion to the class.

Week 6: Bioinformatics Application Challenge

In the sixth and final week of the course, we will apply sequence alignment algorithms to infer the non-ribosomal code.

Good to know

Know what's good

, what to watch for

, and possible dealbreakers

Teaches a fundamental computational tool for examining the evolutionary relationships between genomes

Instructors have extensive experience and recognition in the field of bioinformatics

Builds a strong foundation in comparative genomics algorithms for advanced learners

Requires some background in biology and computer science for full comprehension

May not cover the most recent advancements in genome comparison algorithms

Activities

Be better prepared before your course. Deepen your understanding during and after it. Supplement your coursework and achieve mastery of the topics covered in Comparing Genes, Proteins, and Genomes (Bioinformatics III) with these activities:

Review the basics of molecular biology

Show steps

Reviewing the basics of molecular biology will help you refresh your knowledge of the underlying principles of sequence alignment and genome rearrangement.

Browse courses on Molecular Biology

Show steps

Read a textbook or online resources on molecular biology.
Review your notes from previous courses in molecular biology.

Follow tutorials on BLAST and other sequence alignment software

Show steps

Following tutorials on BLAST and other sequence alignment software will help you learn how to use these tools to analyze biological sequences.

Show steps

Find a set of tutorials on BLAST and other sequence alignment software.
Follow the tutorials to learn how to use the software.

Solve practice problems on sequence alignment

Show steps

Practicing sequence alignment problems will help you solidify your understanding of the algorithms and techniques covered in the course.

Show steps

Find a set of practice problems online or in a textbook.
Solve the problems using the algorithms and techniques taught in the course.

Five other activities

Expand to see all activities and additional details

Show all eight activities

Participate in a study group with other students

Show steps

Participating in a study group with other students will help you learn from each other and reinforce your understanding of the course material.

Show steps

Find a group of students who are also taking the course.
Meet regularly to discuss the course material.

Create a visualization of the human genome

Show steps

Creating a visualization of the human genome will help you understand the structure and organization of the human genome.

Show steps

Gather data on the human genome.
Choose a visualization tool.
Create a visualization of the human genome.

Create a presentation on genome rearrangement

Show steps

Creating a presentation on genome rearrangement will help you deepen your understanding of the concepts and be able to explain them to others.

Show steps

Research the topic of genome rearrangement.
Gather and organize information about the different types of genome rearrangements, their causes, and their consequences.
Create a presentation that explains the concepts of genome rearrangement in a clear and concise way.

Participate in a bioinformatics competition

Show steps

Participating in a bioinformatics competition will help you test your knowledge and skills, and learn from other participants.

Show steps

Find a bioinformatics competition that is relevant to the course material.
Register for the competition and prepare for it by studying the course material and practicing your skills.

Start a project to develop a new sequence alignment algorithm

Show steps

Starting a project to develop a new sequence alignment algorithm will help you apply the concepts and techniques covered in the course to a real-world problem.

Show steps

Research different sequence alignment algorithms.
Design a new sequence alignment algorithm.
Implement the algorithm in a programming language.
Test the algorithm on a variety of biological sequences.
Write a report on your findings.

Career center

Learners who complete Comparing Genes, Proteins, and Genomes (Bioinformatics III) will develop knowledge and skills that may be useful to these careers:

Bioinformatics Software Engineer

A Bioinformatics Software Engineer can design and develop software tools to help scientists analyze and interpret biological data. This course can help build a foundation in the algorithms and techniques used in bioinformatics, which can be valuable for this role. Specifically, the course's focus on sequence alignment and genome rearrangements can provide insights into the development of software tools for analyzing genetic data.

See salaries and explore the career path for Bioinformatics Software Engineer

Computational Biologist

Computational Biology is a field that uses computer science and mathematics to solve problems in biology. This course can help build a foundation in the algorithms and techniques used in bioinformatics, which is a subfield of computational biology. Specifically, the course's focus on sequence alignment and genome rearrangements can provide insights into the development of computational methods for analyzing genetic data.

See salaries and explore the career path for Computational Biologist

Biostatistician

A Biostatistician uses statistics to analyze biological data. This course can help build a foundation in the algorithms and techniques used in bioinformatics, which can be valuable for this role. Specifically, the course's focus on sequence alignment and genome rearrangements can provide insights into the statistical analysis of genetic data.

See salaries and explore the career path for Biostatistician

Data Scientist

A Data Scientist uses data to solve problems in a variety of fields, including biology. This course can help build a foundation in the algorithms and techniques used in bioinformatics, which can be valuable for this role. Specifically, the course's focus on sequence alignment and genome rearrangements can provide insights into the analysis of biological data.

See salaries and explore the career path for Data Scientist

Genetic Counselor

A Genetic Counselor helps people understand and manage their genetic risks. This course can help build a foundation in the algorithms and techniques used in bioinformatics, which can be valuable for this role. Specifically, the course's focus on sequence alignment and genome rearrangements can provide insights into the analysis of genetic data, which is essential for genetic counseling.

See salaries and explore the career path for Genetic Counselor

Forensic Scientist

A Forensic Scientist uses science to solve crimes. This course can help build a foundation in the algorithms and techniques used in bioinformatics, which can be valuable for this role. Specifically, the course's focus on sequence alignment and genome rearrangements can provide insights into the analysis of DNA evidence.

See salaries and explore the career path for Forensic Scientist

Healthcare Consultant

A Healthcare Consultant helps healthcare organizations improve their performance. This course may be useful for this role, as it can provide insights into the analysis of biological data, which is increasingly being used in healthcare.

See salaries and explore the career path for Healthcare Consultant

Insurance Actuary

An Insurance Actuary uses mathematics to assess and manage risk. This course may be useful for this role, as it can provide insights into the analysis of biological data, which is increasingly being used in insurance.

See salaries and explore the career path for Insurance Actuary

Investment Analyst

An Investment Analyst uses financial data to make investment decisions. This course may be useful for this role, as it can provide insights into the analysis of biological data, which is increasingly being used in investment.

See salaries and explore the career path for Investment Analyst

Marketing Manager

A Marketing Manager develops and executes marketing campaigns. This course may be useful for this role, as it can provide insights into the analysis of biological data, which is increasingly being used in marketing.

See salaries and explore the career path for Marketing Manager

Operations Research Analyst

An Operations Research Analyst uses mathematics and computer science to solve problems in a variety of fields, including biology. This course may be useful for this role, as it can provide insights into the analysis of biological data.

See salaries and explore the career path for Operations Research Analyst

Product Manager

A Product Manager develops and manages products. This course may be useful for this role, as it can provide insights into the analysis of biological data, which is increasingly being used in product development.

See salaries and explore the career path for Product Manager

Project Manager

A Project Manager plans and executes projects. This course may be useful for this role, as it can provide insights into the analysis of biological data, which is increasingly being used in project management.

See salaries and explore the career path for Project Manager

Quality Assurance Analyst

A Quality Assurance Analyst ensures that products and services meet quality standards. This course may be useful for this role, as it can provide insights into the analysis of biological data, which is increasingly being used in quality assurance.

See salaries and explore the career path for Quality Assurance Analyst

Risk Manager

A Risk Manager identifies and manages risks. This course may be useful for this role, as it can provide insights into the analysis of biological data, which is increasingly being used in risk management.

See salaries and explore the career path for Risk Manager