Comparing Genes, Proteins, and Genomes (Bioinformatics III)

University of California, San Diego via Coursera

  • Provider: Coursera
  • Subject: Bioinformatics
  • Cost: Free Online Course (Audit)
  • Session: Upcoming
  • Language: English
  • Certificate: Paid Certificate Available
  • Effort: 4-10 hours a week
  • Start Date
  • Duration: 6 weeks long

Having sequenced genomes in the previous course, we would now like to compare them to determine how species have evolved and what makes them different.

In the first half of the course, we will compare two short biological sequences, such as genes (i.e., short sequences of DNA) or proteins. We will encounter a powerful algorithmic tool called dynamic programming that will help us determine the number of mutations that have separated the two genes/proteins.
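
As a minimal sketch of how dynamic programming can count the changes separating two sequences, the Python function below computes the edit distance: the smallest number of substitutions, insertions, and deletions that turn one string into another. Treating edit distance as the "number of mutations" is an assumption made here for illustration, and the function name `edit_distance` is hypothetical rather than taken from the course.

```python
def edit_distance(s: str, t: str) -> int:
    """Minimum number of substitutions, insertions, and deletions turning s into t."""
    n, m = len(s), len(t)
    # dist[i][j] holds the edit distance between the prefixes s[:i] and t[:j].
    dist = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(n + 1):
        dist[i][0] = i  # delete all i symbols of s[:i]
    for j in range(m + 1):
        dist[0][j] = j  # insert all j symbols of t[:j]
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            substitution = 0 if s[i - 1] == t[j - 1] else 1
            dist[i][j] = min(
                dist[i - 1][j] + 1,                 # deletion
                dist[i][j - 1] + 1,                 # insertion
                dist[i - 1][j - 1] + substitution,  # match or substitution
            )
    return dist[n][m]


if __name__ == "__main__":
    print(edit_distance("ACGT", "AGT"))  # 1: delete the C
```

The table has (len(s) + 1) x (len(t) + 1) entries, so both running time and memory grow with the product of the two lengths.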

In the second half of the course, we will "zoom out" to compare entire genomes, where we see large-scale mutations called genome rearrangements, seismic events that have heaved around large blocks of DNA over millions of years of evolution. Looking at the human and mouse genomes, we will ask ourselves: just as earthquakes are much more likely to occur along fault lines, are there locations in our genome that are "fragile" and more susceptible to being broken as part of genome rearrangements? We will see how combinatorial algorithms will help us answer this question.

Finally, you will learn how to apply popular bioinformatics software tools, including BLAST, to solve problems in sequence alignment.


Week 1: Introduction to Sequence Alignment

Welcome to class!

If you joined us in the previous course in this Specialization, then you became an expert at assembling genomes and sequencing antibiotics. The next natural question to ask is how to compare DNA and amino acid sequences. This question will motivate this week's discussion of sequence alignment. It is the first of two questions that we will ask in this class (the algorithmic methods used to answer them are shown in parentheses):

  1. How Do We Compare DNA Sequences? (Dynamic Programming)
  2. Are There Fragile Regions in the Human Genome? (Combinatorial Algorithms)

As in previous courses, each of these two chapters is accompanied by a Bioinformatics Cartoon created by talented artist Randall Christopher and serving as a chapter header in the Specialization's bestselling print companion. You can find the first chapter's cartoon at the bottom of this message. Why have taxis suddenly become free of charge in Manhattan? Where did Pavel get so much spare change? And how should you get dressed in the morning so that you aren't late to your job as a crime-stopping superhero? Answers to these questions, and many more, in this week's installment of the course.

Week 2: From Finding a Longest Path to Aligning DNA Strings

Welcome to Week 2 of the class!

Last week, we saw how touring around Manhattan and making change in a Roman shop help us find a longest common subsequence of two DNA or protein strings.
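
For readers who want to see the longest common subsequence idea in code, here is a short Python sketch. It fills a table of subsequence lengths and then walks back through the table to recover one longest common subsequence. The function name is hypothetical, and when several longest common subsequences exist, which one is returned depends on the tie-breaking rule chosen here.

```python
def longest_common_subsequence(s: str, t: str) -> str:
    """Return one longest common subsequence of s and t."""
    n, m = len(s), len(t)
    # length[i][j] holds the LCS length of the prefixes s[:i] and t[:j].
    length = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            if s[i - 1] == t[j - 1]:
                length[i][j] = length[i - 1][j - 1] + 1
            else:
                length[i][j] = max(length[i - 1][j], length[i][j - 1])

    # Walk back from the bottom-right corner to spell out the subsequence.
    symbols = []
    i, j = n, m
    while i > 0 and j > 0:
        if s[i - 1] == t[j - 1]:
            symbols.append(s[i - 1])
            i -= 1
            j -= 1
        elif length[i - 1][j] >= length[i][j - 1]:
            i -= 1
        else:
            j -= 1
    return "".join(reversed(symbols))


if __name__ == "__main__":
    print(longest_common_subsequence("AACCTTGG", "ACACTGTGA"))
```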

This week, we will study how to find a highest-scoring alignment of two strings. We will see that regardless of the underlying assumptions that we make regarding how the strings should be aligned, we will be able to phrase our alignment problem as an instance of finding the longest path in a directed acyclic graph.
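
The reduction to a longest path problem can be sketched in Python as follows. The first function finds the heaviest path in any directed acyclic graph by processing nodes in topological order; the second builds a grid-shaped alignment graph for two strings. The scoring values (match +1, mismatch -1, indel -1) and all function names are assumptions chosen for illustration, not the course's own parameters.

```python
from collections import defaultdict, deque


def longest_path_length(edges, source, sink):
    """Maximum total weight of a path from source to sink in a DAG.

    edges is a list of (from_node, to_node, weight) triples.
    Returns None if sink cannot be reached from source.
    """
    out_edges = defaultdict(list)
    indegree = defaultdict(int)
    nodes = {source, sink}
    for u, v, w in edges:
        out_edges[u].append((v, w))
        indegree[v] += 1
        nodes.update((u, v))

    # Kahn's algorithm: repeatedly process nodes with no unprocessed predecessors.
    queue = deque(node for node in nodes if indegree[node] == 0)
    best = {source: 0}  # best[v] = heaviest path weight from source to v
    while queue:
        u = queue.popleft()
        for v, w in out_edges[u]:
            if u in best and best.get(v, float("-inf")) < best[u] + w:
                best[v] = best[u] + w
            indegree[v] -= 1
            if indegree[v] == 0:
                queue.append(v)
    return best.get(sink)


def alignment_graph(s, t, match=1, mismatch=-1, indel=-1):
    """Edges of a grid graph in which every source-to-sink path is an alignment."""
    edges = []
    for i in range(len(s) + 1):
        for j in range(len(t) + 1):
            if i < len(s):
                edges.append(((i, j), (i + 1, j), indel))  # gap in t
            if j < len(t):
                edges.append(((i, j), (i, j + 1), indel))  # gap in s
            if i < len(s) and j < len(t):
                weight = match if s[i] == t[j] else mismatch
                edges.append(((i, j), (i + 1, j + 1), weight))
    return edges


def global_alignment_score(s, t):
    """Score of a highest-scoring global alignment under the assumed scoring."""
    return longest_path_length(alignment_graph(s, t), (0, 0), (len(s), len(t)))


if __name__ == "__main__":
    print(global_alignment_score("ACGT", "AGT"))  # 2: three matches, one indel
```

Changing the alignment assumptions only changes the edges and weights produced by `alignment_graph`; the longest path routine stays the same.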

Week 3: Advanced Topics in Sequence Alignment

Welcome to Week 3 of the class!

Last week, we saw how a variety of different applications of sequence alignment can all be reduced to finding the longest path in a Manhattan-like graph.

This week, we will conclude the current chapter by considering a few advanced topics in sequence alignment. For example, if we need to align long strings, our current algorithm will consume a huge amount of memory. Can we find a more memory-efficient approach? And what should we do when we move from aligning just two strings at a time to aligning many strings?
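
One simple way to reduce memory, sketched below in Python, is to keep only two rows of the dynamic programming table at a time. This sketch returns only the alignment score; recovering the alignment itself in linear space requires an additional divide-and-conquer step that is not shown here. The scoring values and the function name are assumptions for illustration.

```python
def alignment_score_linear_space(s, t, match=1, mismatch=-1, indel=-1):
    """Global alignment score using memory proportional to len(t)."""
    # prev holds the scores for aligning s[:i-1] against every prefix of t.
    prev = [j * indel for j in range(len(t) + 1)]
    for i in range(1, len(s) + 1):
        curr = [i * indel] + [0] * len(t)
        for j in range(1, len(t) + 1):
            diagonal = prev[j - 1] + (match if s[i - 1] == t[j - 1] else mismatch)
            curr[j] = max(diagonal, prev[j] + indel, curr[j - 1] + indel)
        prev = curr  # the older row is no longer needed
    return prev[len(t)]


if __name__ == "__main__":
    print(alignment_score_linear_space("ACGT", "AGT"))  # 2
```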

Week 4: Genome Rearrangements and Fragility

Welcome to Week 4 of the class!

You now know how to compare two DNA (or protein) strings. But what if we wanted to compare entire genomes? When we "zoom out" to the genome level, we find that substitutions, insertions, and deletions don't tell the whole story of evolution: we need to model more dramatic evolutionary events known as genome rearrangements, which wrench apart chromosomes and put them back together in a new order. A natural question to ask is whether there are "fragile regions" hidden in your genome where chromosome breakage has occurred more often over millions of years. This week, we will begin addressing this question by asking how we can compute the number of rearrangements on the evolutionary path connecting two species.
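
As a small, hedged illustration of how rearrangements can be quantified, the Python sketch below counts breakpoints in a signed permutation, a common way to represent the order and orientation of shared blocks of genes relative to another genome. Representing a genome as a signed permutation and the function name `count_breakpoints` are assumptions made here. Because a single reversal can remove at most two breakpoints, half the breakpoint count gives a lower bound on the number of reversals needed to sort the permutation; it is not the exact rearrangement count.

```python
def count_breakpoints(permutation):
    """Number of breakpoints in a signed permutation of 1..n.

    The permutation is framed by 0 and n + 1. Consecutive elements (a, b)
    form an adjacency when b - a == 1 and a breakpoint otherwise.
    """
    extended = [0] + list(permutation) + [len(permutation) + 1]
    return sum(1 for a, b in zip(extended, extended[1:]) if b - a != 1)


if __name__ == "__main__":
    print(count_breakpoints([1, 2, 3]))   # 0: identity permutation
    print(count_breakpoints([1, -2, 3]))  # 2: block 2 is inverted
```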

You can find this week's Bioinformatics Cartoon from Randall Christopher at the bottom of this E-mail. What do earthquakes and a stack of pancakes have to do with species evolution? Keep learning to find out!

Week 5: Applying Genome Rearrangement Analysis to Find Genome Fragility

Last week, we asked whether there are fragile regions in the human genome. Then, we took a lengthy detour to see how to compute a distance between species genomes, a discussion that we will continue this week.

It is probably unclear how computing the distance between two genomes can help us understand whether fragile regions exist. If so, please stay tuned -- we will see that the connection between these two concepts will yield a surprising conclusion to the class.

Week 6: Bioinformatics Application Challenge

In the sixth and final week of the course, we will apply sequence alignment algorithms to infer the non-ribosomal code.

Taught by

Pavel Pevzner, Phillip Compeau and Nikolay Vyahhi

Reviews for Coursera's Comparing Genes, Proteins, and Genomes (Bioinformatics III)
5.0, based on 2 reviews

  • 5 stars: 100%
  • 4 stars: 0%
  • 3 stars: 0%
  • 2 stars: 0%
  • 1 star: 0%

Ericdo1810
5.0, 3 years ago
Ericdo1810 completed this course, spending 12 hours a week on it, and found the course difficulty to be very hard.
This course is great, and beyond that. I have never seen any MOOC like this one. So challenging, so rigorous, and so satisfying upon completion (which I just did half an hour ago).

You may be turned off by the word "Bio", if you're a CS person. However, what I can say is: you don't need biology in this course, literally: zero biology knowledge. To me, all the genes and chromosomes and blah blah, they are just alphabets! The real deal, the real big deal in this course, is that learners will struggle, and struggle a lot, to learn, understand, and implement all the algorithms taught…
1 person found this review helpful.
Ken S
5.0, 3 years ago
Ken completed this course.