We may earn an affiliate commission when you visit our partners.
Course image
Rafael Irizarry, Vincent Carey, and Michael Love

We will explain how to perform the standard processing and normalization steps, starting with raw data, to get to the point where one can investigate relevant biological questions. Throughout the case studies, we will make use of exploratory plots to get a general overview of the shape of the data and the result of the experiment. We start with RNA-seq data analysis covering basic concepts and a first look at FASTQ files. We will also go over quality control of FASTQ files; aligning RNA-seq reads; visualizing alignments and move on to analyzing RNA-seq at the gene-level : counting reads in genes; Exploratory Data Analysis and variance stabilization for counts; count-based differential expression; normalization and batch effects. Finally, we cover RNA-seq at the transcript-level : inferring expression of transcripts (i.e. alternative isoforms); differential exon usage. We will learn the basic steps in analyzing DNA methylation data, including reading the raw data, normalization, and finding regions of differential methylation across multiple samples. The course will end with a brief description of the basic steps for analyzing ChIP-seq datasets, from read alignment, to peak calling, and assessing differential binding patterns across multiple samples.

Read more

We will explain how to perform the standard processing and normalization steps, starting with raw data, to get to the point where one can investigate relevant biological questions. Throughout the case studies, we will make use of exploratory plots to get a general overview of the shape of the data and the result of the experiment. We start with RNA-seq data analysis covering basic concepts and a first look at FASTQ files. We will also go over quality control of FASTQ files; aligning RNA-seq reads; visualizing alignments and move on to analyzing RNA-seq at the gene-level : counting reads in genes; Exploratory Data Analysis and variance stabilization for counts; count-based differential expression; normalization and batch effects. Finally, we cover RNA-seq at the transcript-level : inferring expression of transcripts (i.e. alternative isoforms); differential exon usage. We will learn the basic steps in analyzing DNA methylation data, including reading the raw data, normalization, and finding regions of differential methylation across multiple samples. The course will end with a brief description of the basic steps for analyzing ChIP-seq datasets, from read alignment, to peak calling, and assessing differential binding patterns across multiple samples.

Given the diversity in educational background of our students we have divided the series into seven parts. You can take the entire series or individual courses that interest you. If you are a statistician you should consider skipping the first two or three courses, similarly, if you are biologists you should consider skipping some of the introductory biology lectures. Note that the statistics and programming aspects of the class ramp up in difficulty relatively quickly across the first three courses. By the third course will be teaching advanced statistical concepts such as hierarchical models and by the fourth advanced software engineering skills, such as parallel computing and reproducible research concepts.

These courses make up two Professional Certificates and are self-paced:

Data Analysis for Life Sciences:

Genomics Data Analysis:

This class was supported in part by NIH grant R25GM114818.

HarvardX requires individuals who enroll in its courses on edX to abide by the terms of the edX honor code. HarvardX will take appropriate corrective action in response to violations of the edX honor code, which may include dismissal from the HarvardX course; revocation of any certificates received for the HarvardX course; or other remedies as circumstances warrant. No refunds will be issued in the case of corrective action for such violations. Enrollees who are taking HarvardX courses as part of another program will also be governed by the academic policies of those programs.

HarvardX pursues the science of learning. By registering as an online learner in an HX course, you will also participate in research about learning. Read our research statement to learn more.

Harvard University and HarvardX are committed to maintaining a safe and healthy educational and work environment in which no member of the community is excluded from participation in, denied the benefits of, or subjected to discrimination or harassment in our program. All members of the HarvardX community are expected to abide by Harvard policies on nondiscrimination, including sexual harassment, and the edX Terms of Service. If you have any questions or concerns, please contact [email protected] and/or report your experience through the edX contact form.

What's inside

Learning objectives

  • Mapping reads
  • Quality assessment of next generation data
  • Analyzing rna-seq data
  • Analyzing dna methylation data
  • Analyzing chip seq data

Traffic lights

Read about what's good
what should give you pause
and possible dealbreakers
Taught by Vincent Carey, Michael Love, and Rafael Irizarry, who are recognized for their work in genomics
Develops foundational knowledge and skills in bioinformatics
Examines real-world case studies in functional genomics

Save this course

Create your own learning path. Save this course to your list so you can find it easily later.
Save

Reviews summary

Practical functional genomics data analysis

According to learners, this course, "Case Studies in Functional Genomics," is a valuable and practical experience focusing on applying bioinformatics tools, particularly R and Bioconductor, to analyze genomic data. Students praise the hands-on case studies covering key areas like RNA-seq, DNA methylation, and ChIP-seq, finding them essential for real-world application. While many appreciate the in-depth content, the course is frequently noted as having significant prerequisites, requiring a strong foundation in statistics and programming, especially R. Learners without this background may find the pace challenging, but those prepared often find it a strong foundation for further work in the field.
Good for building core analysis skills.
"This course provided a strong foundation for my future work in functional genomics."
"Feel much more capable of analyzing this type of data now."
"A crucial step for anyone looking to specialize in genomic data analysis."
Covers multiple analysis types deeply.
"The course provides in-depth coverage of RNA-seq, DNA methylation, and ChIP-seq analysis pipelines."
"I appreciated the detailed explanations of the different functional genomics methods."
"Provides a comprehensive overview of key techniques used in the field."
Practical case studies and labs are key.
"The case studies are the most valuable part, providing much-needed hands-on practice."
"I found the labs essential for applying the theoretical knowledge to real data."
"Working through the practical examples helped solidify my understanding greatly."
"The hands-on coding and projects are the strongest part of the course for me."
Pace can be fast for some learners.
"The pace was quite fast, especially given the complexity of the topics."
"Needed to spend a lot of time outside of the lectures to fully grasp the material."
"It's a challenging course, be prepared to dedicate significant effort."
Requires strong stats/R background.
"Make sure you have a solid background in statistics and R before taking this course."
"This course is quite advanced; I struggled without sufficient programming skills."
"The description mentions prerequisites, and they are absolutely necessary to keep up."
"I wish I had spent more time mastering R and the previous courses first."

Activities

Be better prepared before your course. Deepen your understanding during and after it. Supplement your coursework and achieve mastery of the topics covered in Case Studies in Functional Genomics with these activities:
Organize and review course materials
Helps students stay organized and engaged with course materials, improving retention and understanding.
Browse courses on Organization
Show steps
  • Download and print course materials.
  • Create a system for organizing notes and assignments.
  • Regularly review and summarize course materials.
Refresh your knowledge of linear algebra
Ensures students have a strong foundation in linear algebra, essential for understanding statistical models and data analysis.
Browse courses on Linear Algebra
Show steps
  • Review your notes from a previous linear algebra course.
  • Work through practice problems.
Review the textbook 'Bioconductor for Beginners'
Provides a solid foundation in the use of Bioconductor, an essential software package for genomics data analysis.
View Melania on Amazon
Show steps
  • Read the book and take notes.
  • Work through the examples in the book.
Four other activities
Expand to see all activities and additional details
Show all seven activities
Practice mapping sequencing reads
Reinforces hands-on practice in crucial skills in working with sequencing datasets.
Show steps
  • Find an online tutorial on read mapping.
  • Download a small sequencing dataset.
  • Use a read mapping software to align the reads to a reference genome.
  • Evaluate the mapping results.
Analyze RNA-Seq data
Provides practical experience in analyzing real-world biological datasets.
Browse courses on RNA-seq
Show steps
  • Download an RNA-Seq dataset from a public repository.
  • Use a statistical software package to analyze the data.
  • Identify differentially expressed genes.
  • Interpret the results in the context of the biological question being asked.
Follow a tutorial on ChIP-Seq data analysis
Guides students through the process of analyzing ChIP-Seq data, a valuable technique in studying gene regulation.
Show steps
  • Find a tutorial on ChIP-Seq data analysis.
  • Follow the steps in the tutorial to analyze a ChIP-Seq dataset.
  • Interpret the results in the context of the biological question being asked.
Create a presentation on the application of genomics in medicine
Encourages students to synthesize their knowledge of genomics and its applications in healthcare, developing valuable communication and presentation skills.
Browse courses on Genomics
Show steps
  • Research the applications of genomics in medicine.
  • Develop a presentation outline.
  • Create slides with clear and concise content.
  • Practice delivering the presentation.

Career center

Learners who complete Case Studies in Functional Genomics will develop knowledge and skills that may be useful to these careers:
Statistician
A Statistician collects, analyzes, and interprets data. They may work in a variety of settings, such as academia, government, or industry. This course aids in building a foundation in statistical concepts and methods for this role. The course covers topics such as mapping reads, quality assessment of Next Generation Data, analyzing RNA-seq data, analyzing DNA methylation data, and analyzing ChIP Seq data. These topics are essential for a Statistician to effectively analyze data.
Geneticist
A Geneticist studies the inheritance and variation of genes. They may work in a variety of settings, such as academia, government, or industry. This course aids in building a foundation in statistical concepts and methods for this role. The course covers topics such as mapping reads, quality assessment of Next Generation Data, analyzing RNA-seq data, analyzing DNA methylation data, and analyzing ChIP Seq data. These topics are essential for a Geneticist to effectively analyze genetic data.
Biostatistician
A Biostatistician designs and analyzes experiments, interprets data, and draws conclusions. They may work in a variety of settings, such as academia, government, or industry. This course aids in building a foundation in statistical concepts and methods for this role. The course covers topics such as mapping reads, quality assessment of Next Generation Data, analyzing RNA-seq data, analyzing DNA methylation data, and analyzing ChIP Seq data. These topics are essential for a Biostatistician to effectively design and analyze experiments.
Research Scientist
A Research Scientist conducts research in a variety of fields, such as science, engineering, or medicine. They may work in a variety of settings, such as academia, government, or industry. This course aids in building a foundation in statistical concepts and methods for this role. The course covers topics such as mapping reads, quality assessment of Next Generation Data, analyzing RNA-seq data, analyzing DNA methylation data, and analyzing ChIP Seq data. These topics are essential for a Research Scientist to effectively analyze data.
Microbiologist
A Microbiologist studies microorganisms, such as bacteria, viruses, and fungi. They may work in a variety of settings, such as academia, government, or industry. This course may be useful for building a foundation in the analysis of biological data.
Molecular Biologist
A Molecular Biologist studies the structure and function of molecules, such as DNA and RNA. They may work in a variety of settings, such as academia, government, or industry. This course may be useful for building a foundation in the analysis of biological data.
Pharmacologist
A Pharmacologist studies the effects of drugs on the body. They may work in a variety of settings, such as academia, government, or industry. This course may be useful for building a foundation in the analysis of biological data.
Toxicologist
A Toxicologist studies the effects of toxic substances on the body. They may work in a variety of settings, such as academia, government, or industry. This course may be useful for building a foundation in the analysis of biological data.
Bioinformatics Scientist
A Bioinformatics Scientist develops and applies computational tools and techniques to analyze and interpret biological data. They may work in a variety of settings, such as academia, government, or industry. This course may be useful for building a foundation in the analysis of biological data.
Computational Biologist
A Computational Biologist develops and applies computational tools and techniques to solve biological problems. They may work in a variety of settings, such as academia, government, or industry. This course may be useful for building a foundation in the analysis of biological data.
Medical Scientist
A Medical Scientist conducts research to develop new treatments and cures for diseases. They may work in a variety of settings, such as academia, government, or industry. This course may be useful for building a foundation in the analysis of biological data.
Systems Biologist
A Systems Biologist studies the interactions between different components of a biological system. They may work in a variety of settings, such as academia, government, or industry. This course may be useful for building a foundation in the analysis of biological data.
Data Scientist
A Data Scientist develops and applies mathematical and statistical methods to extract meaningful insights from data. They may work in a variety of settings, such as business, government, or healthcare. This course may be useful for building a foundation in data analysis techniques.
Data Analyst
A Data Analyst collects, processes, and analyzes data to extract meaningful insights. They may work in a variety of settings, such as business, government, or healthcare. This course may be useful for building a foundation in data analysis techniques.
Quantitative Analyst
A Quantitative Analyst develops and applies mathematical and statistical models to analyze financial data. They may work in a variety of settings, such as investment banks, hedge funds, or asset management companies. This course may be useful for building a foundation in data analysis techniques.

Reading list

We've selected seven books that we think will supplement your learning. Use these to develop background knowledge, enrich your coursework, and gain a deeper understanding of the topics covered in Case Studies in Functional Genomics.
Provides a comprehensive overview of the Bioconductor project, a collection of open-source software packages for the analysis and visualization of genomic data. It is an essential reference for anyone using Bioconductor for their research.
Provides a comprehensive overview of statistical methods used in bioinformatics, covering topics such as sequence analysis, gene expression analysis, and genome-wide association studies. It valuable resource for anyone working with bioinformatics data.
Provides a comprehensive overview of algorithms used in computational biology, covering topics such as sequence alignment, gene finding, and phylogenetic analysis. It valuable resource for anyone working with computational biology data.
Provides a comprehensive overview of machine learning methods used in bioinformatics, covering topics such as supervised learning, unsupervised learning, and feature selection. It valuable resource for anyone working with bioinformatics data.
Provides a practical guide to the essential data skills needed for bioinformatics research, such as data management, data analysis, and data visualization. It valuable resource for anyone working with bioinformatics data.
Provides a comprehensive overview of biochemistry, covering topics such as protein structure, enzyme kinetics, and metabolism.

Share

Help others find this course page by sharing it with your friends and followers:

Similar courses

Similar courses are unavailable at this time. Please try again later.
Our mission

OpenCourser helps millions of learners each year. People visit us to learn workspace skills, ace their exams, and nurture their curiosity.

Our extensive catalog contains over 50,000 courses and twice as many books. Browse by search, by topic, or even by career interests. We'll match you to the right resources quickly.

Find this site helpful? Tell a friend about us.

Affiliate disclosure

We're supported by our community of learners. When you purchase or subscribe to courses and programs or purchase books, we may earn a commission from our partners.

Your purchases help us maintain our catalog and keep our servers humming without ads.

Thank you for supporting OpenCourser.

© 2016 - 2025 OpenCourser