We may earn an affiliate commission when you visit our partners.
Course image
Rafael Irizarry and Michael Love

In this course, we begin with approaches to visualization of genome-scale data, and provide tools to build interactive graphical interfaces to speed discovery and interpretation. Using knitr and rmarkdown as basic authoring tools, the concept of reproducible research is developed, and the concept of an executable document is presented. In this framework reports are linked tightly to the underlying data and code, enhancing reproducibility and extensibility of completed analyses. We study out-of-memory approaches to the analysis of very large data resources, using relational databases or HDF5 as "back ends" with familiar R interfaces. Multiomic data integration is illustrated using a curated version of The Cancer Genome Atlas. Finally, we explore cloud-resident resources developed for the Encyclopedia of DNA Elements (the ENCODE project). These address transcription factor binding, ATAC-seq, and RNA-seq with CRISPR interference.

Read more

In this course, we begin with approaches to visualization of genome-scale data, and provide tools to build interactive graphical interfaces to speed discovery and interpretation. Using knitr and rmarkdown as basic authoring tools, the concept of reproducible research is developed, and the concept of an executable document is presented. In this framework reports are linked tightly to the underlying data and code, enhancing reproducibility and extensibility of completed analyses. We study out-of-memory approaches to the analysis of very large data resources, using relational databases or HDF5 as "back ends" with familiar R interfaces. Multiomic data integration is illustrated using a curated version of The Cancer Genome Atlas. Finally, we explore cloud-resident resources developed for the Encyclopedia of DNA Elements (the ENCODE project). These address transcription factor binding, ATAC-seq, and RNA-seq with CRISPR interference.

Given the diversity in educational background of our students we have divided the series into seven parts. You can take the entire series or individual courses that interest you. If you are a statistician you should consider skipping the first two or three courses, similarly, if you are biologists you should consider skipping some of the introductory biology lectures. Note that the statistics and programming aspects of the class ramp up in difficulty relatively quickly across the first three courses. By the third course will be teaching advanced statistical concepts such as hierarchical models and by the fourth advanced software engineering skills, such as parallel computing and reproducible research concepts.

These courses make up two Professional Certificates and are self-paced:

Data Analysis for Life Sciences:

Genomics Data Analysis:

This class was supported in part by NIH grant R25GM114818.

HarvardX requires individuals who enroll in its courses on edX to abide by the terms of the edX honor code. HarvardX will take appropriate corrective action in response to violations of the edX honor code, which may include dismissal from the HarvardX course; revocation of any certificates received for the HarvardX course; or other remedies as circumstances warrant. No refunds will be issued in the case of corrective action for such violations. Enrollees who are taking HarvardX courses as part of another program will also be governed by the academic policies of those programs.

HarvardX pursues the science of learning. By registering as an online learner in an HX course, you will also participate in research about learning. Read our research statement to learn more.

Harvard University and HarvardX are committed to maintaining a safe and healthy educational and work environment in which no member of the community is excluded from participation in, denied the benefits of, or subjected to discrimination or harassment in our program. All members of the HarvardX community are expected to abide by Harvard policies on nondiscrimination, including sexual harassment, and the edX Terms of Service. If you have any questions or concerns, please contact [email protected] and/or report your experience through the edX contact form.

What's inside

Learning objectives

  • Static and interactive visualization of genomic data
  • Reproducible analysis methods
  • Memory-sparing representations of genomic assays
  • Working with multiomic experiments in cancer
  • Targeted interrogation of cloud-scale genomic archives

Good to know

Know what's good
, what to watch for
, and possible dealbreakers
Develops statistical analysis methods using R, which is standard in industry and the sciences
Builds a strong foundation for high-throughput genomics, a contemporary area of research
Teaches reproducible analysis, a highly relevant skill in data science
Explores real-world case studies in functional genomics for the analysis of cancer
Covers statistical inference and modeling for high-throughput experiments, core skills for modern biological research
Prerequisites may be required for individuals without strong programming and statistics backgrounds

Save this course

Save Advanced Bioconductor to your list so you can find it easily later:
Save

Activities

Be better prepared before your course. Deepen your understanding during and after it. Supplement your coursework and achieve mastery of the topics covered in Advanced Bioconductor with these activities:
Review the textbook: 'Introduction to Statistical Learning'
Referencing a standard textbook will complement the course material and provide a comprehensive foundation in statistical learning.
Show steps
  • Read and understand the main concepts presented in the textbook
  • Use the textbook as a reference for specific topics or concepts
Organize and review course materials
Maintaining organized notes and reviewing materials regularly can enhance your retention and understanding of the course content.
Show steps
  • Create a system for keeping track of assignments, notes, and other materials
  • Regularly review and summarize your notes
Join a study group for collaborative learning
Engaging in peer discussions can enhance your understanding by exposing you to different perspectives and fostering collaboration.
Show steps
  • Find or form a study group with classmates or online learners
  • Meet regularly to discuss course material
  • Collaborate on assignments or projects
Four other activities
Expand to see all activities and additional details
Show all seven activities
Follow tutorials on R programming
Learning R is essential for this course. Following guided tutorials will provide step-by-step instructions and examples to help you master the language.
Browse courses on R Programming
Show steps
  • Identify online tutorials or video courses
  • Set aside dedicated time for learning R
  • Apply what you learn by practicing on sample datasets
Solve practice problems on statistical inference
Regularly solving practice problems will help solidify your understanding of statistical inference methods and improve your problem-solving skills.
Browse courses on Statistical Inference
Show steps
  • Identify online resources or textbooks for practice problems
  • Set aside dedicated time for practicing
  • Review your solutions and identify areas for improvement
Create a visual summary of key concepts
Using your knowledge from this course, create a visually appealing representation of key concepts to reinforce your understanding and improve your recall.
Show steps
  • Identify the main concepts covered in the course
  • Choose a visual format (e.g., infographic, mind map, diagram)
  • Design and create your visual summary
Develop a web application for interactive genome visualization
Building a web application will challenge you to apply your knowledge of visualization and programming to create a practical tool that showcases your understanding.
Browse courses on Interactive Visualization
Show steps
  • Design the user interface and functionality of your application
  • Implement the application using appropriate programming languages and frameworks
  • Test and refine your application based on user feedback

Career center

Learners who complete Advanced Bioconductor will develop knowledge and skills that may be useful to these careers:
Geneticist
A Geneticist studies the inheritance of genes and genetic variation. Advanced Bioconductor can help build a foundation in working with genomic assays, which is essential for a Geneticist.
Bioinformatics Scientist
A Bioinformatics Scientist develops and applies computational tools and approaches to analyze large biological datasets, such as genomic data. Advanced Bioconductor can help build a foundation in working with genomic assays, which is essential for a Bioinformatics Scientist.
Computational Biologist
A Computational Biologist uses computational tools and techniques to analyze and interpret biological data. Advanced Bioconductor can help build a foundation in working with large genomic datasets, which is essential for a Computational Biologist.
Professor
A Professor teaches and conducts research at a college or university. Advanced Bioconductor can help build a foundation in working with genomic assays, which can be useful in teaching and research.
Research Scientist
A Research Scientist conducts research in a specific field of science. Advanced Bioconductor can help build a foundation in working with large genomic datasets, which can be useful in research.
Postdoctoral Researcher
A Postdoctoral Researcher conducts research under the supervision of a senior researcher. Advanced Bioconductor can help build a foundation in working with large genomic datasets, which can be useful in postdoctoral research.
Database Manager
A Database Manager designs, implements, and maintains databases to store and manage data. Advanced Bioconductor may help build a foundation in working with large datasets, which can be useful in database management.
Software Engineer
A Software Engineer designs, develops, and maintains software systems. Advanced Bioconductor may help build a foundation in working with large datasets and applying statistical methods, which can be useful in software engineering.
Data Scientist
A Data Scientist collects, analyzes, and interprets data to extract meaningful insights and make data-driven decisions. Advanced Bioconductor may help build a foundation in working with large datasets and applying statistical methods, which can be useful in data science.
Epidemiologist
An Epidemiologist investigates the causes and patterns of health and disease in populations. Advanced Bioconductor may help build a foundation in working with genomic data, which can be useful in epidemiological research.
Biostatistician
A Biostatistician designs studies to collect and analyze data related to the effectiveness of medications, treatments, and other healthcare interventions. They apply statistical methods to analyze data and draw conclusions. Advanced Bioconductor may help build a foundation in statistical methods, particularly as they apply to genomic assays, which can be useful in performing biostatistical analysis on genomic data.
Science Writer
A Science Writer communicates complex scientific information to the public. Advanced Bioconductor may help build a foundation in understanding genomic data, which can be useful in writing about scientific topics.
Medical Scientist
A Medical Scientist conducts research to discover new treatments and cures for diseases. Advanced Bioconductor may help build a foundation in working with genomic data, which can be useful in medical research.
Physician
A Physician diagnoses and treats diseases and injuries. Advanced Bioconductor may help build a foundation in working with genomic data, which can be useful in understanding the genetic basis of diseases.
Pharmacist
A Pharmacist dispenses medications and provides advice on their use. Advanced Bioconductor may help build a foundation in working with genomic data, which can be useful in understanding the mechanisms of action of medications.

Reading list

We've selected 22 books that we think will supplement your learning. Use these to develop background knowledge, enrich your coursework, and gain a deeper understanding of the topics covered in Advanced Bioconductor.
Provides a comprehensive overview of Bioconductor, an open-source software platform for the analysis and visualization of genomic data. It includes case studies that demonstrate how to use Bioconductor to analyze real-world data sets.
Provides a comprehensive grounding in statistical learning methods, expanding on the fundamental concepts introduced in the course.
Provides a comprehensive overview of the statistical and computational methods used to analyze genomic and proteomic data.
Provides a comprehensive overview of the use of R for data analysis in the life sciences. It covers a wide range of topics, from basic data manipulation to advanced statistical methods.
Provides a statistical perspective on the analysis of biological data, with a particular focus on genomics and bioinformatics.
Provides a comprehensive overview of high-dimensional data analysis methods, with a focus on applications in genomics and bioinformatics.
Provides a comprehensive introduction to R for data science, covering data manipulation, visualization, statistical modeling, and much more.
Covers essential principles and practices of reproducible research, complementing the course's emphasis on reproducible analysis methods.
Offers advanced techniques for data manipulation, visualization, and statistical modeling, enhancing the R programming skills gained in the course.
Provides an introduction to machine learning methods and their applications in bioinformatics.
Provides a comprehensive introduction to R programming, with a focus on best practices and code optimization.
Provides a practical introduction to the analysis of next-generation sequencing data, with a focus on methods for aligning and assembling reads, identifying variants, and performing differential expression analysis.
Offers a practical guide to machine learning techniques in R, complementing the course's focus on statistical inference and modeling.
Provides an introduction to exploratory data analysis using R, with a focus on visualization and interactive analysis.
Provides a comprehensive overview of bioinformatics algorithms. It covers a wide range of topics, from basic concepts to advanced methods.
Provides a comprehensive overview of computational methods in genomics. It covers a wide range of topics, from basic concepts to advanced methods.
Provides a comprehensive overview of bioinformatics. It covers a wide range of topics, from basic concepts to advanced methods.

Share

Help others find this course page by sharing it with your friends and followers:

Similar courses

Here are nine courses similar to Advanced Bioconductor.
Introduction to Bioconductor
Most relevant
Case Studies in Functional Genomics
Most relevant
CS50's Introduction to Computer Science
Most relevant
Leaders of Learning
Most relevant
Masterpieces of World Literature
Most relevant
First Nights - Beethoven's 9th Symphony and the 19th...
Most relevant
The Einstein Revolution
Most relevant
Tangible Things: Discovering History Through Artworks,...
Most relevant
Statistics and R
Our mission

OpenCourser helps millions of learners each year. People visit us to learn workspace skills, ace their exams, and nurture their curiosity.

Our extensive catalog contains over 50,000 courses and twice as many books. Browse by search, by topic, or even by career interests. We'll match you to the right resources quickly.

Find this site helpful? Tell a friend about us.

Affiliate disclosure

We're supported by our community of learners. When you purchase or subscribe to courses and programs or purchase books, we may earn a commission from our partners.

Your purchases help us maintain our catalog and keep our servers humming without ads.

Thank you for supporting OpenCourser.

© 2016 - 2024 OpenCourser