We may earn an affiliate commission when you visit our partners.
Course image
Nicholas James Provart

The past decade has seen a vast increase in the amount of data available to biologists, driven by the dramatic decrease in cost and concomitant rise in throughput of various next-generation sequencing technologies, such that a project unimaginable 10 years ago was recently proposed, the Earth BioGenomes Project, which aims to sequence the genomes of all eukaryotic species on the planet within the next 10 years. So while data are no longer limiting, accessing and interpreting those data has become a bottleneck. One important aspect of interpreting data is data visualization. This course introduces theoretical topics in data visualization through mini-lectures, and applied aspects in the form of hands-on labs. The labs use both web-based tools and R, so students at all computer skill levels can benefit. Syllabus may be viewed at https://tinyurl.com/DataViz4GenomeBio.

Enroll now

What's inside

Syllabus

Week 1
In this module we'll cover 3 straightforward approaches for generating simple plots. As we'll see in the lab, often visualizing datasets can help us see the overall shape of the data that might not be captured in descriptive statistics like mean and standard deviation. Plotting datasets is also a useful way to identify outliers. In the mini-lectures we go over some common biological data visualization paradigms and more generally what the common chart types are, and we also talk about the context and grammar of data visualization.
Read more
Week 2
In this week's module we explore ways of displaying biological variation and a little bit of background about track viewers. We also cover visual perception, Gestalt principles, and issues related to colour perception, important for accessibility-related reasons. In the lab we'll use an online app, PlotsOfDifferences, to generate some charts that display variation nicely, and we'll also use R to generate some box plots, histograms, and violin plots. Last but not least, we'll try adjusting some of the settings in JBrowse to help assess gene expression levels in a more intuitive manner. Thanks to Dr. Joachim Goedhart, University of Amsterdam, Netherlands for permission to use PlotsOfDifferences in the lab.
Week 3
In this week's module we explore ways of visualizing gene expression data after briefly covering how we can measure gene expression levels with RNA-seq and identify significantly differentially expressed genes using statistical tests. We also cover design thinking. In the lab we'll use an online platform, Galaxy, to generate a volcano plot for visualizing significantly differentially expressed genes, and we'll also use R to generate some heatmaps of gene expression. Last but not least, we'll create our own "electronic fluorescent pictographs" for a gene expression data set.
Week 4
In this week's module we cover how the Gene Ontology can be used to make sense of often overwhelmingly long lists of genes from transcriptomic and other kind of 'omic experiments, especially through Gene Ontology enrichment analyses. We'll also look at Agile Development and User Testing and how these can help improve data visualization tools. In the lab, we'll try our hand at 3 online Gene Ontology analysis apps, and create some nice overview charts for GO enrichment results in R. Thanks to Dr. Roy Navon, Technion University, Israel, for permission to use GOrilla in the lab. Thanks to Dr. Juri Reimand of the University of Toronto for permission to use g:Profiler. And thanks to Dr. Zhen Su of the China Agricultural University for permission to use AgriGO.
Week 5
In this week's module, we explore tools for displaying and analyzing graph networks, notably those created when we generate protein-protein interactions, especially in a high-throughput manner. These PPIs are deposited in online databases like BioGRID, and can be retrieved on-the-fly via web services for display in powerful network visualization apps like Cytoscape. We'll talk about other web services/APIs that are available for biology in one of the mini-lectures, and in the lab we'll use Cytoscape to explore interactors of BRCA2. We'll also use a plug-in called BiNGO to do Gene Ontology enrichment analyses of its interactors, continuing our exploration of GO that we started last week. Last, we'll try using D3 to display an interaction network in a web page.
Week 6
In this module we cover methods for generating and making sense of ever bigger biological data sets. The growth in sequencing capacity has enabled projects that we unimaginable even a few years ago, such as the Earth Biogenomes Project, which aims to sequence the genome of a representative of every eukaryotic species on the planet. In order to make sense of these large data sets, it is often useful to use dimentionality reduction methods, like t-SNE, PCA, and UMAP, to help visualize how similar samples are. Logic diagrams (Venn-Euler or Upset plots) are also useful for displaying how sets of genes are similar one to another. Thanks to Dr. Tim Hulsen (Philips Research, the Netherlands) for permission to use the DeepVenn app in the lab.

Good to know

Know what's good
, what to watch for
, and possible dealbreakers
Assumes a base level of comfort with command-line software and programming
Offers a great overview of data visualization
Places particular emphasis on the analysis of biological datasets and omics data
Uses Bioconductor R packages

Save this course

Save Data Visualization for Genome Biology to your list so you can find it easily later:
Save

Activities

Be better prepared before your course. Deepen your understanding during and after it. Supplement your coursework and achieve mastery of the topics covered in Data Visualization for Genome Biology with these activities:
Data Visualization Practice Drills
Reinforce your understanding of data visualization techniques through hands-on practice exercises.
Show steps
  • Access online visualization tools like PlotsOfDifferences or Gapminder and explore different data sets.
  • Generate and interpret basic plots (e.g., histograms, scatterplots, line graphs) to identify patterns and trends.
  • Experiment with color schemes, chart types, and annotations to enhance the clarity of your visualizations.
Create different types of data visualizations
Creating different types of data visualizations will provide hands-on practice, improving proficiency in selecting the most appropriate visual encodings and chart types for different types of data.
Browse courses on Chart Creation
Show steps
  • Choose a dataset and create a variety of visualizations, including bar charts, line charts, and scatter plots.
  • Experiment with different data visualization tools, such as Tableau, Power BI, or Google Data Studio.
  • Share your visualizations with others and get feedback.
Advanced Data Visualization Tutorials
Enhance your understanding and skills in advanced data visualization techniques by following guided tutorials.
Show steps
  • Identify online tutorials or courses on specific data visualization tools or techniques (e.g., R's ggplot2, Cytoscape).
  • Work through the tutorials step-by-step, experimenting with the featured techniques and applying them to your own data.
Two other activities
Expand to see all activities and additional details
Show all five activities
Interactive Data Visualization Project
Consolidate your learning by creating an interactive data visualization project that showcases your skills.
Show steps
  • Identify a biological data set of interest and explore it thoroughly.
  • Design and develop an interactive data visualization using a tool like D3 or Plotly.
  • Present your visualization, explaining the insights you gained and the techniques you used.
Develop a data visualization dashboard
Developing a data visualization dashboard will provide experience in designing and implementing interactive visualizations, allowing students to apply their skills in a practical context.
Show steps
  • Identify a dataset and determine the key insights that you want to communicate.
  • Design the layout and structure of your dashboard.
  • Select and implement appropriate visualizations.
  • Make your dashboard interactive by adding filters, tooltips, and other features.
  • Deploy your dashboard and share it with others.

Career center

Learners who complete Data Visualization for Genome Biology will develop knowledge and skills that may be useful to these careers:
Bioinformatician
Bioinformaticians develop and use computational tools to analyze biological data, such as gene sequences and protein structures. This course, Data Visualization for Genome Biology, can be particularly helpful for bioinformaticians because it teaches how to visualize biological data in a way that makes it easier to understand and interpret. This can help bioinformaticians to identify patterns and trends in data, and to make inferences about biological processes. Additionally, this course can help bioinformaticians to communicate their findings to other scientists and to the public.
Computational Biologist
Computational biologists use computer science and mathematics to study biological systems. This course, Data Visualization for Genome Biology, can be particularly helpful for computational biologists because it teaches how to visualize biological data in a way that makes it easier to understand and interpret. This can help computational biologists to identify patterns and trends in data, and to make inferences about biological processes. Additionally, this course can help computational biologists to communicate their findings to other scientists and to the public.
Data Scientist
Data scientists use data to solve problems and make predictions. This course, Data Visualization for Genome Biology, can be particularly helpful for data scientists who work in the field of biology or medicine. The course teaches how to visualize biological data in a way that makes it easier to understand and interpret. This can help data scientists to identify patterns and trends in data, and to make inferences about biological processes. Additionally, this course can help data scientists to communicate their findings to other scientists and to the public.
Biostatistician
Biostatisticians use statistics to analyze biological data. This course, Data Visualization for Genome Biology, can be particularly helpful for biostatisticians because it teaches how to visualize biological data in a way that makes it easier to understand and interpret. This can help biostatisticians to identify patterns and trends in data, and to make inferences about biological processes. Additionally, this course can help biostatisticians to communicate their findings to other scientists and to the public.
Geneticist
Geneticists study genes and their role in health and disease. This course, Data Visualization for Genome Biology, can be particularly helpful for geneticists because it teaches how to visualize biological data in a way that makes it easier to understand and interpret. This can help geneticists to identify patterns and trends in data, and to make inferences about gene function. Additionally, this course can help geneticists to communicate their findings to other scientists and to the public.
Medical Doctor
Medical doctors diagnose and treat diseases. This course, Data Visualization for Genome Biology, can be particularly helpful for medical doctors who are interested in genomics or personalized medicine. The course teaches how to visualize biological data in a way that makes it easier to understand and interpret. This can help medical doctors to make more informed decisions about patient care.
Epidemiologist
Epidemiologists study the causes and spread of diseases. This course, Data Visualization for Genome Biology, can be particularly helpful for epidemiologists who are interested in genomic epidemiology. The course teaches how to visualize biological data in a way that makes it easier to understand and interpret. This can help epidemiologists to identify patterns and trends in data, and to make inferences about the causes and spread of diseases.
Health Scientist
Health scientists conduct research to improve human health. This course, Data Visualization for Genome Biology, can be particularly helpful for health scientists who are interested in genomics or personalized medicine. The course teaches how to visualize biological data in a way that makes it easier to understand and interpret. This can help health scientists to make more informed decisions about research.
Science Teacher
Science teachers teach science to students. This course, Data Visualization for Genome Biology, can be particularly helpful for science teachers who teach biology or genetics. The course teaches how to visualize biological data in a way that makes it easier to understand and interpret. This can help science teachers to make their lessons more engaging and effective.
Science Communicator
Science communicators translate scientific information into a form that is accessible to the public. This course, Data Visualization for Genome Biology, can be particularly helpful for science communicators who want to communicate about genomics or personalized medicine. The course teaches how to visualize biological data in a way that makes it easier to understand and interpret. This can help science communicators to create more informative and engaging content.
Science Writer
Science writers write about science for the public. This course, Data Visualization for Genome Biology, can be particularly helpful for science writers who want to write about genomics or personalized medicine. The course teaches how to visualize biological data in a way that makes it easier to understand and interpret. This can help science writers to create more informative and engaging articles.
Policy Advisor
Policy advisors advise policymakers on the development and implementation of policies. This course, Data Visualization for Genome Biology, can be particularly helpful for policy advisors who work on health or science policy. The course teaches how to visualize biological data in a way that makes it easier to understand and interpret. This can help policy advisors to make more informed recommendations to policymakers.
Science Editor
Science editors edit scientific writing. This course, Data Visualization for Genome Biology, can be particularly helpful for science editors who work on scientific journals or books. The course teaches how to visualize biological data in a way that makes it easier to understand and interpret. This can help science editors to make more informed decisions about what to publish.
Intellectual Property Attorney
Intellectual property attorneys help their clients protect their intellectual property rights. This course, Data Visualization for Genome Biology, can be particularly helpful for intellectual property attorneys who work on patents related to genomics or personalized medicine. The course teaches how to visualize biological data in a way that makes it easier to understand and interpret. This can help intellectual property attorneys to more effectively represent their clients.
Patent Examiner
Patent examiners examine patent applications to determine if they meet the criteria for a patent. This course, Data Visualization for Genome Biology, can be particularly helpful for patent examiners who work on patents related to genomics or personalized medicine. The course teaches how to visualize biological data in a way that makes it easier to understand and interpret. This can help patent examiners to make more informed decisions about whether to grant a patent.

Reading list

We've selected 12 books that we think will supplement your learning. Use these to develop background knowledge, enrich your coursework, and gain a deeper understanding of the topics covered in Data Visualization for Genome Biology.
Classic work on data visualization. It provides a wealth of information on how to create effective visualizations, and must-read for anyone who wants to learn more about data visualization.
Classic work on data visualization. It provides a detailed overview of the different graphical methods that can be used to analyze data. Useful as either a textbook or a reference book, this book should be on every data scientist's bookshelf.
Provides a comprehensive overview of data visualization techniques. It covers a wide range of topics, from the basics of data visualization to advanced techniques for creating interactive visualizations.
Practical guide to creating beautiful visualizations. It covers a wide range of topics, from the basics of design to advanced techniques for creating interactive visualizations. Useful for any level of designer, and an excellent reference guide.
Collection of essays on the topic of visual complexity. It explores the different ways in which we can visualize complex data, and provides a wealth of inspiration for anyone who wants to create more effective visualizations.
Provides a comprehensive overview of the field of genomics. It covers a wide range of topics, from the basics of DNA sequencing to the latest advances in genome editing. This book will be of interest to those who are involved in research on genomes or who wish to understand the impact of genomics on science and medicine.
Provides a hands-on approach to using the popular Python and JavaScript libraries for data visualization. In particular, this book covers using the popular Bokeh library to create interactive visualizations.
Provides a comprehensive overview of the field of bioinformatics. Suitable for beginners, this book covers a wide range of topics, from the basics of DNA sequencing to the latest advances in genome editing. Will be of interest to those who are involved in the field of bioinformatics or those who wish to understand how bioinformatics is used to analyze and interpret biological data.
Practical guide to choosing the right chart for your data. It covers a wide range of chart types, and provides guidance on how to use them effectively. Useful for anyone who wants to communicate data in a clear and concise way.
Provides a comprehensive overview of the field of molecular biology. It covers a wide range of topics, from the basics of DNA structure to the latest advances in gene editing. Essential reading for anyone who is interested in understanding the fundamental principles of molecular biology.
Provides a comprehensive overview of the field of biostatistics. It covers a wide range of topics, from the basics of probability to the latest advances in statistical modeling. Aimed at a general audience with little or no prior knowledge of statistics, this book will be of interest to those who are involved in research on biological or health sciences or who wish to understand the role of statistics in these fields.

Share

Help others find this course page by sharing it with your friends and followers:

Similar courses

Here are nine courses similar to Data Visualization for Genome Biology.
Tableau Expert: Top Visualization Techniques in Tableau 10
Data Visualization with Python
Data Visualization
Building Effective Data Communications with Tableau...
Data Representation and Visualization in Tableau
Data Visualization Capstone
Learn ggplot2 in R for Data Visualization
Statistical Data Visualization with Seaborn From UST
Calculate Basic Descriptive Statistics in Google Sheets
Our mission

OpenCourser helps millions of learners each year. People visit us to learn workspace skills, ace their exams, and nurture their curiosity.

Our extensive catalog contains over 50,000 courses and twice as many books. Browse by search, by topic, or even by career interests. We'll match you to the right resources quickly.

Find this site helpful? Tell a friend about us.

Affiliate disclosure

We're supported by our community of learners. When you purchase or subscribe to courses and programs or purchase books, we may earn a commission from our partners.

Your purchases help us maintain our catalog and keep our servers humming without ads.

Thank you for supporting OpenCourser.

© 2016 - 2024 OpenCourser