We may earn an affiliate commission when you visit our partners.
Course image
Carrie Wright, PhD

One of the key cancer informatics challenges is dealing with and managing the explosion of large data from multiple sources that are often too large to work with on typical personal computers. This course is designed to help researchers and investigators to understand the basics of computing and to familiarize them with various computing options to ultimately help guide their decisions on the topic. This course aims to provide research leaders with awareness and guidance about:

Read more

One of the key cancer informatics challenges is dealing with and managing the explosion of large data from multiple sources that are often too large to work with on typical personal computers. This course is designed to help researchers and investigators to understand the basics of computing and to familiarize them with various computing options to ultimately help guide their decisions on the topic. This course aims to provide research leaders with awareness and guidance about:

Basic computing terminology

Concepts about how computers and computing systems work

Differences between shared computing resources

Appropriate etiquette for shared computing resources

Computing resources designed for cancer research

Considerations for computing resource decisions

Target audience:

This course is intended for researchers (including postdocs and students) with limited to intermediate experience with informatics research. The conceptual material will also be useful for those in management roles who are collecting data and using informatics pipelines.

Curriculum:

We will provide you with familiarity with fundamental computing terms. We will also discuss relevant concepts about how computers and shared computing resources work. We will explore the differences between various computing resource options, as well as provide guidance on how to make important computing discussions.

This course is part of a series of courses for the Informatics Technology for Cancer Research (ITCR) called the Informatics Technology for Cancer Research Education Resource. This material was created by the ITCR Training Network (ITN) which is a collaborative effort of researchers around the United States to support cancer informatics and data science training through resources, technology, and events. This initiative is funded by the following grant: National Cancer Institute (NCI) UE5 CA254170. Our courses feature tools developed by ITCR Investigators and make it easier for principal investigators, scientists, and analysts to integrate cancer informatics into their workflows. Please see our website at www.itcrtraining.org for more information.

Enroll now

What's inside

Syllabus

Welcome
In this module we will introduce you to how the material will be presented and the goals for the course.
Basic Building Block of Computers
Read more
In this module we will start by describing some basics about how computers work. We feel that familiarity with this information will be helpful for you when you need to make computing decisions for your work.
Binary data to computations
In this module we will talk about how computers store and process data. This will be helpful for understanding computing and storage requirements for your work.
Computing Resources
In this module we will describe some basics about file sizes and computing capacity. We will specifically focus on common types of files used in cancer research. We will also introduce some general concepts for shared computing resource, which can be a great option if you wish to do work that might be too intensive for your personal computer.
Shared Computing Etiquette
In this module we will describe some common good practices for using traditional shared computing resources like clusters. These guidelines will help ensure that you don't use shared resources in a way that might bother others, so that you can continue to have access to such shared resources.
Research Platforms
In this module we will take you through a tour of some computing resource platforms designed for researchers, including some that may be especially useful to cancer researchers.
Data Management Decisions
In this final module we will provide guidance about how to decide what computing resources would be most beneficial for your work.

Good to know

Know what's good
, what to watch for
, and possible dealbreakers
Introduces computing fundamentals, which is standard in cancer research
Taught by Carrie Wright, PhD, who are recognized for their work in cancer informatics
Examines computing resources designed specifically for cancer research
Develops awareness and guidance about computing resources
Appropriate for researchers with limited to intermediate experience in informatics research
Advises research leaders on computing resource decisions

Save this course

Save Computing for Cancer Informatics to your list so you can find it easily later:
Save

Activities

Be better prepared before your course. Deepen your understanding during and after it. Supplement your coursework and achieve mastery of the topics covered in Computing for Cancer Informatics with these activities:
Review binary data conversion
Build a stronger foundation for working with binary data.
Browse courses on Binary Data
Show steps
  • Review binary data representations.
  • Practice converting binary data to decimal and vice versa.
Host a peer study session on binary data concepts
Enhance your understanding by presenting and discussing binary data concepts.
Browse courses on Binary Data
Show steps
  • Review binary data concepts.
  • Prepare to present on the topic.
  • Host the peer study session.
Complete practice exercises on shared computing resources
Reinforce your understanding of using shared computing resources.
Show steps
  • Find practice exercises or tutorials on shared computing resources.
  • Work through the exercises or tutorials.
Eight other activities
Expand to see all activities and additional details
Show all 11 activities
Practice working with binary data
Reinforce understanding of how computers store and process data through practical exercises.
Browse courses on Binary Data
Show steps
  • Solve problems involving binary data conversion
  • Use online tools or resources to practice working with binary data
  • Build a simple program that manipulates binary data
Explore shared computing resources
Enhance familiarity with different types of shared computing resources and their applications.
Show steps
  • Follow online tutorials on using shared computing resources
  • Attend a workshop or webinar on shared computing
  • Experiment with using shared computing resources for small tasks
Create a basic workflow using a research platform
Practice working with research platforms to create a real-world workflow.
Show steps
  • Identify a research platform and follow a tutorial or documentation about using it.
  • Create a basic workflow using the research platform.
Attend a cancer research conference
Connect with experts and researchers in the field to learn about cutting-edge advancements and resource.
Show steps
  • Research and identify relevant conferences
  • Attend talks and presentations on topics related to computing in cancer research
  • Network with attendees to exchange knowledge and ideas
Develop a data management plan
Apply course knowledge to create a plan for managing and organizing research data effectively.
Browse courses on Data Management
Show steps
  • Identify data sources and types
  • Determine storage requirements and strategies
  • Establish data access and sharing protocols
  • Document the data management plan
Build familiarity with the Linux command line
Extend your knowledge by becoming more comfortable with Linux command line.
Browse courses on Linux Command Line
Show steps
  • Find and follow a tutorial on Linux command line basics.
  • Practice using the commands covered in the tutorial.
Develop a data management plan for your research
Apply your knowledge to create a plan for managing your research data.
Show steps
  • Research best practices for data management in your field.
  • Create a data management plan that outlines how you will collect, store, and share your research data.
Develop a computational pipeline for cancer research
Apply course concepts by building a complete computational pipeline for a specific cancer research project.
Show steps
  • Define the research question and data requirements
  • Design and implement the computational pipeline
  • Test and validate the pipeline
  • Document and share the pipeline

Career center

Learners who complete Computing for Cancer Informatics will develop knowledge and skills that may be useful to these careers:
Bioinformatician
Bioinformaticians play a vital role in analyzing and interpreting large and complex biological datasets, enabling researchers to make meaningful discoveries. The Computing for Cancer Informatics course can provide a solid foundation in computing and data management, helping Bioinformaticians understand how computers process and store biological data. This course also covers shared computing resources and research platforms, which are essential for handling the immense datasets encountered in bioinformatics research.
Computational Biologist
Computational Biologists utilize computational techniques to study and model complex biological systems. They work on projects ranging from understanding disease mechanisms to designing new drugs. The Computing for Cancer Informatics course can provide a valuable introduction to computing principles, binary data processing, and shared computing resources, which are fundamental concepts for Computational Biologists. Understanding these concepts can enable them to effectively use computational tools and resources for their research.
Machine Learning Engineer
Machine Learning Engineers design, develop, and deploy machine learning models. The Computing for Cancer Informatics course provides a solid foundation in computing principles and data management, which are essential for Machine Learning Engineers. This course covers concepts such as data storage, processing, and shared computing resources, which are crucial for handling large datasets and training machine learning models.
Research Scientist
Research Scientists conduct original research in various scientific fields, including cancer informatics. The Computing for Cancer Informatics course can equip Research Scientists with the necessary computing knowledge to manage and analyze large datasets, which is becoming increasingly important in scientific research. This course covers concepts like data storage, computing capacity, and research platforms, providing a valuable foundation for researchers working with complex datasets.
Data Engineer
Data Engineers design, build, and maintain data pipelines and infrastructure. The Computing for Cancer Informatics course provides a foundational understanding of data storage, processing, and shared computing resources. This knowledge can help Data Engineers develop efficient and scalable data pipelines for managing and analyzing large datasets, which is essential for modern data-driven applications.
Data Analyst
Data Analysts gather, clean, analyze, and interpret data to identify patterns and trends. The Computing for Cancer Informatics course can provide a valuable foundation in data storage and processing, two essential aspects of data analysis. This course also covers data management decisions, which can help Data Analysts make informed choices about the best computing resources for their specific data analysis tasks.
Computer Scientist
Computer Scientists research and develop new computing technologies and applications. The Computing for Cancer Informatics course offers a solid foundation in computing fundamentals, including binary data processing, and shared computing resources. This knowledge can help Computer Scientists design and implement innovative solutions for managing and analyzing large datasets, which is becoming increasingly important in various fields including cancer research.
Data Scientist
Data Scientists translate vast amounts of data into actionable insights for businesses and organizations. They analyze, interpret, and visualize data to identify trends, predict outcomes, and guide decision-making. The Computing for Cancer Informatics course provides a foundational understanding of computing principles and concepts, storage requirements, and managing and processing large datasets. This knowledge can equip you with the essential skills to understand data structures, work with distributed systems, and utilize advanced computing resources, all of which are crucial skills for Data Scientists.
Database Administrator
Database Administrators manage and maintain databases, ensuring data integrity and accessibility. The Computing for Cancer Informatics course can provide Database Administrators with a deeper understanding of data storage, processing, and shared computing resources. This knowledge can help them optimize database performance, manage large datasets, and ensure data security, which are crucial responsibilities in the field.
Systems Analyst
Systems Analysts design, develop, and implement computer systems for various organizations. The Computing for Cancer Informatics course can provide Systems Analysts with a solid understanding of computing fundamentals, including data processing, storage, and shared computing resources. This knowledge is crucial for designing and managing efficient and effective computer systems, especially for organizations dealing with large datasets.
Software Engineer
Software Engineers design, develop, and maintain software applications. The Computing for Cancer Informatics course provides a foundational understanding of computing principles, data structures, and shared computing resources. This knowledge is essential for Software Engineers working on complex software systems, particularly those involving data analysis and management.
Artificial Intelligence Researcher
Artificial Intelligence Researchers develop and apply artificial intelligence techniques to solve complex problems. The Computing for Cancer Informatics course may be useful for Artificial Intelligence Researchers working on projects involving large datasets. This course provides insights into computing concepts, data storage, and shared computing resources, which can help them understand the computational requirements and resources needed for their research.
Information Technology Specialist
Information Technology Specialists support and maintain computer systems and networks. The Computing for Cancer Informatics course may be useful for Information Technology Specialists who work in research institutions or organizations dealing with large datasets. This course provides insights into computing concepts, data storage, and shared computing resources, which can help them better understand and support the computational needs of researchers.
Cloud Architect
Cloud Architects design and manage cloud computing systems. The Computing for Cancer Informatics course may be useful for Cloud Architects working on projects related to data management and analysis. This course covers concepts such as data storage, computing capacity, and research platforms, which are relevant to designing and managing cloud systems for handling large datasets.
Quantitative Analyst
Quantitative Analysts use mathematical and statistical models to analyze financial data. The Computing for Cancer Informatics course may be useful for Quantitative Analysts who work with large datasets or need to develop computational models. This course provides a foundational understanding of data storage, processing, and shared computing resources, which are essential for managing and analyzing large financial datasets.

Reading list

We've selected ten books that we think will supplement your learning. Use these to develop background knowledge, enrich your coursework, and gain a deeper understanding of the topics covered in Computing for Cancer Informatics.
Offers a comprehensive introduction to machine learning, covering topics such as supervised learning, unsupervised learning, and reinforcement learning. It can be a valuable resource for those who want to gain a deeper understanding of the machine learning algorithms used in cancer informatics.
Provides a comprehensive introduction to statistical learning, covering topics such as regression, classification, and clustering. It can be a valuable resource for those who want to gain a deeper understanding of the statistical methods used in cancer informatics.
Serves as a comprehensive introduction to computational science, covering topics such as numerical methods, data analysis, and high-performance computing. It provides a strong foundation for understanding the computational aspects of cancer informatics.
This classic book provides a comprehensive guide to the Unix operating system, covering topics such as file systems, shells, and programming tools. It can be a valuable resource for those who want to gain a deeper understanding of the underlying operating system used in many computing resources.
Provides a comprehensive introduction to data visualization, covering topics such as visual perception, design principles, and evaluation methods. It can be a valuable resource for those who want to gain a deeper understanding of how to effectively visualize data in the context of cancer informatics.
Offers a practical introduction to using R for data science, covering topics such as data manipulation, visualization, and statistical modeling. It can serve as a valuable resource for those who want to explore R programming in the context of cancer informatics.
For those interested in exploring the use of Apache Spark for data analysis in cancer informatics, this book offers a comprehensive guide to using Spark with R.
Provides a comprehensive guide to using Python for data analysis, including data manipulation, visualization, and statistical modeling. It can complement the course's coverage of basic computing concepts by offering a deeper dive into Python programming.
Covers the fundamentals of data skills in bioinformatics. Suitable for beginners to the field of bioinformatics, this book can act as a supplement to the course, as it offers practical exercises and guidance on using open-source tools.
Focuses on using Python for bioinformatics, providing practical guidance and examples. It can supplement the course's introduction to basic computing concepts by offering a hands-on approach to programming in Python.

Share

Help others find this course page by sharing it with your friends and followers:

Similar courses

Here are nine courses similar to Computing for Cancer Informatics.
Leadership for Cancer Informatics Research
Most relevant
Documentation and Usability for Cancer Informatics
Most relevant
Advanced Reproducibility in Cancer Informatics
Most relevant
Introduction to Reproducibility in Cancer Informatics
Most relevant
Introduction to Computational Materials Design
The Quantum Internet and Quantum Computers: How Will They...
High Performance Computing in the Cloud
CERTaIN: Patient-Centered Outcomes Research
Introduction to Health Informatics
Our mission

OpenCourser helps millions of learners each year. People visit us to learn workspace skills, ace their exams, and nurture their curiosity.

Our extensive catalog contains over 50,000 courses and twice as many books. Browse by search, by topic, or even by career interests. We'll match you to the right resources quickly.

Find this site helpful? Tell a friend about us.

Affiliate disclosure

We're supported by our community of learners. When you purchase or subscribe to courses and programs or purchase books, we may earn a commission from our partners.

Your purchases help us maintain our catalog and keep our servers humming without ads.

Thank you for supporting OpenCourser.

© 2016 - 2024 OpenCourser