AI Workflow: Data Analysis and Hypothesis Testing
This is the second course in the IBM AI Enterprise Workflow Certification specialization. You are STRONGLY encouraged to complete these courses in order, because they are not independent courses but parts of a workflow in which each course builds on the previous ones.
In this course you will begin your work for a hypothetical streaming media company by doing exploratory data analysis (EDA). Best practices for data visualization, handling missing data, and hypothesis testing will be introduced as part of your work. You will learn techniques of estimation with probability distributions and extend these estimates to apply null hypothesis significance tests. You will apply what you learn through two hands-on case studies: data visualization and multiple testing using a simple pipeline.
By the end of this course you should be able to:
1. List several best practices concerning EDA and data visualization
2. Create a simple dashboard in Watson Studio
3. Describe strategies for dealing with missing data
4. Explain the difference between imputation and multiple imputation
5. Employ common distributions to answer questions about event probabilities
6. Explain the investigative role of hypothesis testing in EDA
7. Apply several methods for dealing with multiple testing
Who should take this course?
This course targets existing data science practitioners who have expertise in building machine learning models and want to deepen their skills in building and deploying AI in large enterprises. If you are an aspiring data scientist, this course is NOT for you, as you need real-world expertise to benefit from the content of these courses.
What skills should you have?
It is assumed that you have completed Course 1 of the IBM AI Enterprise Workflow specialization and have a solid understanding of the following topics prior to starting this course:
- Fundamental understanding of linear algebra
- Understanding of sampling, probability theory, and probability distributions
- Knowledge of descriptive and inferential statistical concepts
- General understanding of machine learning techniques and best practices
- Practiced understanding of Python and the packages commonly used in data science: NumPy, Pandas, matplotlib, scikit-learn
- Familiarity with IBM Watson Studio
- Familiarity with the design thinking process
Rating | Not enough ratings |
---|---|
Length | 3 weeks |
Effort | This course requires 7.5 to 9 hours of study. |
Starts | Jun 26 (40 weeks ago) |
Cost | $99 |
From | IBM via Coursera |
Instructors | Mark J Grover, Ray Lopez, Ph.D. |
Download Videos | On all desktop and mobile devices |
Language | English |
Subjects | Data Science, Programming |
Tags | Data Science, Data Analysis, Machine Learning |
Careers
An overview of related careers and their average salaries in the US.
Benefits and Entitlements Specialist, BEST $30k
BEST crisis clinician $67k
USAID-BEST Editor $68k
Senior Business Practices Analyst $71k
Best boy grip $82k
Best Boy Electric $83k
Technology Analyst in Search Practices $85k
Strategic Business Analyst - Best Practices $86k
Best Practice Coordinator $89k
Global Business Practices Analyst $92k
Employment Practices Analyst Lead $97k
NX Best Practices Engineer $99k