Data science is quickly becoming one of the most important skills in industry, academia, marketing, and science. Most data-science courses teach analysis methods, but there are many methods; which method do you use for which data? The answer to that question comes from understanding data. That is the focus of this course.
What you will learn in this course:
Data science is quickly becoming one of the most important skills in industry, academia, marketing, and science. Most data-science courses teach analysis methods, but there are many methods; which method do you use for which data? The answer to that question comes from understanding data. That is the focus of this course.
What you will learn in this course:
You will learn how to generate data from the most commonly used data categories for statistics, machine learning, classification, and clustering, using models, equations, and parameters. This includes distributions, time series, images, clusters, and more. You will also learn how to visualize data in 1D, 2D, and 3D.
All videos come with MATLAB and Python code for you to learn from and adapt.
This course is for you if you are an aspiring or established:
Data scientist
Statistician
Computer scientist (MATLAB and/or Python)
Signal processor or image processor
Biologist
Engineer
Student
Curious independent learner.
What you get in this course:
>6 hours of video lectures that include explanations, pictures, and diagrams
pdf readers with important notes and explanations
Exercises and their solutions
MATLAB code and Python code
With >4000 lines of MATLAB and Python code, this course is also a great way to improve your programming skills, particularly in the context of data analysis, statistics, and machine learning.
What do you need to know before taking this course?
You need some experience with either Python or MATLAB programming. You don't need to be an expert coder, but if you are comfortable working with variables, for-loops, and basic plotting, then you already know enough to take this course.
Learn about the two most important distributions used in data science. Then see it in action in Python and MATLAB!
QQ sounds funny, right? But it is a powerful data visualization and inspection method.
Many physical and biological data distributions are characterized by Poisson. Learn how to simulate them in Python and MATLAB.
Log-normal data distributions come from combining other distributions. Hint: They're never negative!
Data quality is super-important in data science. Here you will learn the math, Python, and MATLAB methods for measuring data distribution quality.
You're probably thinking that I'm promoting my own method. But it's a different Cohen's D. Still a good metric, though!
OpenCourser helps millions of learners each year. People visit us to learn workspace skills, ace their exams, and nurture their curiosity.
Our extensive catalog contains over 50,000 courses and twice as many books. Browse by search, by topic, or even by career interests. We'll match you to the right resources quickly.
Find this site helpful? Tell a friend about us.
We're supported by our community of learners. When you purchase or subscribe to courses and programs or purchase books, we may earn a commission from our partners.
Your purchases help us maintain our catalog and keep our servers humming without ads.
Thank you for supporting OpenCourser.