We may earn an affiliate commission when you visit our partners.
Course image
Joseph Santarcangelo

Analyzing data with Python is an essential skill for Data Scientists and Data Analysts. This course will take you from the basics of data analysis with Python to building and evaluating data models.

Topics covered include:

Read more

Analyzing data with Python is an essential skill for Data Scientists and Data Analysts. This course will take you from the basics of data analysis with Python to building and evaluating data models.

Topics covered include:

- collecting and importing data

- cleaning, preparing & formatting data

- data frame manipulation

- summarizing data

- building machine learning regression models

- model refinement

- creating data pipelines

You will learn how to import data from multiple sources, clean and wrangle data, perform exploratory data analysis (EDA), and create meaningful data visualizations. You will then predict future trends from data by developing linear, multiple, polynomial regression models & pipelines and learn how to evaluate them.

In addition to video lectures you will learn and practice using hands-on labs and projects. You will work with several open source Python libraries, including Pandas and Numpy to load, manipulate, analyze, and visualize cool datasets. You will also work with scipy and scikit-learn, to build machine learning models and make predictions.

If you choose to take this course and earn the Coursera course certificate, you will also earn an IBM digital badge.

Enroll now

What's inside

Syllabus

Importing Data Sets
In this module, you will learn how to understand data and learn about how to use the libraries in Python to help you import data from multiple sources. You will then learn how to perform some basic tasks to start exploring and analyzing the imported data set.
Read more
Data Wrangling
In this module, you will learn how to perform some fundamental data wrangling tasks that, together, form the pre-processing phase of data analysis. These tasks include handling missing values in data, formatting data to standardize it and make it consistent, normalizing data, grouping data values into bins, and converting categorical variables into numerical quantitative variables.
Exploratory Data Analysis
In this module, you will learn what is meant by exploratory data analysis, and you will learn how to perform computations on the data to calculate basic descriptive statistical information, such as mean, median, mode, and quartile values, and use that information to better understand the distribution of the data. You will learn about putting your data into groups to help you visualize the data better, you will learn how to use the Pearson correlation method to compare two continuous numerical variables, and you will learn how to use the Chi-square test to find the association between two categorical variables and how to interpret them.
Model Development
In this module, you will learn how to define the explanatory variable and the response variable and understand the differences between the simple linear regression and multiple linear regression models. You will learn how to evaluate a model using visualization and learn about polynomial regression and pipelines. You will also learn how to interpret and use the R-squared and the mean square error measures to perform in-sample evaluations to numerically evaluate our model. And lastly, you will learn about prediction and decision making when determining if our model is correct.
Model Evaluation and Refinement
In this module, you will learn about the importance of model evaluation and discuss different data model refinement techniques. You will learn about model selection and how to identify overfitting and underfitting in a predictive model. You will also learn about using Ridge Regression to regularize and reduce standard errors to prevent overfitting a regression model and how to use the Grid Search method to tune the hyperparameters of an estimator.
Final Assignment
Congratulations! You have now completed all the modules for this course. In this last module, you will complete the final assignment that will be graded by your peers. In this final assignment, you will assume the role of a Data Analyst working at a real estate investment trust organization who wants to start investing in residential real estate. You will be given a dataset containing detailed information about house prices in the region based on a number of property features, and it will be your job to analyze and predict the market price of houses given that information.

Good to know

Know what's good
, what to watch for
, and possible dealbreakers
Helps learners understand and use Pandas, Numpy, scikit-learn, and scipy to load, analyze, manipulate, and visualize data sets
Provides a strong foundation for learners interested in data analysis, machine learning, and data science
Taught by highly experienced Joseph Santarcangelo
Covers essential topics in data analysis, including data wrangling, exploratory data analysis, model development, and model evaluation
Includes hands-on labs and projects to help learners practice their skills
Offers an IBM digital badge upon completion

Save this course

Save Data Analysis with Python to your list so you can find it easily later:
Save

Reviews summary

Data analysis with python

learners say this course is a well-structured and comprehensive introduction to data analysis using Python. The course begins with the basics of Python and data analysis, and gradually introduces more advanced topics such as data visualization, machine learning, and statistical modeling. The course is taught by experienced instructors who are knowledgeable and passionate about data analysis. The hands-on labs are a great way to practice the concepts learned in the videos, and the quizzes and assignments help to reinforce the material. Overall, this course is a great choice for anyone who wants to learn more about data analysis using Python.
The hands-on exercises were really good and I learnt a lot of things from this wonderful course.
"The hands-on exercises were really good and I learnt a lot of things from this wonderful course."
I got a lot in this course, the teaching way is quite good with animation and real life based example.
"I got a lot in this course, the teaching way is quite good with animation and real life based example."
Assignments and exercises made us quite interactive and I really enjoyed that.
"Assignments and exercises made us quite interactive and I really enjoyed that."
This course covered a lot of great material.
"This course covered a lot of great material."
This course was quite good until Week 3 but after that it was poorly structured.
"This course was quite good until Week 3 but after that it was poorly structured."
There were some technical issues with the labs at times, but there's always the possibility to look up the blurry parts online.
"There were some technical issues with the labs at times, but there's always the possibility to look up the blurry parts online."
This course helped me a lot in solving my basics about data cleaning, Visualization, Techniques for getting better result and most important how we can judge whether a model is good or not.
"This course helped me a lot in solving my basics about data cleaning, Visualization, Techniques for getting better result and most important how we can judge whether a model is good or not."
The final project left out some higher cross-validation methods like Grid search and model comparison.
"The final project left out some higher cross-validation methods like Grid search and model comparison."

Activities

Be better prepared before your course. Deepen your understanding during and after it. Supplement your coursework and achieve mastery of the topics covered in Data Analysis with Python with these activities:
Review Python basics
Refreshes your understanding of Python's basic syntax and concepts, making it easier to follow along with the course material.
Browse courses on Python
Show steps
  • Review Python data types, variables, and operators.
  • Practice writing simple Python programs using loops and conditional statements.
Read 'Python for Data Analysis' by Wes McKinney
Provides a comprehensive reference and guide to Python for data analysis, complementing the course material and deepening your understanding of the subject.
Show steps
  • Read through the book, focusing on topics relevant to the course.
  • Take notes and highlight important concepts.
Follow tutorials on data wrangling with Pandas
Provides practical experience in cleaning and preparing data for analysis, which is essential for the data analysis tasks in the course.
Browse courses on Data Wrangling
Show steps
  • Find tutorials on Pandas data wrangling techniques.
  • Follow along with the tutorials, practicing the techniques on sample datasets.
  • Apply the techniques to clean and prepare a dataset of your own.
Five other activities
Expand to see all activities and additional details
Show all eight activities
Participate in peer study groups
Encourages collaboration and sharing of knowledge, which can deepen your understanding of the course material and improve your problem-solving skills.
Show steps
  • Join or form a study group with classmates.
  • Meet regularly to discuss course topics, solve problems together, and quiz each other.
Solve practice problems on model evaluation metrics
Strengthens your understanding of how to evaluate the performance of regression models, which is crucial for building and refining models in the course.
Browse courses on Model Evaluation
Show steps
  • Find practice problems or exercises on model evaluation metrics.
  • Solve the problems, calculating metrics such as R-squared, mean squared error, and MAE.
  • Interpret the results and draw conclusions about model performance.
Follow tutorials on machine learning with scikit-learn
Provides practical experience in using scikit-learn for machine learning tasks, enhancing your understanding of the algorithms and techniques covered in the course.
Browse courses on Machine Learning
Show steps
  • Find tutorials on scikit-learn for regression and classification tasks.
  • Follow along with the tutorials, building and evaluating machine learning models.
  • Apply the techniques to solve real-world machine learning problems.
Build a data pipeline for a real-world dataset
Provides hands-on experience in designing and implementing data pipelines, which is a key skill for data scientists and analysts.
Browse courses on Data Pipelines
Show steps
  • Choose a real-world dataset and define the data processing tasks.
  • Use Python libraries such as Pandas, NumPy, and scikit-learn to build the data pipeline.
  • Validate and test the pipeline to ensure it meets the desired requirements.
  • Deploy the pipeline to automate the data processing tasks.
Develop a data analysis project on a topic of your interest
Provides an opportunity to apply the concepts and techniques learned in the course to a real-world problem, fostering your problem-solving and critical thinking skills.
Show steps
  • Identify a data analysis problem or topic that interests you.
  • Gather and clean the necessary data.
  • Analyze the data using techniques learned in the course.
  • Draw conclusions and present your findings.

Career center

Learners who complete Data Analysis with Python will develop knowledge and skills that may be useful to these careers:
Market Research Analyst
As a Market Research Analyst, you will use data to understand consumer behavior and trends. This course will help you build a foundation in data analysis and machine learning, which are essential skills for this role. You will learn how to import, clean, and analyze data, and how to build and evaluate machine learning models. This course will also help you develop the critical thinking and problem-solving skills that are necessary for success as a Market Research Analyst.
Machine Learning Engineer
As a Machine Learning Engineer, you will design and develop machine learning models. This course will help you build a foundation in data analysis and machine learning, which are essential skills for this role. You will learn how to import, clean, and analyze data, and how to build and evaluate machine learning models. This course will also help you develop the critical thinking and problem-solving skills that are necessary for success as a Machine Learning Engineer.
Business Analyst
As a Business Analyst, you will use data to make recommendations that improve business outcomes. This course will help you build a foundation in data analysis and machine learning, which are essential skills for this role. You will learn how to import, clean, and analyze data, and how to build and evaluate machine learning models. This course will also help you develop the critical thinking and problem-solving skills that are necessary for success as a Business Analyst.
Operations Research Analyst
As an Operations Research Analyst, you will use data to improve the efficiency and effectiveness of operations. This course will help you build a foundation in data analysis and machine learning, which are essential skills for this role. You will learn how to import, clean, and analyze data, and how to build and evaluate machine learning models. This course will also help you develop the critical thinking and problem-solving skills that are necessary for success as an Operations Research Analyst.
Statistician
As a Statistician, you will use data to make inferences about the world. This course will help you build a foundation in data analysis and machine learning, which are essential skills for this role. You will learn how to import, clean, and analyze data, and how to build and evaluate machine learning models. This course will also help you develop the critical thinking and problem-solving skills that are necessary for success as a Statistician.
Data Engineer
As a Data Engineer, you will build and maintain the infrastructure that stores and processes data. This course will help you build a foundation in data analysis and machine learning, which are essential skills for this role. You will learn how to import, clean, and analyze data, and how to build and evaluate machine learning models. This course will also help you develop the critical thinking and problem-solving skills that are necessary for success as a Data Engineer.
Data Visualization Specialist
As a Data Visualization Specialist, you will use data to create visual representations that communicate insights. This course will help you build a foundation in data analysis and machine learning, which are essential skills for this role. You will learn how to import, clean, and analyze data, and how to build and evaluate machine learning models. This course will also help you develop the critical thinking and problem-solving skills that are necessary for success as a Data Visualization Specialist.
Data Scientist
As a Data Scientist, you will use scientific methods to extract knowledge and insights from data. This course will help you build a foundation in data analysis and machine learning, which are essential skills for this role. You will learn how to import, clean, and analyze data, and how to build and evaluate machine learning models. This course will also help you develop the critical thinking and problem-solving skills that are necessary for success as a Data Scientist.
Data Analyst
As a Data Analyst, you will use data to solve problems and make informed decisions. This course will help you build a foundation in data analysis and machine learning, which are essential skills for this role. You will learn how to import, clean, and analyze data, and how to build and evaluate machine learning models. This course will also help you develop the critical thinking and problem-solving skills that are necessary for success as a Data Analyst.
Database Administrator
As a Database Administrator, you will be responsible for the maintenance and performance of databases. This course will help you build a foundation in data analysis and machine learning, which are essential skills for this role. You will learn how to import, clean, and analyze data, and how to build and evaluate machine learning models. This course will also help you develop the critical thinking and problem-solving skills that are necessary for success as a Database Administrator.
Financial Analyst
As a Financial Analyst, you will use data to make investment decisions. This course will help you build a foundation in data analysis and machine learning, which are essential skills for this role. You will learn how to import, clean, and analyze data, and how to build and evaluate machine learning models. This course will also help you develop the critical thinking and problem-solving skills that are necessary for success as a Financial Analyst.
Risk Analyst
As a Risk Analyst, you will use data to identify and assess risks. This course will help you build a foundation in data analysis and machine learning, which are essential skills for this role. You will learn how to import, clean, and analyze data, and how to build and evaluate machine learning models. This course will also help you develop the critical thinking and problem-solving skills that are necessary for success as a Risk Analyst.
Software Engineer
As a Software Engineer, you will be responsible for the design, development, and maintenance of software systems. This course will help you build a foundation in data analysis and machine learning, which are essential skills for this role. You will learn how to import, clean, and analyze data, and how to build and evaluate machine learning models. This course will also help you develop the critical thinking and problem-solving skills that are necessary for success as a Software Engineer.
Quantitative Analyst
As a Quantitative Analyst, you will use data to make investment decisions. This course will help you build a foundation in data analysis and machine learning, which are essential skills for this role. You will learn how to import, clean, and analyze data, and how to build and evaluate machine learning models. This course will also help you develop the critical thinking and problem-solving skills that are necessary for success as a Quantitative Analyst.
Actuary
As an Actuary, you will use data to assess risk and make financial decisions. This course will help you build a foundation in data analysis and machine learning, which are essential skills for this role. You will learn how to import, clean, and analyze data, and how to build and evaluate machine learning models. This course will also help you develop the critical thinking and problem-solving skills that are necessary for success as an Actuary.

Reading list

We've selected ten books that we think will supplement your learning. Use these to develop background knowledge, enrich your coursework, and gain a deeper understanding of the topics covered in Data Analysis with Python.
Provides a comprehensive overview of deep learning, covering topics such as convolutional neural networks, recurrent neural networks, and generative adversarial networks. It is written in a clear and concise style, with plenty of examples and exercises to help readers learn the material.
Provides a comprehensive overview of deep learning for natural language processing. It covers a wide range of topics, from word embeddings and sequence models to attention mechanisms and transformers.
Provides a comprehensive guide to data analysis with Python, covering topics such as data cleaning, wrangling, visualization, and modeling. It is written in a clear and concise style, with plenty of examples and exercises to help readers learn the material.
Provides a comprehensive overview of statistical learning methods, with a focus on sparsity. It covers a wide range of topics, from linear regression and logistic regression to decision trees and random forests.
Provides a comprehensive guide to data analysis with Python, covering topics such as data cleaning, wrangling, visualization, and modeling. It is written in a clear and concise style, with plenty of examples and exercises to help readers learn the material.
Provides a comprehensive overview of statistical methods for machine learning. It covers a wide range of topics, from linear regression and logistic regression to decision trees and random forests.
Provides a practical introduction to machine learning with Python, using the popular scikit-learn, Keras, and TensorFlow libraries. It covers a wide range of topics, from data preprocessing and model selection to model evaluation and deployment.
Provides a practical introduction to data science for business professionals. It covers a wide range of topics, from data collection and cleaning to data analysis and visualization.
Provides a gentle introduction to machine learning, covering topics such as linear regression, logistic regression, decision trees, and neural networks. It is written in a non-technical style, with plenty of examples and exercises to help readers learn the material.
Provides a gentle introduction to data science, covering topics such as data cleaning, wrangling, visualization, and modeling. It is written in a non-technical style, with plenty of examples and exercises to help readers learn the material.

Share

Help others find this course page by sharing it with your friends and followers:

Similar courses

Here are nine courses similar to Data Analysis with Python.
Master Regression and Feedforward Networks [2024]
Most relevant
Analyzing Data with Python
Most relevant
Automate Machine Learning Using Databricks AutoML
Most relevant
Complete Linear Regression Analysis in Python
Most relevant
Data Science and Machine Learning in Python: Linear models
Most relevant
Building Machine Learning Models in Python with scikit...
Most relevant
Linear Regression and Logistic Regression in Python
Most relevant
Linear Regression and Logistic Regression using R Studio
Most relevant
Machine Learning with Apache Spark
Most relevant
Our mission

OpenCourser helps millions of learners each year. People visit us to learn workspace skills, ace their exams, and nurture their curiosity.

Our extensive catalog contains over 50,000 courses and twice as many books. Browse by search, by topic, or even by career interests. We'll match you to the right resources quickly.

Find this site helpful? Tell a friend about us.

Affiliate disclosure

We're supported by our community of learners. When you purchase or subscribe to courses and programs or purchase books, we may earn a commission from our partners.

Your purchases help us maintain our catalog and keep our servers humming without ads.

Thank you for supporting OpenCourser.

© 2016 - 2024 OpenCourser