We may earn an affiliate commission when you visit our partners.
Pluralsight logo

Pandas Playbook

Manipulating Data

Reindert-Jan Ekker

Pandas is one of the most popular software packages for data analysis. This course focuses on the core functionalities of Pandas for data wrangling, teaching you how to tackle everyday tasks for a data analyst, or data scientist.

Read more

Pandas is one of the most popular software packages for data analysis. This course focuses on the core functionalities of Pandas for data wrangling, teaching you how to tackle everyday tasks for a data analyst, or data scientist.

Pandas is not just one of the most popular software packages for data analysis, it is also, without a doubt, the most convenient and fun way to work with your data. In this course, Pandas Playbook: Manipulating Data, you will cover the most important core functionalities of Pandas, focusing on the core functionalities of the two main Pandas classes: the DataFrame and the Series. First, you will take a look at a new dataset and try to get a feeling for it - how many rows and columns are there? What datatypes does it consist of? You will do some basic statistical exploration as well. Then, you'll focus on getting information out of your dataset. Basically, it's about asking the right questions and drilling down into your dataset. Finally, you will learn how to clean and transform your data. Here, you will see how to run Python functions against our data, including functions we write ourselves by using a very cool and powerful feature called groupby() - changing the structure of our columns and rows, and combining multiple dataframes into one. After watching this course, you will be ready for just about any data wrangling job that you might come across.

Enroll now

What's inside

Syllabus

Course Overview
Course Introduction
Exploring Data
Selecting, Filtering, and Sorting Data
Read more
Cleaning Data
Transforming Data

Good to know

Know what's good
, what to watch for
, and possible dealbreakers
Taught by Reindert-Jan Ekker, who has researched and developed foundational aspects of data engineering and scalable data science technologies
Introduces students to one of the most popular and convenient packages for data analysis
Covers essential aspects of data analysis and manipulation, including data exploration, cleaning, and transformation making it relevant to a wide range of academic, industry, and personal applications
Teaches the use of Python functions and the powerful groupby() feature for data manipulation
Provides a solid foundation in Pandas for beginners and intermediate learners and serves as preparation for more advanced data analysis courses

Save this course

Save Pandas Playbook: Manipulating Data to your list so you can find it easily later:
Save

Activities

Be better prepared before your course. Deepen your understanding during and after it. Supplement your coursework and achieve mastery of the topics covered in Pandas Playbook: Manipulating Data with these activities:
Review previous data analysis courses or materials
Take this course more confidently by revisiting previous data analysis courses or materials, refreshing your knowledge and building upon your existing foundation.
Browse courses on Data Analysis
Show steps
  • Gather your previous course materials or notes
  • Review key concepts and techniques
Review Introductions to Data Science by James E. Gentle
Review this foundational book to strengthen your understanding of data science concepts, which will aid you in comprehending the course material.
Show steps
  • Read the preface and first two chapters
  • Review the key terms and concepts introduced
Perform examples from Pandas documentation
Complete the examples and exercises found in the Pandas documentation to reinforce your understanding of its functionalities and improve your hands-on skills.
Browse courses on Pandas
Show steps
  • Open the Pandas documentation
  • Locate the section on DataFrames
  • Follow along with the examples and complete the exercises
Five other activities
Expand to see all activities and additional details
Show all eight activities
Complete tutorials on Pandas data manipulation
Seek out tutorials that provide step-by-step guidance on performing specific data manipulation tasks with Pandas, solidifying your knowledge and skills.
Browse courses on Pandas
Show steps
  • Search for tutorials on Pandas data manipulation
  • Follow along with the instructions and complete the exercises
Participate in a Pandas study group
Join a peer study group to discuss Pandas concepts, share knowledge, and work through problems together, fostering collaboration and deepening your understanding.
Browse courses on Pandas
Show steps
  • Find or create a study group
  • Meet regularly to discuss topics and work on exercises
Attend a Pandas workshop
Enroll in a workshop led by an experienced instructor to gain practical experience, ask questions, and enhance your Pandas skills in a collaborative setting.
Browse courses on Pandas
Show steps
  • Find and register for a Pandas workshop
  • Attend the workshop and actively participate in the activities
Create a data analysis project using Pandas
Apply your Pandas skills by working on a project that involves data cleaning, manipulation, and analysis, demonstrating your proficiency in using the library.
Browse courses on Pandas
Show steps
  • Define the project scope and objectives
  • Gather and clean the data
  • Manipulate and analyze the data using Pandas
  • Visualize and interpret the results
  • Write a report or presentation summarizing your findings
Contribute to Pandas documentation or examples
Deepen your understanding of Pandas and its capabilities by actively contributing to its documentation or examples, solidifying your knowledge and potentially benefiting the wider community.
Browse courses on Pandas
Show steps
  • Identify areas where you can contribute to the Pandas documentation or examples
  • Follow the contribution guidelines and submit your changes

Career center

Learners who complete Pandas Playbook: Manipulating Data will develop knowledge and skills that may be useful to these careers:
Data Analyst
Data Analysts use their knowledge of data-analysis tools and techniques to prepare and analyze data to identify trends and patterns. They present their findings to help businesses make better decisions. This course may be useful for aspiring Data Analysts, as it will give them hands-on experience with Pandas, one of the most popular software packages for data analysis. By learning how to use Pandas, Data Analysts can more easily wrangle data, perform exploratory data analysis, and generate insights from data.
Business Analyst
Business Analysts use data to identify opportunities and solve problems for businesses. They work closely with stakeholders to understand their needs and develop solutions that meet those needs. This course may be useful for aspiring Business Analysts, as it will give them hands-on experience with Pandas, a popular tool for data analysis. By learning how to use Pandas, Business Analysts can more easily gather, clean, and analyze data to identify insights that can help businesses make better decisions.
Data Engineer
Data Engineers design, build, and maintain the infrastructure that stores and processes data. They work closely with data scientists and other data professionals to ensure that data is available and usable for analysis. This course may be useful for aspiring Data Engineers, as it will give them hands-on experience with Pandas, a popular tool for data analysis. By learning how to use Pandas, Data Engineers can more easily wrangle data, perform exploratory data analysis, and generate insights from data.
Data Scientist
Data Scientists use data to build models that can predict future outcomes. They work closely with businesses to identify problems that can be solved using data science, and they develop and deploy models that can help businesses make better decisions. This course may be useful for aspiring Data Scientists, as it will give them hands-on experience with Pandas, a popular tool for data analysis. By learning how to use Pandas, Data Scientists can more easily wrangle data, perform exploratory data analysis, and generate insights from data.
Machine Learning Engineer
Machine Learning Engineers build and deploy machine learning models. They work closely with data scientists and other data professionals to ensure that models are accurate and reliable. This course may be useful for aspiring Machine Learning Engineers, as it will give them hands-on experience with Pandas, a popular tool for data analysis. By learning how to use Pandas, Machine Learning Engineers can more easily wrangle data, perform exploratory data analysis, and generate insights from data.
Software Engineer
Software Engineers design, develop, and maintain software applications. They work closely with businesses to identify needs and develop solutions that meet those needs. This course may be useful for aspiring Software Engineers, as it will give them hands-on experience with Pandas, a popular tool for data analysis. By learning how to use Pandas, Software Engineers can more easily wrangle data, perform exploratory data analysis, and generate insights from data.
Statistician
Statisticians collect, analyze, and interpret data. They work closely with businesses and organizations to help them make informed decisions. This course may be useful for aspiring Statisticians, as it will give them hands-on experience with Pandas, a popular tool for data analysis. By learning how to use Pandas, Statisticians can more easily wrangle data, perform exploratory data analysis, and generate insights from data.
Financial Analyst
Financial Analysts use data to evaluate investments and make recommendations to clients. They work closely with individuals and institutions to help them make informed financial decisions. This course may be useful for aspiring Financial Analysts, as it will give them hands-on experience with Pandas, a popular tool for data analysis. By learning how to use Pandas, Financial Analysts can more easily wrangle data, perform exploratory data analysis, and generate insights from data.
Market Researcher
Market Researchers collect and analyze data to understand consumer behavior. They work closely with businesses to help them develop and market products and services that meet the needs of consumers. This course may be useful for aspiring Market Researchers, as it will give them hands-on experience with Pandas, a popular tool for data analysis. By learning how to use Pandas, Market Researchers can more easily wrangle data, perform exploratory data analysis, and generate insights from data.
Operations Research Analyst
Operations Research Analysts use data to improve the efficiency and effectiveness of organizations. They work closely with businesses to identify problems and develop solutions that can help businesses achieve their goals. This course may be useful for aspiring Operations Research Analysts, as it will give them hands-on experience with Pandas, a popular tool for data analysis. By learning how to use Pandas, Operations Research Analysts can more easily wrangle data, perform exploratory data analysis, and generate insights from data.
Risk Analyst
Risk Analysts use data to identify and assess risks. They work closely with businesses and organizations to help them manage risks and make informed decisions. This course may be useful for aspiring Risk Analysts, as it will give them hands-on experience with Pandas, a popular tool for data analysis. By learning how to use Pandas, Risk Analysts can more easily wrangle data, perform exploratory data analysis, and generate insights from data.
Data Management Analyst
Data Management Analysts use data to improve the efficiency and effectiveness of data management processes. They work closely with businesses to identify problems and develop solutions that can help businesses manage their data more effectively. This course may be useful for aspiring Data Management Analysts, as it will give them hands-on experience with Pandas, a popular tool for data analysis. By learning how to use Pandas, Data Management Analysts can more easily wrangle data, perform exploratory data analysis, and generate insights from data.
Business Intelligence Analyst
Business Intelligence Analysts use data to identify and solve business problems. They work closely with businesses to help them make informed decisions and improve their performance. This course may be useful for aspiring Business Intelligence Analysts, as it will give them hands-on experience with Pandas, a popular tool for data analysis. By learning how to use Pandas, Business Intelligence Analysts can more easily wrangle data, perform exploratory data analysis, and generate insights from data.
Quantitative Analyst
Quantitative Analysts use data to develop and implement quantitative models. They work closely with businesses and organizations to help them make informed decisions and improve their performance. This course may be useful for aspiring Quantitative Analysts, as it will give them hands-on experience with Pandas, a popular tool for data analysis. By learning how to use Pandas, Quantitative Analysts can more easily wrangle data, perform exploratory data analysis, and generate insights from data.
Data Visualization Analyst
Data Visualization Analysts use data to create visualizations that communicate insights and trends. They work closely with businesses and organizations to help them make informed decisions and improve their performance. This course may be useful for aspiring Data Visualization Analysts, as it will give them hands-on experience with Pandas, a popular tool for data analysis. By learning how to use Pandas, Data Visualization Analysts can more easily wrangle data, perform exploratory data analysis, and generate insights from data.

Reading list

We've selected 14 books that we think will supplement your learning. Use these to develop background knowledge, enrich your coursework, and gain a deeper understanding of the topics covered in Pandas Playbook: Manipulating Data.
Provides a collection of recipes for solving common data analysis problems using Pandas. It covers topics such as data cleaning, data transformation, and data visualization. This book useful reference for those who are already familiar with Pandas.
Provides a comprehensive overview of the Python programming language, with a focus on data analysis. It covers topics such as data structures, data manipulation, data visualization, and machine learning. This book good starting point for those who are new to Python or data analysis.
Provides a comprehensive overview of data science using Python. It covers topics such as data cleaning, data analysis, machine learning, and deep learning. This book good starting point for those who are new to data science or Python.
Provides a comprehensive overview of machine learning using Python. It covers topics such as data preprocessing, model selection, and model evaluation. This book good starting point for those who are new to machine learning or Python.
Provides a comprehensive overview of deep learning. It covers topics such as neural networks, convolutional neural networks, and recurrent neural networks. This book good starting point for those who are new to deep learning.
Provides a comprehensive overview of data science from scratch. It covers topics such as data cleaning, data analysis, and machine learning. This book good starting point for those who are new to data science.
Provides a comprehensive overview of statistical learning. It covers topics such as supervised learning, unsupervised learning, and reinforcement learning. This book good starting point for those who are new to statistical learning.
Provides a comprehensive overview of statistical learning. It covers topics such as supervised learning, unsupervised learning, and reinforcement learning. This book good starting point for those who are new to statistical learning.
Provides a comprehensive overview of data mining using R. It covers topics such as data preprocessing, model selection, and model evaluation. This book good starting point for those who are new to data mining or R.
Provides a comprehensive overview of machine learning using Python. It covers topics such as data preprocessing, model selection, and model evaluation. This book good starting point for those who are new to machine learning or Python.
Provides a comprehensive overview of machine learning using Python. It covers topics such as data preprocessing, model selection, and model evaluation. This book good starting point for those who are new to machine learning or Python.
Provides a comprehensive overview of deep learning using Python. It covers topics such as neural networks, convolutional neural networks, and recurrent neural networks. This book good starting point for those who are new to deep learning or Python.
Provides a comprehensive overview of natural language processing using Python. It covers topics such as tokenization, stemming, and parsing. This book good starting point for those who are new to natural language processing or Python.

Share

Help others find this course page by sharing it with your friends and followers:

Similar courses

Our mission

OpenCourser helps millions of learners each year. People visit us to learn workspace skills, ace their exams, and nurture their curiosity.

Our extensive catalog contains over 50,000 courses and twice as many books. Browse by search, by topic, or even by career interests. We'll match you to the right resources quickly.

Find this site helpful? Tell a friend about us.

Affiliate disclosure

We're supported by our community of learners. When you purchase or subscribe to courses and programs or purchase books, we may earn a commission from our partners.

Your purchases help us maintain our catalog and keep our servers humming without ads.

Thank you for supporting OpenCourser.

© 2016 - 2024 OpenCourser