We may earn an affiliate commission when you visit our partners.
Course image
Sadie St. Lawrence, Brooke Wenig, Conor Murphy, Don Noxon, and Katrina Glaeser Poole

This Specialization is intended for a learner with no previous coding experience seeking to develop SQL query fluency. Through four progressively more difficult SQL projects with data science applications, you will cover topics such as SQL basics, data wrangling, SQL analysis, AB testing, distributed computing using Apache Spark, Delta Lake and more. These topics will prepare you to apply SQL creatively to analyze and explore data; demonstrate efficiency in writing queries; create data analysis datasets; conduct feature engineering, use SQL with other data analysis and machine learning toolsets; and use SQL with unstructured data sets.

Enroll now

Share

Help others find Specialization from Coursera by sharing it with your friends and followers:

What's inside

Four courses

SQL for Data Science

(0 hours)
As data collection has increased exponentially, the need for people skilled at using and interacting with data has also increased. This course is designed to give you a primer in the fundamentals of SQL and working with data so that you can begin analyzing it for data science purposes.

Data Wrangling, Analysis and AB Testing with SQL

(0 hours)
This course applies SQL skills to data science inquiry case studies. We'll learn to convert timestamps, perform optimal JOINs, clean data, and analyze data per segment. We'll also describe how to convert a query into a scheduled job, insert data into a date partition, and engineer a feature from raw data. These skills provide the framework for performing the analysis of an AB test.

Distributed Computing with Spark SQL

(0 hours)
This course teaches distributed computing using Apache Spark for students with SQL experience. Students will learn the fundamentals of data analysis using SQL on Spark, setting the foundation for advanced analytics at scale. The four modules cover Spark architecture, queries, optimization, and building reliable data pipelines.

SQL for Data Science Capstone Project

(0 hours)
Data science demands SQL skills. This course provides a foundation in applying SQL skills to analyze data and solve real business problems.

Learning objectives

  • U​se sql commands to filter, sort, & summarize data; manipulate strings, dates, & numerical data from different sources for analysis
  • A​ssess and create datasets to solve your business questions and problems using sql
  • U​se the collaborative databricks workspace and create an end-to-end pipeline that reads data, transforms it, and saves the result
  • ​develop a project proposal & select your data, perform statistical analysis & develop metrics, and p​resent your findings & make recommendations

Save this collection

Save Learn SQL Basics for Data Science to your list so you can find it easily later:
Save
Our mission

OpenCourser helps millions of learners each year. People visit us to learn workspace skills, ace their exams, and nurture their curiosity.

Our extensive catalog contains over 50,000 courses and twice as many books. Browse by search, by topic, or even by career interests. We'll match you to the right resources quickly.

Find this site helpful? Tell a friend about us.

Affiliate disclosure

We're supported by our community of learners. When you purchase or subscribe to courses and programs or purchase books, we may earn a commission from our partners.

Your purchases help us maintain our catalog and keep our servers humming without ads.

Thank you for supporting OpenCourser.

© 2016 - 2024 OpenCourser