Ian Cook and Glynn Durham

This Specialization teaches the essential skills for working with large-scale data using SQL.

Maybe you are new to SQL and you want to learn the basics. Or maybe you already have some experience using SQL to query smaller-scale data with relational databases. Either way, if you are interested in gaining the skills necessary to query big data with modern distributed SQL engines, this Specialization is for you.

Most courses that teach SQL focus on traditional relational databases. Today, though, more and more of the data being generated is too big to be stored there, and it’s growing too quickly to be stored efficiently in commercial data warehouses. Instead, it’s increasingly stored in distributed clusters and cloud storage. These data stores are cost-efficient and highly scalable.

To query these huge datasets in clusters and cloud storage, you need a newer breed of SQL engine: distributed query engines such as Hive, Impala, Presto, and Drill. These are open-source SQL engines capable of querying enormous datasets. This Specialization focuses on Hive and Impala, the most widely deployed of these query engines.

This Specialization is designed to provide excellent preparation for the Cloudera Certified Associate (CCA) Data Analyst certification exam. You can earn this certification credential by taking a hands-on practical exam using the same SQL engines that this Specialization teaches—Hive and Impala.

Help others find Specialization from Coursera by sharing it with your friends and followers:

What's inside

Three courses

Foundations for Big Data Analysis with SQL

In this course, you'll learn about using SQL for big data. You'll start with an overview of data, database systems, and SQL. Then you'll learn about the characteristics of big data and the SQL tools for working on big data platforms. You'll also install an exercise environment (a virtual machine) that is used throughout the Specialization's courses, and you'll have an opportunity to do some initial exploration of databases and tables in that environment.

Analyzing Big Data with SQL

In this course, you'll explore the SQL SELECT statement and its clauses. You'll learn to navigate databases, filter results, group and aggregate data, sort and limit results, and combine multiple tables. By the end, you'll be able to analyze big data with SQL.

Managing Big Data in Clusters and Cloud Storage

In this course, you'll learn how to manage big datasets, load them into clusters and cloud storage, and apply structure to the data. You'll learn how to choose the right data types, storage systems, and file formats based on your tools and performance needs.

Save this collection

Save Modern Big Data Analysis with SQL to your list so you can find it easily later:
© 2016 - 2024 OpenCourser