Introduction to Spark SQL and DataFrames
Heads up! This course may be archived and/or unavailable.
Explore DataFrames, a widely used data structure in Apache Spark. DataFrames allow Spark developers to perform common data operations, such as filtering and aggregation, as well as advanced data analysis on large collections of distributed data. With the addition of Spark SQL, developers have access to an even more popular and powerful query language than the built-in DataFrames API. In this course, instructor Dan Sullivan shows how to perform basic operations—loading, filtering, and aggregating data in DataFrames—with the API and SQL, as well as more advanced techniques that are easily performed in SQL. In this section of the course, Dan explains how to join data, eliminate duplicates, and deal with null or NA values. The lessons conclude with three in-depth examples of using DataFrames for data science: exploratory data analysis, time series analysis, and machine learning.
Contents:
- Introduction
- 1. Introduction to Spark DataFrames
- 2. Installing Spark
- 3. Getting Started with Spark DataFrames
- 4. SQL for DataFrames
- 5. Data Analysis with Spark
- Conclusion
Get a Reminder
Rating | Not enough ratings |
---|---|
Length | 1h 53m |
Starts | On Demand (Start anytime) |
Cost | $29/month (Access to entire library- free trial available) |
From | LinkedIn Learning |
Instructor | Dan Sullivan |
Download Videos | Only via the LinkedIn Learning mobile app |
Language | English |
Subjects | Data Science |
Tags | SQL Data Management Apache Spark |
Get a Reminder
Similar Courses
Careers
An overview of related careers and their average salaries in the US. Bars indicate income percentile.
Institutional Research Specialist in Data Analysis $42k
Professional-Data Analysis - SQL $63k
Business and Data Analysis $67k
Data Management and Analysis Fellowship - CDC $68k
Data Analyst, Marketing & Analysis $68k
Senior Data Analyst, Marketing & Analysis $77k
Data Scientist (Social Network Analysis) $84k
Analyst, R&D IT and Data Analysis Lead $88k
Data Management and Analysis Tech. $94k
Senior Data Analysis - ITSM Analyst $101k
Senior Data Analysis Engineer u2013 Engineering Data Analysis $149k
Data Architect - Financial Planning and Analysis $156k
Write a review
Your opinion matters. Tell us what you think.
Please login to leave a review
Rating | Not enough ratings |
---|---|
Length | 1h 53m |
Starts | On Demand (Start anytime) |
Cost | $29/month (Access to entire library- free trial available) |
From | LinkedIn Learning |
Instructor | Dan Sullivan |
Download Videos | Only via the LinkedIn Learning mobile app |
Language | English |
Subjects | Data Science |
Tags | SQL Data Management Apache Spark |
Similar Courses
Sorted by relevance
Like this course?
Here's what to do next:
- Save this course for later
- Get more details from the course provider
- Enroll in this course