Hadoop

Heads up! This course may be archived and/or unavailable.

Hadoop is an open-source framework for distributed storage and processing that data scientists use to run highly parallelized operations on big data. If you've explored Hadoop, you've probably discovered that it has many layers of complexity. Once you're comfortable with the fundamentals, you're ready to put additional frameworks and tool sets to use.

In this course, software engineer and data scientist Jack Dintruff goes beyond the basic capabilities of Hadoop. He demonstrates hands-on, project-based, practical skills for analyzing data, including how to use Pig to analyze large datasets and how to use Hive to manage large datasets in distributed storage. Learn how to configure the Hadoop distributed file system (HDFS), perform processing and ingestion using MapReduce, copy data from cluster to cluster, create data summarizations, and compose queries. Topics include:
  • Setting up and administering clusters
  • Ingesting data
  • Working with MapReduce, YARN, Pig, and Hive
  • Selecting and aggregating large datasets
  • Defining limits, unions, filters, and joins
  • Writing custom user-defined functions (UDFs)
  • Creating queries and lookups

Rating: Not enough ratings
Length: 39m 5s
Starts: On demand (start anytime)
Cost: $0/month (access to entire library; free trial available)
From: LinkedIn Learning
Instructor: Jack Dintruff
Download videos: Only via the LinkedIn Learning mobile app
Language: English
Subjects: IT & Networking, Data Science
Tags: IT, Big Data, Hadoop

Careers

An overview of related careers and their average salaries in the US.

Large Animal Technician $43k

Account Developer Relief (Large Store/Bulk) $55k

Large Loss Property Claims Adjuster $57k

Research Scientist--Large-scale machine learning $63k

Sales Manager - Large Business $78k

Scientist, Drug Product Development Large Molecule $94k

Account Coordinator - Large Group Sales Manager $109k

Technology Large Customer Sales Lead $132k

Regional Large Project Construction Sales Engineer at GE Manager $132k

Vice Senior President Coordinates Large Off-site Sales Events $133k

Large Enterprise BDM - CEB Legacy-CA $199k

Regional Account Executive, Large Customer Sales $210k
