We may earn an affiliate commission when you visit our partners.
Youssef Hatem Salem
By the end of this project, you will have created your own data warehouse, modeled as a star schema, that holds raw baseball data gathered from multiple baseball leagues in the United States. Throughout the project, you will use Apache Hadoop and Apache Hive as the main tools to build the data warehouse, with HQL, which is very similar to SQL, as the query language. You will be introduced to the star schema modeling technique, database creation and handling, data insertion, and data storage using two different methods in Apache Hadoop. Data warehouses are viewed as the core of any application: any further step that works with the stored data, such as data analytics, building a recommendation system, or applying machine learning, inevitably leads back to the starting point, which is the data warehouse model. This guided project is for students who aim to start their careers in the broad field of data engineering and the specific field of data warehouse modeling. It will also interest people who are keen on exploring Big Data platforms such as Apache Hadoop and Apache Hive. Note: This project works best for learners who are based in the North America region. We're currently working on providing the same experience in other regions.

Good to know

Know what's good, what to watch for, and possible dealbreakers
Delves into data warehousing, a core concept in data analytics and engineering
Emphasizes practical application of Apache Hadoop and Apache Hive for data warehouse construction
Targeted towards aspiring data engineers and those interested in exploring Big Data platforms
Offers a hands-on approach to data warehouse modeling using the star schema technique
Note that this project currently works best for learners based in North America

Reviews summary

Data warehousing with Hive

This course introduces foundational principles of data warehousing using Apache Hive. Students will build a data warehouse using a star schema model with data from baseball leagues in the United States. The course is geared towards learners who are new to data engineering or data warehousing and would like to learn more about Apache Hive and Hadoop.
Instructor was knowledgeable.
"...and the instructor was very good."
Course content was well-structured.
"The course was well organized..."
Good introduction to Apache Hive basics.
"Great beginner course and introduction to Apache Hive and Apache Hadoop."
Would have liked to see more advanced content.
"I would have like to have seen some advanced queries in the course."
First section could provide better setup instructions.
"The first section is too vague to follow."
"The first section should explain how to start the platform and hive."

Activities

Be better prepared before your course. Deepen your understanding during and after it. Supplement your coursework and achieve mastery of the topics covered in Modeling Data Warehouses using Apache Hive with these activities:
Review SQL basics
Refresh your knowledge of SQL before starting the course.
  • Review SQL syntax and commands
  • Practice writing basic SQL queries
Create a data warehousing resource list
Gather and organize a collection of useful data warehousing resources.
  • Find and bookmark relevant articles, tutorials, and videos
  • Create a document or spreadsheet to organize the resources
  • Share the resource list with other learners
Attend a data warehousing Meetup
Connect with other data warehousing professionals and learn about industry trends.
  • Find a local data warehousing Meetup
  • Attend the Meetup and introduce yourself
  • Ask questions and learn from other attendees
Read 'Data Warehousing Fundamentals' by Ralph Kimball
Gain a foundational understanding of data warehousing concepts and best practices.
  • Read Chapter 1: Introduction
  • Read Chapter 2: The Data Warehouse Bus Architecture
  • Read Chapter 3: The Dimensional Data Model
Follow the Apache Hive tutorial
Get hands-on experience with Apache Hive by following a guided tutorial.
  • Set up a Hadoop and Hive environment
  • Create a Hive table
  • Load data into the Hive table
  • Query the Hive table using HiveQL
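The create, load, and query steps above can be sketched in a few lines. This is only an analogy, not Hive itself: it uses Python's built-in sqlite3 module as a stand-in because HQL is very similar to SQL, and the table name `players`, its columns, and the sample rows are all hypothetical. In Hive, bulk loads are usually done with a LOAD DATA statement rather than row-by-row inserts.

```python
import sqlite3

# Hypothetical sample data: (player_id, name, league). Not from the course.
SAMPLE_ROWS = [
    (1, "Player A", "AL"),
    (2, "Player B", "NL"),
    (3, "Player C", "AL"),
]


def build_players_table(rows):
    """Create an in-memory table, load the rows, and return the connection."""
    conn = sqlite3.connect(":memory:")
    # Step 1: create the table (SQLite stands in for Hive here).
    conn.execute(
        "CREATE TABLE players ("
        "player_id INTEGER PRIMARY KEY, name TEXT, league TEXT)"
    )
    # Step 2: load data into the table.
    conn.executemany("INSERT INTO players VALUES (?, ?, ?)", rows)
    conn.commit()
    return conn


def count_by_league(conn):
    """Step 3: query the table, counting players per league."""
    cursor = conn.execute(
        "SELECT league, COUNT(*) FROM players GROUP BY league ORDER BY league"
    )
    return cursor.fetchall()


if __name__ == "__main__":
    connection = build_players_table(SAMPLE_ROWS)
    print(count_by_league(connection))
```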
Build a personal data warehouse
Develop a practical understanding of data warehouse design and implementation by building your own project.
  • Gather data from different baseball leagues
  • Design a star schema data warehouse model
  • Create the data warehouse using HiveQL
  • Load data into the data warehouse
  • Query the data warehouse to generate insights
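As a rough illustration of the design, create, load, and query steps above, the sketch below builds a tiny star schema: one fact table surrounded by dimension tables that it references by key. It again uses Python's sqlite3 module as a stand-in for Hive, and every table name (`dim_team`, `dim_player`, `fact_batting`), column, and data row is a hypothetical example rather than the schema used in the course.

```python
import sqlite3

# Hypothetical star schema: two dimension tables and one fact table.
SCHEMA = """
CREATE TABLE dim_team (
    team_key INTEGER PRIMARY KEY, team_name TEXT, league TEXT
);
CREATE TABLE dim_player (
    player_key INTEGER PRIMARY KEY, player_name TEXT
);
CREATE TABLE fact_batting (
    player_key INTEGER REFERENCES dim_player(player_key),
    team_key INTEGER REFERENCES dim_team(team_key),
    season INTEGER,
    home_runs INTEGER
);
"""


def build_star_schema():
    """Create the tables, load made-up sample rows, return the connection."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.executemany(
        "INSERT INTO dim_team VALUES (?, ?, ?)",
        [(1, "Team X", "AL"), (2, "Team Y", "NL")],
    )
    conn.executemany(
        "INSERT INTO dim_player VALUES (?, ?)",
        [(10, "Player A"), (11, "Player B")],
    )
    # Each fact row holds a measure (home_runs) plus keys into the dimensions.
    conn.executemany(
        "INSERT INTO fact_batting VALUES (?, ?, ?, ?)",
        [(10, 1, 2020, 12), (11, 2, 2020, 7), (10, 1, 2021, 9)],
    )
    conn.commit()
    return conn


def home_runs_by_league(conn):
    """Join the fact table to a dimension and aggregate the measure."""
    query = """
        SELECT t.league, SUM(f.home_runs)
        FROM fact_batting AS f
        JOIN dim_team AS t ON t.team_key = f.team_key
        GROUP BY t.league
        ORDER BY t.league
    """
    return conn.execute(query).fetchall()


if __name__ == "__main__":
    print(home_runs_by_league(build_star_schema()))
```

The typical star schema query has the shape shown in `home_runs_by_league`: start from the fact table, join out to whichever dimensions supply the grouping attributes, and aggregate a numeric measure.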
Attend a data warehousing workshop
Gain in-depth knowledge of data warehousing technologies and best practices.
  • Find a local data warehousing workshop
  • Register for the workshop
  • Attend the workshop and participate in hands-on exercises
Solve data warehousing practice problems
Reinforce your understanding of data warehousing concepts by solving practice problems.
  • Design a star schema data warehouse model for a given business scenario
  • Write HiveQL queries to extract data from a star schema data warehouse
  • Optimize HiveQL queries for performance

Career center

Learners who complete Modeling Data Warehouses using Apache Hive will develop knowledge and skills that may be useful to these careers:
Data Engineer
Data Engineers use Apache Hadoop and Apache Hive to build and maintain data warehouses. This course can help you build a foundation in these tools by using them to create your own data warehouse.
Business Intelligence Analyst
Business Intelligence Analysts use data to help businesses make better decisions. This course can help you build a foundation in using Apache Hadoop and Apache Hive to query data and create reports.
Data Analyst
Data Analysts use data to make informed decisions. This course can help you build a foundation in using Apache Hadoop and Apache Hive to query data and create reports.
Data Scientist
Data Scientists use data to solve problems. This course can help you build a foundation in using Apache Hadoop and Apache Hive to query data and create models.
Data Warehouse Architect
Data Warehouse Architects design and build data warehouses. This course can help you build a foundation in the star schema modeling technique and in using Apache Hadoop and Apache Hive to create a data warehouse.
Database Administrator
Database Administrators manage databases. This course can help you build a foundation in using Apache Hadoop and Apache Hive to create and manage data warehouses.
Software Engineer
Software Engineers build software. This course can help you build a foundation in using Apache Hadoop and Apache Hive to create data warehouses.
Project Manager
Project Managers manage projects. This course can help you build a foundation in using Apache Hadoop and Apache Hive to create data warehouses.
Data Warehouse Manager
Data Warehouse Managers manage data warehouses. This course can help you build a foundation in using Apache Hadoop and Apache Hive to create data warehouses.
Data Architect
Data Architects design data systems. This course can help you build a foundation in using Apache Hadoop and Apache Hive to create data warehouses.
IT Manager
IT Managers manage IT systems. This course can help you build a foundation in using Apache Hadoop and Apache Hive to create data warehouses.
Systems Analyst
Systems Analysts analyze systems. This course can help you build a foundation in using Apache Hadoop and Apache Hive to create data warehouses.
Database Architect
Database Architects design databases. This course can help you build a foundation in using Apache Hadoop and Apache Hive to create data warehouses.
Information Systems Manager
Information Systems Managers manage information systems. This course can help you build a foundation in using Apache Hadoop and Apache Hive to create data warehouses.
Big Data Engineer
Big Data Engineers work with big data. This course can help you build a foundation in using Apache Hadoop and Apache Hive to create data warehouses.

Reading list

We've selected eight books that we think will supplement your learning. Use these to develop background knowledge, enrich your coursework, and gain a deeper understanding of the topics covered in Modeling Data Warehouses using Apache Hive.
Classic work on dimensional modeling, the most widely used data modeling technique for data warehouses. It provides a step-by-step guide to designing and building a data warehouse using dimensional modeling. This book is a must-read for anyone who wants to learn about data warehouse design.
An indispensable resource for anyone who wants to learn about dimensional modeling.
Provides a comprehensive overview of data warehousing using SQL Server. It covers topics such as data modeling, data integration, and data analysis. This book is a good choice for learners who want to learn about data warehousing using SQL Server.
Provides a comprehensive overview of the fundamental concepts and technologies used in data warehousing. It covers topics such as data warehouse architecture, data modeling, data integration, and data quality management. This book is a good starting point for learners who are new to data warehousing.
Provides a friendly and approachable introduction to Apache Hive. It covers topics such as data storage, data processing, and data analysis. This book is a good choice for learners who are new to Apache Hive.

Similar courses

Here are nine courses similar to Modeling Data Warehouses using Apache Hive.
SQL Big Data Convergence - The Big Picture (most relevant)
Introduction to Big Data with Spark and Hadoop (most relevant)
Data Engineering Essentials using SQL, Python, and PySpark (most relevant)
Big Data, Hadoop, and Spark Basics (most relevant)
Data Modeling, Transformation, and Serving (most relevant)
Data Warehouse - The Ultimate Guide
Master Big Data - Apache...
Developing Spark Applications Using Scala & Cloudera
Advanced Data Modeling

© 2016 - 2024 OpenCourser