We may earn an affiliate commission when you visit our partners.
Loony Corn

Prerequisites: Working with HBase requires knowledge of Java

Taught by a team which includes 2 Stanford-educated, ex-Googlers and 2 ex-Flipkart Lead Analysts. This team has decades of practical experience in working with large-scale data processing jobs.

Relational Databases are so stuffy and old. Welcome to HBase - a database solution for a new age.

HBase: Do you feel like your relational database is not giving you the flexibility you need anymore? Column oriented storage, no fixed schema and low latency make HBase a great choice for the dynamically changing needs of your applications.

Read more

Prerequisites: Working with HBase requires knowledge of Java

Taught by a team which includes 2 Stanford-educated, ex-Googlers and 2 ex-Flipkart Lead Analysts. This team has decades of practical experience in working with large-scale data processing jobs.

Relational Databases are so stuffy and old. Welcome to HBase - a database solution for a new age.

HBase: Do you feel like your relational database is not giving you the flexibility you need anymore? Column oriented storage, no fixed schema and low latency make HBase a great choice for the dynamically changing needs of your applications.

What's Covered:

25 solved examples covering all aspects of working with data in HBaseCRUD operations in the shell and with the Java API, Filters, Counters, MapReduce

Implement your own notification service for a social network using HBase

HBase and it’s role in the Hadoop ecosystem, HBase architecture and what makes HBase different from RDBMS and other Hadoop technologies like Hive.

Enroll now

What's inside

Syllabus

You, This Course and Us

We start with an introduction to the course, the instructors and what lies ahead for you.

Source Code and PDFs
Introduction to HBase
Read more

Distributed computing is really exciting, but was a really hard problem before the Hadoop ecosystem evolved.

Let's get started by Installing HBase. Get a taste of whats to come, start HBase, create a table and insert some data. 

Please use the attached text file for copying any commands that are shown on the screen. 

HBase is a database built for the Hadoop ecosystem. Before we get there, let's get a quick understanding of the Hadoop ecosystem.

Hadoop has limitations that make HBase a necessary part of the Hadoop ecosystem

HBase is a database - but its quite different from a traditional RDBMS. Get a quick overview of the differences before we dig deeper in future videos. 

See how column oriented storage and denormalization make HBase's data model quite different from a traditional database. 

HBase supports a limited set of operations - Create, Read, Update and Delete

HBase is a database solution in the Hadoop ecosystem, so it seems is Hive. What makes them different? 

CRUD operations using the HBase Shell

If you are unfamiliar with softwares that require working with a shell/command line environment, this video will be helpful for you. It explains how to update the PATH environment variable, which is needed to set up most Linux/Mac shell based softwares. 

Example1 - Creating a table for User Notifications
Example 2 - Inserting a row
Example 3 - Updating a row
Example 4 - Retrieving a row
Example 5 - Retrieving a range of rows
Example 6 - Deleting a row
Example 7 - Deleting a table
CRUD operations using the Java API
Example 8 - Creating a table with HBaseAdmin
Example 9 - Inserting a row using a Put object
Example 10 - Inserting a list of Puts
Example 11 - Retrieving data - Get and Result objects
Example 12 - A list of Gets
Example 13 - Deleting a row
Example 14 - A list of Deletes
Example 15 - Mix and match with batch operations
Example 16 - Scanning a range of rows
Example 17 - Deleting a table
HBase Architecture
Advanced operations - Filters and Counters
Example 18 - Filter by Row id - RowFilter
Example 19 - Filter by column value - SingleColumnValueFilter
Example 20 - Apply multiple conditions - Filterlist
Example 21 - Retrieve rows within a time range
Example 22 - Atomically incrementing a value with Counters
MapReduce with HBase

Use MapReduce for complex data processing tasks. See how to set up a MapReduce task to count Notifications by type

Implement the MapReduce task to summarize Notifications by type in Java. 

See how to run a MapReduce task using the Java JAR.

Build a Notification Service

Set up a class hierarchy to represent different types of user notifications for a social network

Use a Notifications Manager to manage tasks such as adding notifications and retrieving notifications for a specific user

Installing Hadoop in a Local Environment

Hadoop has 3 different install modes - Standalone, Pseudo-distributed and Fully Distributed. Get an overview of when to use each

How to set up Hadoop in the standalone mode. Windows users need to install a Virtual Linux instance before this video. 

Set up Hadoop in the Pseudo-Distributed mode. All Hadoop services will be up and running! 

Hadoop is basically for Linux/Unix systems. If you are on Windows, you can set up a Linux Virtual Machine on your computer and use that for the install. 

Good to know

Know what's good
, what to watch for
, and possible dealbreakers
Covers CRUD operations in the shell and with the Java API, which are essential for interacting with HBase databases
Explores HBase architecture and its role in the Hadoop ecosystem, providing context for its use in big data environments
Requires knowledge of Java, which is a common language used in conjunction with HBase and the Hadoop ecosystem
Includes examples of using MapReduce with HBase, enabling learners to perform complex data processing tasks
Demonstrates building a notification service for a social network using HBase, offering a practical application of the technology
Includes instructions for installing Hadoop in a local environment, which may require additional setup for Windows users

Save this course

Save Learn by Example : HBase - The Hadoop Database to your list so you can find it easily later:
Save

Reviews summary

Practical introduction to hbase

According to learners, this course offers a largely positive introduction to HBase, particularly strong in its use of practical examples and clear explanations of core concepts like HBase architecture and its difference from traditional databases. The coverage of CRUD operations using the Java API is considered a valuable component. While generally well-received for its conceptual clarity and hands-on approach, some students encountered significant challenges with the environment setup, noting potential issues with outdated versions of Hadoop and dependencies. Overall, it serves as a solid foundation for those new to HBase.
Good starting point for newcomers.
"This course is an excellent introduction to HBase for someone who has never worked with it before."
"As a beginner to NoSQL and HBase, I found this course approachable."
"Provides a good overview to get you started with HBase concepts and basic operations."
Useful coverage of Java integration.
"The section on using the HBase Java API for CRUD and batch operations was particularly valuable for me as a developer."
"Examples for interacting with HBase programmatically using Java were well-done."
"Appreciated the coverage of how to integrate HBase into applications using the API."
Explains core HBase ideas effectively.
"The explanations regarding HBase architecture and its place in the Hadoop ecosystem were very clear and easy to follow."
"Understanding the differences between HBase, RDBMS, and Hive was well-explained and provided necessary context."
"The course provides a solid conceptual foundation for HBase."
"Good overview of column-oriented storage and denormalization."
Hands-on examples are highly valued.
"The examples are incredibly helpful and practical. Learning by doing with real-world scenarios makes a huge difference."
"I appreciated the step-by-step approach using examples for CRUD operations and filters. It made abstract concepts concrete."
"The hands-on demos and coding exercises reinforced my understanding of how to interact with HBase programmatically."
"The 'Learn by Example' format works very well for understanding HBase features."
Environment setup can be difficult or outdated.
"Setting up the local Hadoop and HBase environment was a major hurdle. Some instructions or required versions seem outdated."
"I spent more time troubleshooting the installation than actually learning HBase content."
"Windows users especially might struggle with the Linux VM setup needed for Hadoop."
"Dependencies and configuration issues made getting started frustrating for several learners."

Activities

Be better prepared before your course. Deepen your understanding during and after it. Supplement your coursework and achieve mastery of the topics covered in Learn by Example : HBase - The Hadoop Database with these activities:
Review Core Java Concepts
Strengthen your understanding of Java fundamentals, which are essential for working with the HBase Java API.
Show steps
  • Review data types, control flow, and object-oriented principles in Java.
  • Practice writing simple Java programs to reinforce your understanding.
Review: Hadoop: The Definitive Guide
Gain a broader understanding of the Hadoop ecosystem and HBase's role within it.
Show steps
  • Read the sections on HDFS and MapReduce.
  • Understand how HBase interacts with these components.
Review: HBase: The Definitive Guide
Deepen your understanding of HBase architecture and data modeling by studying a comprehensive guide.
Show steps
  • Read the chapters on HBase architecture and data modeling.
  • Take notes on key concepts and best practices.
Four other activities
Expand to see all activities and additional details
Show all seven activities
Practice HBase Shell Commands
Reinforce your understanding of HBase shell commands through repetitive practice.
Show steps
  • Set up an HBase environment.
  • Create, read, update, and delete data using the HBase shell.
  • Practice filtering and scanning data using different shell commands.
Write a Blog Post on HBase Use Cases
Deepen your understanding by researching and explaining real-world use cases for HBase.
Show steps
  • Research different industries and applications where HBase is used.
  • Write a blog post summarizing the use cases and benefits of HBase.
  • Publish the blog post on a platform like Medium or LinkedIn.
Build a Simple HBase Application
Solidify your knowledge by building a practical application that uses HBase to store and retrieve data.
Show steps
  • Design a simple application that can benefit from HBase's features.
  • Implement the application using the HBase Java API.
  • Test and debug the application thoroughly.
Contribute to HBase Documentation
Solidify your understanding by contributing to the HBase open-source project.
Show steps
  • Identify areas in the HBase documentation that need improvement.
  • Submit pull requests with your proposed changes.
  • Address feedback from the HBase community.

Career center

Learners who complete Learn by Example : HBase - The Hadoop Database will develop knowledge and skills that may be useful to these careers:
Hadoop Developer
A Hadoop developer focuses on building and maintaining applications within the Hadoop ecosystem. This course centers around HBase, which holds an important role as a database solution alongside Hadoop. For a Hadoop developer, understanding HBase is crucial for working with large datasets that require low latency and flexible schemas. The course's coverage of HBase architecture, CRUD operations, and MapReduce integration helps build a foundation in working with HBase within the Hadoop environment. The implementation of a notification service using HBase is also relevant for practical application development.
Big Data Engineer
A big data engineer develops, maintains, and tests big data infrastructures. This course, centered around HBase, a database solution tailored for the Hadoop ecosystem, can help those who want to become big data engineers. As a big data engineer, familiarity with HBase is crucial for managing large datasets with low latency and flexible schemas. The course's coverage of HBase architecture, CRUD operations, and MapReduce integration provides a solid foundation for designing and implementing data pipelines. By implementing a notification service in HBase, a big data engineer can better understand real-world applications of the technology.
Database Administrator
A database administrator ensures the performance, integrity, and security of databases. This course focuses on HBase, a NoSQL database, broadening a database administrator's skill set beyond traditional relational database management systems. Knowing how HBase works, its architecture, and its role in the Hadoop ecosystem allows a database administrator to manage big data solutions effectively. The HBase database administrator can use the CRUD operations and Java API examples covered in this course for practical implementation. Furthermore, the course's exploration of filters and counters can help with optimizing data retrieval and manipulation.
Software Developer
A software developer designs, develops, and tests software applications. This course on HBase introduces developers to NoSQL databases, expanding their capabilities beyond relational databases. Software Developer knowledge of HBase's column-oriented storage, schema flexibility, and low latency characteristics can inform the design of high-performance applications. The Software Developer may find the course's exploration of CRUD operations with the Java API, filters, and counters, and the implementation of a notification service, provide practical skills for building data-intensive applications.
Backend Engineer
Backend engineers build and maintain the server-side logic and databases that power applications. This course on HBase helps backend engineers learn about NoSQL databases and their role in modern application architectures. Understanding HBase's architecture, its differences from relational databases, and practical examples of CRUD operations can enhance the ability to design scalable and efficient data storage solutions. The implementation of a notification service using HBase is particularly relevant for building real-time applications.
Data Architect
A data architect designs and oversees the implementation of data management systems. This course helps data architects gain insights into NoSQL databases like HBase. Understanding HBase's architecture, its differences from relational databases, and its integration with the Hadoop ecosystem is vital for designing scalable and efficient data solutions. This course may be useful because the examples provided, including CRUD operations and the implementation of a notification service, offer practical knowledge to data architects. This enables them to make informed decisions about incorporating HBase into their data architectures.
Solutions Architect
A solutions architect designs and implements technology solutions to business problems. This course, focused on HBase, a database solution within the Hadoop ecosystem, is useful to a solutions architect. Gaining familiarity with HBase allows them to incorporate NoSQL databases into their solution designs when appropriate. The knowledge of HBase architecture, its differences from traditional RDBMS, and its integration with Hadoop can help develop scalable and efficient solutions. The course's practical examples, such as CRUD operations and MapReduce integration, may provide insights into real-world implementations.
Data Modeler
A data modeler is responsible for creating and maintaining the models that define how data is organized and stored within an organization's systems. This course on HBase provides insights into NoSQL data modeling techniques, expanding a data modeler's toolkit beyond relational models. Data modelers may find the course's exploration of HBase's column-oriented storage and schema flexibility informative for designing efficient and scalable data models for big data applications. The examples covering CRUD operations can inform understanding of data manipulation within an HBase environment.
Data Warehouse Architect
A data warehouse architect designs and oversees the development of data warehousing solutions. This course on HBase helps expand the architect's knowledge beyond traditional relational data warehouses. Understanding HBase's column-oriented storage, schema flexibility, and integration with Hadoop can inform decisions about incorporating NoSQL databases into a data warehousing strategy. The data warehouse architect may find the course's coverage of HBase architecture, CRUD operations, and MapReduce useful in assessing the suitability of HBase for specific data warehousing scenarios.
Data Scientist
A data scientist analyzes and interprets complex data to identify trends and insights. While data scientists often use various tools for analysis, understanding the underlying data storage and retrieval mechanisms can be very helpful. Data scientists may find valuable information in this course because learning HBase and its role in the Hadoop ecosystem, data scientists can better appreciate how data is structured and accessed. They may find understanding CRUD operations, filters, and counters enhance their ability to work with large datasets stored in HBase. The MapReduce examples can also aid in performing complex data processing tasks.
Data Analyst
A data analyst examines data to identify trends, patterns, and insights that can improve decision-making. While data analysts may primarily use tools for analysis, understanding the underlying data storage mechanisms can be beneficial. This course on HBase may be useful because it helps data analysts gain insights into how data is stored and managed in NoSQL databases within the Hadoop ecosystem. They can learn about HBase architecture, CRUD operations, and how to use filters and counters to retrieve specific data, enhancing their ability to work with large datasets.
Cloud Architect
A cloud architect designs and manages cloud computing solutions. As cloud environments increasingly incorporate big data technologies, the cloud architect may find this course helpful. The cloud architect may find learning about HBase and its role in the Hadoop ecosystem enables more informed decisions about data storage and processing in the cloud. Practical examples of HBase operations and its integration with MapReduce may be useful when designing data-intensive cloud applications.
Technical Lead
A technical lead oversees a team of developers and guides the technical direction of projects. This course on HBase may be useful because it gives a technical lead familiarity with NoSQL databases and their application in big data environments. This course gives access to knowledge about HBase's role in the Hadoop ecosystem, how it differs from traditional relational databases, and its architecture. The examples provided in the course, such as CRUD operations and MapReduce integration, can help a technical lead make informed decisions about technology choices.
Application Architect
Application architects are responsible for the high-level design and structure of software applications. This course on HBase may be useful to application architects, offering insights into NoSQL database options for modern applications. By the end of this course, one will learn about how data is organized in the Hadoop ecosystem. The application architect may find the knowledge of HBase architecture informs their design decisions. The practical examples covering CRUD operations and the notification service may also be helpful.
Systems Engineer
A systems engineer is responsible for the design, implementation, and management of an organization's IT infrastructure. They may find a course focusing on HBase useful because it helps them understand how NoSQL databases fit into a broader IT landscape. Systems engineers may find the exploration of HBase architecture helps to design efficient and scalable systems that can handle large volumes of data. The course's practical examples, such as CRUD operations and integration with Hadoop, provide valuable insights into real-world applications of HBase.

Reading list

We've selected two books that we think will supplement your learning. Use these to develop background knowledge, enrich your coursework, and gain a deeper understanding of the topics covered in Learn by Example : HBase - The Hadoop Database.
Provides a comprehensive overview of HBase, covering architecture, data modeling, and administration. It serves as a valuable reference for understanding HBase concepts and best practices. It is commonly used by industry professionals working with HBase. Reading this book will significantly enhance your understanding of HBase beyond the course material.
Provides a comprehensive overview of the Hadoop ecosystem, including HDFS and MapReduce. It is helpful for understanding the context in which HBase operates. While not strictly necessary for the course, it provides valuable background knowledge. This book is commonly used as a textbook at academic institutions.

Share

Help others find this course page by sharing it with your friends and followers:

Similar courses

Similar courses are unavailable at this time. Please try again later.
Our mission

OpenCourser helps millions of learners each year. People visit us to learn workspace skills, ace their exams, and nurture their curiosity.

Our extensive catalog contains over 50,000 courses and twice as many books. Browse by search, by topic, or even by career interests. We'll match you to the right resources quickly.

Find this site helpful? Tell a friend about us.

Affiliate disclosure

We're supported by our community of learners. When you purchase or subscribe to courses and programs or purchase books, we may earn a commission from our partners.

Your purchases help us maintain our catalog and keep our servers humming without ads.

Thank you for supporting OpenCourser.

© 2016 - 2025 OpenCourser