Fundamentals of Database Engineering from Udemy

What's inside

Learning objectives

Learn and understand acid properties
Database indexing
Database partitioning
Database replication
Database sharding
Database cursors
Concurrency control (optimistic, pessimistic)
B-trees in production database systems
Database system designs

Difference between database management system, database engine and embedded database
Database engines such as myisam, innodb, rocksdb, leveldb and more
Benefits of using one database engine over the other
Switching database engines with mysql
Database security
Homomorphic encryption
Show more
Show less

Learn and understand acid properties
Database indexing
Database partitioning
Database replication
Database sharding
Database cursors
Concurrency control (optimistic, pessimistic)
B-trees in production database systems
Database system designs
Difference between database management system, database engine and embedded database
Database engines such as myisam, innodb, rocksdb, leveldb and more
Benefits of using one database engine over the other
Switching database engines with mysql
Database security
Homomorphic encryption
Show more
Show less

Syllabus

Section dedicates to Course Updates and Welcome

Welcome to the Course

Course Note 1

Course Note 2

ACID which stands for Atomicity, consistency, isolation, and durability are four critical properties of relational database. I think any engineer working with a relational database like postgres, mysql, sqlserver oracle, should understand these properties.

In this course, we will go through the four properties and explain why each is critical to build and use a relational database successfully.

In this video we will demonstrate Atomicity, Isolation, Consistency and Durability on Postgres, fully practical example.

Answer the following questions about ACID properties in databases

This lecture details the inner working of database systems with regards to storage. It is a must watch to understand the difference between tables, pages, IO, rows, indexes and data files.

In this lecture I will discuss the difference between Primary Key and a Secondary Key and how it can affect your performance.

Lots of you asked me how to create a table with millions of rows in postgres, here are the details

In this video, I explain the benefits of Bitmap Index Scan and how it differs from Index scan and table sequential scan.

If you create an index on a large production table in postgres, the operations blocks writes in order to make sure to pull all the field entries to the index. However most of the time you can't afford to block writes on an active production database table. Postgres new feature which allows create index concurrently allows writes and reads to go in the expense of cpu/memory, time and chance for the index to be invalid. A small price to pay for fast production writes! https://www.postgresql.org/docs/9.1/sql-createindex.html#SQL-CREATEINDEX-CONCURRENTLY

B-tree is a self-balancing tree data structure that maintains sorted data and allows searches, sequential access, insertions, and deletions in logarithmic time. However, most contents explain this data structure from a theoretical point of view, in this lecture I’d like to shed some light on the practical considerations of B-Tree and B+Trees in real production database systems such as Postgres and MySQL.

Link to the original paper https://infolab.usc.edu/csci585/Spring2010/den_ar/indexing.pdf

B-Tree limitation

Assume you have a table that is partitioned on the customer_id field serial 32bit, and you want to partition by range, how do you create all the necessary partitions? this is what I discuss in this video

Source Code

https://github.com/hnasr/javascript_playground/tree/master/automate_partitions

In this lecture we explain the difference between exclusive (write locks) and shared locks (read locks)

In this video, I demonstrate how is it possible to get double booking in database-backed web applications and how to prevent double booking and race conditions with row-level locks.

Source Code https://github.com/hnasr/javascript_playground/tree/master/booking-system

In this video I’ll explain why you should avoid using SQL offset when implementing any kind of paging. I’ll explain what offset does, why is it slow and what is the alternative for better performance This video is inspired by Use the index luke, i’ll have a link to the blog and slides to learn more. Let say you have a web application with an API that supports paging, you user want to request 10 news articles in page 10, this is performed via a simple GET request as shown here The API server receives the GET request and builds the SQL in order to send it to the database hopefully a pool of connections exist here. Page 10 translates to offset 100 assuming each page has 10 records and now the database is ready to execute the query against the table. Offset by design means fetch and drop the first x number of rows, so in this case the database will fetch the first 110 rows and physically drop the first 100 leaving the limit of 10 which the user will get. As the offset increase, the database is doing more work which makes this operation extremely expensive. Furthermore, the problem with offset is you might accidentally read duplicate records. consider the user now want to read page 11 and meanwhile someone inserted a new row in the table, row 111 will be read twice Let us jump and test this against postgres

Use the Index Luke Blog https://use-the-index-luke.com/no-offset

Slides in this video https://payhip.com/b/B6o1

Connection pooling is a pattern of creating a pool of available connections (usually TCP) and allow multiple clients to share the same pool of connections. This pattern is usually used when connection establishment and tearing down is costly, and the server has a limited number of connections. In this video we will learn how to use connection pooling in NodeJs when working with a Postgres Database, we will learn how to spin up a pool of database connections and use stateless pool queries and transactional queries begin/end, and finally, we will

Node JS Source Code used in this lecture here https://github.com/hnasr/javascript_playground/tree/master/postgresnode-pool

Scripts and commands

docker run --name pgmaster -v /Users/HusseinNasser/postgres/v/master_data:/var/lib/postgresql/data -p 5432:5432 -e POSTGRES_PASSWORD=postgres -d postgres

docker run --name pgstandby -v /Users/HusseinNasser/postgres/v/standby_data:/var/lib/postgresql/data -p 5433:5432 -e POSTGRES_PASSWORD=postgres -d postgres

In standby node update postgresql.conf

primary_conninfo = 'application_name=standby host=husseinmac port=5432 user=postgres password=postgres’

add file standby.signal

touch standby.signal

In master update postgresql.conf

first 1 (standby1)

select * from pg_stat_replication

Scripts and commands

docker run --name pgmaster -v /Users/HusseinNasser/postgres/v/master_data:/var/lib/postgresql/data -p 5432:5432 -e POSTGRES_PASSWORD=postgres -d postgres

docker run --name pgstandby -v /Users/HusseinNasser/postgres/v/standby_data:/var/lib/postgresql/data -p 5433:5432 -e POSTGRES_PASSWORD=postgres -d postgres

In standby node update postgresql.conf

primary_conninfo = 'application_name=standby host=husseinmac port=5432 user=postgres password=postgres’

add file standby.signal

touch standby.signal

In master update postgresql.conf

first 1 (standby1)

select * from pg_stat_replication

If using the hostname doesn't work, use the IP address of the container itself. You can get the local IP address of the container by running docker inspect container name

We got through a practical system design exercises, this lecture is two parts. Part 1 is all about backend engineering and scaling and Part 2 focuses on database design.

Database engines or storage engines or sometimes even called embedded databases is software library that a database management software uses to store data on disk and do CRUD (create update delete)

Resources
https://youtu.be/V_C-T5S-w8g

https://mariadb.com/kb/en/library/changes-improvements-in-mariadb-102/

https://mariadb.com/kb/en/library/why-does-mariadb-102-use-innodb-instead-of-xtradb/

https://github.com/facebook/rocksdb/wiki/Features-Not-in-LevelDB

https://mariadb.com/kb/en/library/aria-storage-engine/

https://dev.mysql.com/doc/refman/8.0/en/innodb-index-types.html https://eng.uber.com/mysql-migration/

Traffic lights

Read about what's good

what should give you pause

and possible dealbreakers

Develops skills, knowledge, and tools that are core skills for database engineers

This is a multi-modal course including a mix of videos, readings, and discussions

Develops skills, knowledge, and tools that are highly relevant to industry

Develops a strong foundation for beginners

Taught by instructors with recognized expertise in database engineering

Covers unique perspectives and ideas that may add color to other topics and subjects

Activities

Be better prepared before your course. Deepen your understanding during and after it. Supplement your coursework and achieve mastery of the topics covered in Fundamentals of Database Engineering with these activities:

Read DBMS Fundamentals

Show steps

Solidify knowledge of database fundamentals, including data models, query languages, and transaction management concepts.

View Database Systems: The Complete Book on Amazon

Show steps

Read Chapters 1-5 to grasp the basic concepts of DBMS.
Work through the practice exercises at the end of each chapter to reinforce your understanding.
Summarize the key takeaways from each chapter in your own words to improve retention.

Discussion Forum Participation

Show steps

Engage with fellow students to clarify concepts, share perspectives, and enhance understanding.

Show steps

Regularly participate in the online discussion forums.
Post thoughtful questions and responses to foster discussion.

Indexing Strategies Tutorial

Show steps

Enhance your understanding of indexing techniques to improve database performance.

Browse courses on Database Indexing

Show steps

Review the official documentation for PostgreSQL or MySQL on indexing.
Follow a tutorial on best practices for creating and managing indexes.
Experiment with different indexing strategies on a practice database.

Three other activities

Expand to see all activities and additional details

Show all six activities

SQL Practice Problems

Show steps

Develop proficiency in writing SQL queries, a fundamental skill for database engineers.

Browse courses on SQL

Show steps

Visit LeetCode or HackerRank for SQL practice problems.
Solve at least 10 problems to gain comfort with basic SQL syntax and operations.

Database Design Project

Show steps

Apply database design principles and create a schema for a real-world application to solidify your understanding.

Browse courses on Database Design

Show steps

Identify a use case or problem that requires a database solution.
Sketch out an entity-relationship diagram to model the data relationships.
Create a database schema based on the ER diagram, including tables, columns, and constraints.
Implement the database schema using a DBMS (e.g., PostgreSQL, MySQL).

Contribute to a Database Project

Show steps

Gain practical experience and deepen your understanding by contributing to an open-source database project.

Browse courses on Open Source

Show steps

Identify an active open-source database project on GitHub or GitLab.
Review the project's documentation and find an area where you can contribute.
Submit a pull request with your proposed changes or improvements.

Career center

Learners who complete Fundamentals of Database Engineering will develop knowledge and skills that may be useful to these careers:

Database Administrator

A Database Administrator is responsible for the day-to-day maintenance and configuration of databases. This course can help you learn the fundamentals of database engineering, which is essential for success in this role. You will learn about database indexing, partitioning, replication, and sharding. You will also learn about database security and how to protect your data from unauthorized access.

See salaries and explore the career path for Database Administrator

Database Engineer

A Database Engineer is responsible for designing, developing, and maintaining databases. This course can help you learn the fundamentals of database engineering, which is essential for success in this role. You will learn about database indexing, partitioning, replication, and sharding. You will also learn about database security and how to protect your data from unauthorized access.

See salaries and explore the career path for Database Engineer

Data Analyst

A Data Analyst is responsible for analyzing data to identify trends and patterns. This course can help you learn the fundamentals of database engineering, which is essential for success in this role. You will learn about database indexing, partitioning, replication, and sharding. You will also learn about database security and how to protect your data from unauthorized access.

See salaries and explore the career path for Data Analyst

Data Scientist

A Data Scientist is responsible for developing and applying statistical and machine learning models to data. This course can help you learn the fundamentals of database engineering, which is essential for success in this role. You will learn about database indexing, partitioning, replication, and sharding. You will also learn about database security and how to protect your data from unauthorized access.

See salaries and explore the career path for Data Scientist

Software Engineer

A Software Engineer is responsible for designing, developing, and maintaining software applications. This course can help you learn the fundamentals of database engineering, which is essential for success in this role. You will learn about database indexing, partitioning, replication, and sharding. You will also learn about database security and how to protect your data from unauthorized access.

See salaries and explore the career path for Software Engineer

Systems Engineer

A Systems Engineer is responsible for designing, developing, and maintaining computer systems. This course can help you learn the fundamentals of database engineering, which is essential for success in this role. You will learn about database indexing, partitioning, replication, and sharding. You will also learn about database security and how to protect your data from unauthorized access.

See salaries and explore the career path for Systems Engineer

Technical Writer

A Technical Writer is responsible for writing technical documentation, such as user manuals, white papers, and training materials. This course can help you learn the fundamentals of database engineering, which is essential for success in this role. You will learn about database indexing, partitioning, replication, and sharding. You will also learn about database security and how to protect your data from unauthorized access.

See salaries and explore the career path for Technical Writer

Product Manager

A Product Manager is responsible for managing the development and launch of new products. This course can help you learn the fundamentals of database engineering, which is essential for success in this role. You will learn about database indexing, partitioning, replication, and sharding. You will also learn about database security and how to protect your data from unauthorized access.

See salaries and explore the career path for Product Manager

Consultant

A Consultant is responsible for providing advice and guidance to clients on a variety of topics. This course can help you learn the fundamentals of database engineering, which is essential for success in this role. You will learn about database indexing, partitioning, replication, and sharding. You will also learn about database security and how to protect your data from unauthorized access.

See salaries and explore the career path for Consultant

Teacher

A Teacher is responsible for teaching students about a variety of subjects. This course can help you learn the fundamentals of database engineering, which is essential for success in this role. You will learn about database indexing, partitioning, replication, and sharding. You will also learn about database security and how to protect your data from unauthorized access.

See salaries and explore the career path for Teacher

Researcher

A Researcher is responsible for conducting research on a variety of topics. This course can help you learn the fundamentals of database engineering, which is essential for success in this role. You will learn about database indexing, partitioning, replication, and sharding. You will also learn about database security and how to protect your data from unauthorized access.

See salaries and explore the career path for Researcher

Librarian

A Librarian is responsible for managing a library and providing access to information. This course can help you learn the fundamentals of database engineering, which is essential for success in this role. You will learn about database indexing, partitioning, replication, and sharding. You will also learn about database security and how to protect your data from unauthorized access.

See salaries and explore the career path for Librarian

Archivist

An Archivist is responsible for managing and preserving historical records. This course can help you learn the fundamentals of database engineering, which is essential for success in this role. You will learn about database indexing, partitioning, replication, and sharding. You will also learn about database security and how to protect your data from unauthorized access.

See salaries and explore the career path for Archivist

Museum curator

A Museum Curator is responsible for managing and preserving museum collections. This course can help you learn the fundamentals of database engineering, which is essential for success in this role. You will learn about database indexing, partitioning, replication, and sharding. You will also learn about database security and how to protect your data from unauthorized access.

See salaries and explore the career path for Museum curator

Historian

A Historian is responsible for studying and interpreting the past. This course can help you learn the fundamentals of database engineering, which is essential for success in this role. You will learn about database indexing, partitioning, replication, and sharding. You will also learn about database security and how to protect your data from unauthorized access.

See salaries and explore the career path for Historian

Fundamentals of Database Engineering

What's inside

Learning objectives

Syllabus

Traffic lights

Save this course

Activities

Career center

Reading list

Share

Similar courses