Save For Later

Data Engineering

Save For Later

Organizations have more data at their disposal today than ever before. The vast amount of data that organizations are capturing, along with their desire to extract meaningful insights is driving an urgent demand for Data Engineers.

Data Engineers play a fundamental role in harnessing data that enable organizations to apply business intelligence for making informed decisions. Today’s Data Engineers require a broad set of skills to develop and optimize data systems and make data available to the organization for analysis.

This Professional Certificate provides you the job-ready skills you will need to launch your career as an entry level data engineer.

Upon completing this Professional Certificate, you will have extensive knowledge and practical experience with cloud-based relational databases (RDBMS) and NoSQL data repositories, working with Python, Bash and SQL, processing big data with Apache Hadoop and Apache Spark, using ETL (extract, transform and load) tools, creating data pipelines, using Apache Kafka and Airflow, designing, populating, and querying data warehouses and utilizing business intelligence tools.

Within each course, you’ll gain practical experience with hands-on labs and projects for building your portfolio. In the final Capstone project, you’ll apply your knowledge and skills attained throughout this program and demonstrate your ability to perform as a Data Engineer.

This program does not require any prior data engineering or programming experience.

What you'll learn

  • Describe the core concepts, processes, tools and technologies in the field of data engineering.
  • Demonstrate your aptitude with RDBMS fundamentals including design & creation of databases, schemas, tables; DB administration, security & working with MySQL, PostgreSQL & IBM Db2.
  • Demonstrate your proficiency with SQL query language, SELECT, INSERT, UPDATE, DELETE statements, database functions, stored procs, working with multiple tables, JOINs, & transactions.
  • Explain NoSQL and big data concepts including practice with MongoDB, Cassandra, IBM Cloudant, Apache Hadoop, Apache Spark, SparkSQL, SparkML, Spark Streaming.
  • Describe ETL tools, data pipelines using Python, shell scripts with Linux, Apache Airflow and Apache Kafka.
  • Describe Data Lakes, Data Marts and Enterprise Data Warehouses (EDW) and design them using Star and Snowflake schemas.
  • Design and populate Data Warehouses and analyze their data with Business Intelligence (BI) tools like Cognos Analytics.

Read More

OpenCourser is an affiliate partner of edX and may earn a commission when you buy through our links.

From IBM via edX
Hours 168
Instructors Joseph Santarcangelo, Rav Ahuja, Lin Joyner, Rose Malcolm, Ramesh Sannareddy, Steve Ryan, Karthik Muthuraman, Aije Egwaikhide, Romeo Kienzler, Yan Luo, Jeff Grossman
Language English
Subjects Programming Data Science Business

Similar Courses

Sorted by relevance

Careers

An overview of related careers and their average salaries in the US. Bars indicate income percentile (33rd - 99th).

Business Intelligence Implementer $68k

Business Intelligence Generalist $68k

Business Intelligence Researcher $84k

Business Intelligence Contractor $86k

Advisor Business Intelligence $94k

Business Intelligence and Investigations $97k

Associate Business Intelligence $104k

Business Intelligence Architect 3 $109k

Business Intelligence Advisor $109k

Developer, Business Intelligence $121k

Business Intelligence Strategist $122k

IT Business Intelligence Developer $149k

Courses in this Professional Certificate

Listed in the order in which they should be taken

Starts Course Information

On Demand

Python Basics for Data Science

Please Note: Learners who successfully complete this IBM course can earn a skill badge —a detailed, verifiable and digital credential that profiles the knowledge and skills you’ve...

edX | IBM

Save

On Demand

SQL for Data Science

Please Note: Learners who successfully complete this IBM course can earn a skill badge — a detailed, verifiable and digital credential that profiles the knowledge and skills...

edX | IBM

Save

On Demand

SQL Concepts for Data Engineers

Please Note: Learners who successfully complete this IBM course can earn a skill badge — a detailed, verifiable and digital credential that profiles the knowledge and skills...

edX | IBM

Save

On Demand

Relational Database Basics

Please Note: Learners who successfully complete this IBM course can earn a skill badge — a detailed, verifiable and digital credential that profiles the knowledge and skills...

edX | IBM

Save

On Demand

Data Engineering Basics for Everyone

Please Note: Learners who successfully complete this IBM course can earn a skill badge — a detailed, verifiable and digital credential that profiles the knowledge and skills...

edX | IBM

Save

On Demand

Python for Data Engineering Project

Please Note: Learners who successfully complete this IBM course can earn a skill badge — a detailed, verifiable and digital credential that profiles the knowledge and skills...

edX | IBM

Save

On Demand

NoSQL Database Basics

This course will provide you with technical hands-on knowledge of NoSQL databases and Database-as-a-Service (DaaS) offerings. With the advent of Big Data and agile development...

edX | IBM

Save

On Demand

Big Data, Hadoop, and Spark Basics

Organizations need skilled, forward-thinking Big Data practitioners who can apply their business and technical skills to unstructured data such as tweets, posts, pictures, audio...

edX | IBM

Save

On Demand

Apache Spark for Data Engineering and Machine Learning

Apache® Spark™ is a fast, flexible, and developer-friendly open-source platform for large-scale SQL, batch processing, stream processing, and machine learning. Users can take...

edX | IBM

Save

On Demand

Linux Commands & Shell Scripting

This mini-course provides a practical introduction to commonly used Linux / UNIX shell commands and teaches you basics of Bash shell scripting to automate a variety of tasks. The...

edX | IBM

Save

On Demand

Building ETL and Data Pipelines with Bash, Airflow and Kafka

Well-designed and automated data pipelines and ETL processes are the foundation of a successful Business Intelligence platform. Defining your data workflows, pipelines and...

edX | IBM

Save

On Demand

Relational Database Administration (DBA)

Managing databases is a critical skill for Data Engineers and Database Administrators to ensure data is reliable, protected and easily accessible for organizations to make better...

edX | IBM

Save

On Demand

Data Warehousing and BI Analytics

Today’s businesses are investing significantly in capabilities to harness the massive amounts of data that fuel Business Intelligence (BI). Working knowledge of Data Warehouses...

edX | IBM

Save

On Demand

Data Engineering Capstone Project

In this Capstone you’ll demonstrate your ability to perform like a Data Engineer. Your mission is to design, implement, and manage a complete data and analytics platform...

edX | IBM

Save

edX

&

IBM

From IBM via edX
Hours 168
Instructors Joseph Santarcangelo, Rav Ahuja, Lin Joyner, Rose Malcolm, Ramesh Sannareddy, Steve Ryan, Karthik Muthuraman, Aije Egwaikhide, Romeo Kienzler, Yan Luo, Jeff Grossman
Language English
Subjects Programming Data Science Business

Careers

An overview of related careers and their average salaries in the US. Bars indicate income percentile (33rd - 99th).

Business Intelligence Implementer $68k

Business Intelligence Generalist $68k

Business Intelligence Researcher $84k

Business Intelligence Contractor $86k

Advisor Business Intelligence $94k

Business Intelligence and Investigations $97k

Associate Business Intelligence $104k

Business Intelligence Architect 3 $109k

Business Intelligence Advisor $109k

Developer, Business Intelligence $121k

Business Intelligence Strategist $122k

IT Business Intelligence Developer $149k

Similar Courses

Sorted by relevance