We may earn an affiliate commission when you visit our partners.
Course image
Udemy logo

Introduction to Apache NiFi | Cloudera DataFlow - HDF 2.0

Stephane Maarek | AWS Certified Cloud Practitioner,Solutions Architect,Developer

Apache NiFi (Cloudera DataFlows - ex Hortonworks DataFlow) is an innovative technology to build data flows and solve your streaming challenges?

Read more

Apache NiFi (Cloudera DataFlows - ex Hortonworks DataFlow) is an innovative technology to build data flows and solve your streaming challenges?

In today's big data world, fast data is becoming increasingly important. Streaming data at scale and rapidly between all your systems should be centralised, automated and resilient to failure to ensure good delivery to your downstream systems.

With NiFi, you can build all your flows directly from a UI, no coding required, and at scale.

Apache NiFi initially used by the NSA so they could move data at scale and was then open sourced. Being such a hot technology, Onyara (the company behind it) was then acquired by Hortonworks, one of the main backers of the big data project Hadoop and then Hadoop Data Platform.

Apache NiFi is now used in many top organisations that want to harness the power of their fast data by sourcing and transferring information from and to their database and big data lakes. It is a key tool to learn for the analyst and data scientists alike. Its simplicity and drag and drop interface make it a breeze to use.

You can build streaming pipelines between Kafka and ElasticSearch, an FTP and MongoDB, and so much more. Your imagination is the limit

Quick Overview Of Course Content

This course will take you through an introduction of the Apache NiFi technology.

With a mix of theory lessons and hands-on labs, you'll get started and build your first data flows.

You will learn how to set up your connectors, processors, and how to read your FlowFiles to make most of what NiFi offer.

The most important configuration options will be demonstrated so you will be able to get started in no time.

We will also analyse a template picked from the web and understand how to debug your flows as well as route your data to different processors based on outcomes through relationships.

We will finally learn about the integrations between NiFi and Apache Kafka or MongoDB. Lots of learning ahead.

Why I should take this course?

  • With over 1.5 hours of videos and over 15 classes, you will get a great understand of Apache NiFi in no time.

  • You will learn how to install and configure Apache NiFi to get started

  • You will learn Apache NiFI Architecture and Core Concepts

  • The core concepts like FlowFile, FlowFile Processor, Connection, Flow Controller, Process Groups etc.

  • You will learn how to use Apache NiFi Efficiently to Stream Data using NiFi between different systems at scale

  • You will also understand how to monitor Apache NiFi

  • Integrations between Apache Kafka and Apache NiFi.

  • Questions can also be asked on the forum and instructor is keen to answer those in timely manner

Students Loved this course

Ashish Ranjan says “Great Course to get started with Nifi. Also, the instructor is very helpful and answers all your questions. I would highly recommend it. Great Job.” (Rated with 5 star)

Luca Costa says “It was very interesting and now I have an Idea how to start my project :) Thank you” (Rated with 5 star)

Aaron Gong says “Very clear and well instructed, first section is the most important, why use Nifi and for what purpose it is better suited for…” (Rated with 5 star)

I am sure that you will walk away with a great enterprise skill and start solving your streaming challenges.

Instructor

My name is Stephane Maarek, and I'll be your instructor in this course. I teach about Data Engineering and API, and throughout my career in designing and delivering these certifications and courses, I have already taught

With NiFi becoming much more than a buzzword out there, I've decided it's time for students to properly learn about Apache NiFi - Cloudera DataFlow - HDF 2.0. So, let’s kick start the course. You are in good hands.

This Course Also Comes With:

  • Lifetime Access to All Future Updates

  • A responsive instructor in the Q&A Section

  • Links to interesting articles, and lots of good code to base your next template onto

  • Udemy Certificate of Completion Ready for Download

  • A 30 Day "No Questions Asked" Money Back Guarantee.

I hope to see you inside the course.

Enroll now

What's inside

Learning objectives

  • Install and configure apache nifi
  • Design apache nifi architecture
  • Master core functionalities like flowfile, flowfile processor, connection, flow controller, process groups, etc.
  • Use nifi to stream data between different systems at scale
  • Monitor apache nifi
  • Integrate nifi with apache kafka
  • Integration nifi with mongodb

Syllabus

Introduction to NiFi and first concepts

Introduction to what Apache NiFi is, what it's good for and not good for, and what benefits it will provide you and your company

Read more
Slides Download
About your instructor

Introduction to the three most important concepts in Apache NiFi, the FlowFile, the Processor and the Connector

Apache NiFi basics
Users will be able to create a data pipeline, and learn about the UI
Pre-requisite: Java 8

Download and Install NiFi. Run on Windows, MacOS X or Linux. Install as a Service

Create your first processor GetFile and configure it

Create your second processor PutFile, connect it to your first processor and start your data flow

This lecture walks you through the different elements within the UI in Apache NiFi so you can start using the tool to its full potential

Theoretical lecture to introduce you to the variety of processors that are available to you in Apache NiFi

Another Flow example to Generate data that you will find useful for debugging your flows. 

Getting started with Apache NiFi
Start using Apache NiFi to its full potential!

Learn what templates are and how to import them

Create process groups and utilise them to export your flows as templates

Theoretical lecture about how FlowFiles are structured, with their content and attributes

Start manipulating FlowFile's content and attributes through the analysis of a template

Learn how attributes can be used in the expression language so that they can alter the content

Learn how to monitor your overall Apache NiFi state

Explore the data provenance menu to understand the FlowFile journey through your data flows

Learn about Relationships so that you can start routing data to different processors

Apache NiFi in depth
Annex lectures that I will create to help students

In this lecture I will go through how to set-up a NiFi flow so that JSON documents are written to a MongoDB database. The processor PutMongo will be used

Annex 2: Integration with Apache Kafka
Awesome discounts for all my other courses
THANK YOU!
Bonus Lecture: Student Special Coupons for my Other courses

Good to know

Know what's good
, what to watch for
, and possible dealbreakers
Explores Apache NiFi, which is standard in industry cloud data management and streaming data integration pipelines
Taught by Stephane Maarek, who is recognized for their work in Data Engineering and API design
Develops Apache NiFi technical skills and knowledge, which are core skills for cloud data engineers and data scientists
In-depth, comprehensive study of Apache NiFi, including hands-on labs and interacive materials
Access to lifetime of future updates to the course
Students can ask questions directly to the instructor through the forum

Save this course

Save Introduction to Apache NiFi | Cloudera DataFlow - HDF 2.0 to your list so you can find it easily later:
Save

Reviews summary

Well-received intro to apache nifi

Learners say this introductory course on Apache Nifi is well received. They especially appreciate the engaging assignments including hands-on examples. Students remark that the course gives them ideas for how to conduct further research on Nifi.
Learners remark that the course encourages further research.
"It gives the idea to further research and work on nifi."
Students especially like the hands-on examples.
"Especially the last section with examples along with hands-on is quite helpful."
Students find this a great introductory course to Apache Nifi.
"Its a very great introductory course for Nifi."

Activities

Be better prepared before your course. Deepen your understanding during and after it. Supplement your coursework and achieve mastery of the topics covered in Introduction to Apache NiFi | Cloudera DataFlow - HDF 2.0 with these activities:
Follow a tutorial on using NiFi for data streaming
This activity will help students learn how to use NiFi to stream data between different systems.
Browse courses on Data Streaming
Show steps
  • Find a tutorial on NiFi data streaming
  • Follow the steps in the tutorial
  • Experiment with different data sources and destinations
  • Troubleshoot any issues that may arise
Create a data pipeline in NiFi
This activity will help students solidify their understanding of NiFi architecture and how to use it to create data pipelines.
Browse courses on Data Pipeline
Show steps
  • Install and configure Apache NiFi
  • Create a data flow
  • Connect processors to create a pipeline
  • Start the data flow and monitor its progress
  • Troubleshoot any issues that may arise
Show all two activities

Career center

Learners who complete Introduction to Apache NiFi | Cloudera DataFlow - HDF 2.0 will develop knowledge and skills that may be useful to these careers:
Data Analyst
Data Analysts use their knowledge of data analysis tools and technologies to analyze data and identify trends. They use these insights to make informed decisions about products, services, and strategies. This course can help Data Analysts learn about Apache NiFi, a powerful tool for building data pipelines. Apache NiFi can be used to ingest, process, and transform data from a variety of sources, making it an essential tool for Data Analysts who need to build scalable and reliable data pipelines.
Data Scientist
Data Scientists use their knowledge of data science tools and technologies to extract insights from data. They use these insights to make informed decisions about products, services, and strategies. This course can help Data Scientists learn about Apache NiFi, a powerful tool for building data pipelines. Apache NiFi can be used to ingest, process, and transform data from a variety of sources, making it an essential tool for Data Scientists who need to build scalable and reliable data pipelines.
Data Engineer
Data Engineers are responsible for designing, building, testing, and maintaining big data systems. They use their knowledge of data engineering tools and technologies to solve complex data problems. This course can help Data Engineers learn about Apache NiFi, a powerful tool for building data pipelines. Apache NiFi can be used to ingest, process, and transform data from a variety of sources, making it an essential tool for Data Engineers who need to build scalable and reliable data pipelines.
Business Analyst
Business Analysts use their knowledge of business analysis tools and technologies to analyze business processes and identify opportunities for improvement. They use these insights to make informed decisions about products, services, and strategies. This course can help Business Analysts learn about Apache NiFi, a powerful tool for building data pipelines. Apache NiFi can be used to ingest, process, and transform data from a variety of sources, making it an essential tool for Business Analysts who need to build scalable and reliable data pipelines.
Cloud Architect
Cloud Architects use their knowledge of cloud computing tools and technologies to design and implement cloud-based solutions. They use these skills to build cloud-based solutions that meet the needs of users. This course can help Cloud Architects learn about Apache NiFi, a powerful tool for building data pipelines. Apache NiFi can be used to ingest, process, and transform data from a variety of sources, making it an essential tool for Cloud Architects who need to build scalable and reliable data pipelines.
Data Pipeline Engineer
Data Pipeline Engineers use their knowledge of data pipeline tools and technologies to design and implement data pipelines. They use these skills to build data pipelines that move data from one place to another. This course can help Data Pipeline Engineers learn about Apache NiFi, a powerful tool for building data pipelines. Apache NiFi can be used to ingest, process, and transform data from a variety of sources, making it an essential tool for Data Pipeline Engineers who need to build scalable and reliable data pipelines.
Data Architect
Data Architects use their knowledge of data architecture tools and technologies to design and implement data architectures. They use these skills to build data architectures that meet the needs of users. This course can help Data Architects learn about Apache NiFi, a powerful tool for building data pipelines. Apache NiFi can be used to ingest, process, and transform data from a variety of sources, making it an essential tool for Data Architects who need to build scalable and reliable data pipelines.
Software Engineer
Software Engineers use their knowledge of software engineering tools and technologies to design, develop, and maintain software applications. They use these skills to build software applications that meet the needs of users. This course can help Software Engineers learn about Apache NiFi, a powerful tool for building data pipelines. Apache NiFi can be used to ingest, process, and transform data from a variety of sources, making it an essential tool for Software Engineers who need to build scalable and reliable data pipelines.
Database Administrator
Database Administrators use their knowledge of database administration tools and technologies to manage and maintain databases. They use these skills to ensure that databases are available, reliable, and secure. This course can help Database Administrators learn about Apache NiFi, a powerful tool for building data pipelines. Apache NiFi can be used to ingest, process, and transform data from a variety of sources, making it an essential tool for Database Administrators who need to build scalable and reliable data pipelines.
ETL Developer
ETL Developers use their knowledge of ETL tools and technologies to design and implement ETL processes. They use these skills to build ETL processes that extract, transform, and load data from a variety of sources. This course can help ETL Developers learn about Apache NiFi, a powerful tool for building data pipelines. Apache NiFi can be used to ingest, process, and transform data from a variety of sources, making it an essential tool for ETL Developers who need to build scalable and reliable data pipelines.
Big Data Engineer
Big Data Engineers use their knowledge of big data tools and technologies to design and implement big data solutions. They use these skills to build big data solutions that process and analyze large amounts of data. This course can help Big Data Engineers learn about Apache NiFi, a powerful tool for building data pipelines. Apache NiFi can be used to ingest, process, and transform data from a variety of sources, making it an essential tool for Big Data Engineers who need to build scalable and reliable data pipelines.
Data Integration Engineer
Data Integration Engineers use their knowledge of data integration tools and technologies to design and implement data integration solutions. They use these skills to build data integration solutions that connect different data sources and systems. This course can help Data Integration Engineers learn about Apache NiFi, a powerful tool for building data pipelines. Apache NiFi can be used to ingest, process, and transform data from a variety of sources, making it an essential tool for Data Integration Engineers who need to build scalable and reliable data pipelines.
DevOps Engineer
DevOps Engineers use their knowledge of DevOps tools and technologies to design and implement DevOps processes. They use these skills to build DevOps processes that automate the software development and delivery process. This course may help DevOps Engineers learn about Apache NiFi, a powerful tool for building data pipelines. Apache NiFi can be used to ingest, process, and transform data from a variety of sources, making it a potentially useful tool for DevOps Engineers.
IT Manager
IT Managers use their knowledge of IT management tools and technologies to manage and maintain IT systems. They use these skills to ensure that IT systems are available, reliable, and secure. This course may help IT Managers learn about Apache NiFi, a powerful tool for building data pipelines. Apache NiFi can be used to ingest, process, and transform data from a variety of sources, making it a potentially useful tool for IT Managers.
Project Manager
Project Managers use their knowledge of project management tools and technologies to plan and execute projects. They use these skills to ensure that projects are completed on time, within budget, and to the required quality. This course may help Project Managers learn about Apache NiFi, a powerful tool for building data pipelines. Apache NiFi can be used to ingest, process, and transform data from a variety of sources, making it a potentially useful tool for Project Managers.

Reading list

We've selected five books that we think will supplement your learning. Use these to develop background knowledge, enrich your coursework, and gain a deeper understanding of the topics covered in Introduction to Apache NiFi | Cloudera DataFlow - HDF 2.0.
While this book focuses on MapReduce, it provides valuable insights into data-intensive processing techniques that are applicable to NiFi. It helps readers understand how to handle large datasets efficiently.
Comprehensive guide to Apache Spark, the open-source framework for data processing. It covers topics such as Spark Core, Spark SQL, and Spark Streaming.
Comprehensive guide to Apache Kafka, the open-source streaming platform. It covers topics such as Kafka architecture, data ingestion, and stream processing.
Comprehensive guide to MongoDB, the open-source NoSQL database. It covers topics such as MongoDB architecture, data modeling, and query optimization.

Share

Help others find this course page by sharing it with your friends and followers:

Similar courses

Here are nine courses similar to Introduction to Apache NiFi | Cloudera DataFlow - HDF 2.0.
Processing Streaming Data Using Apache Spark Structured...
Most relevant
Conceptualizing the Processing Model for the AWS Kinesis...
Most relevant
Windowing and Join Operations on Streaming Data with...
Most relevant
Apache Kafka Series - Learn Apache Kafka for Beginners v3
Most relevant
Handling Fast Data with Apache Spark SQL and Streaming
Most relevant
Conceptualizing the Processing Model for Apache Spark...
Most relevant
Serverless Data Processing with Dataflow: Foundations
Most relevant
Exploring the Apache Beam SDK for Modeling Streaming Data...
Handling Streaming Data with AWS Kinesis Data Analytics...
Our mission

OpenCourser helps millions of learners each year. People visit us to learn workspace skills, ace their exams, and nurture their curiosity.

Our extensive catalog contains over 50,000 courses and twice as many books. Browse by search, by topic, or even by career interests. We'll match you to the right resources quickly.

Find this site helpful? Tell a friend about us.

Affiliate disclosure

We're supported by our community of learners. When you purchase or subscribe to courses and programs or purchase books, we may earn a commission from our partners.

Your purchases help us maintain our catalog and keep our servers humming without ads.

Thank you for supporting OpenCourser.

© 2016 - 2024 OpenCourser