We may earn an affiliate commission when you visit our partners.
Course image
Jong-Moon Chung

Every time you use Google to search something, every time you use Facebook, Twitter, Instagram or any other SNS (Social Network Service), and every time you buy from a recommended list of products on Amazon.com you are using a big data system. In addition, big data technology supports your smartphone, smartwatch, Alexa, Siri, and automobile (if it is a newer model) every day. The top companies in the world are currently using big data technology, and every company is in need of advanced big data technology support. Simply put, big data technology is not an option for your company, it is a necessity for survival and growth. So now is the right time to learn what big data is and how to use it in advantage of your company. This 6 module course first focuses on the world’s industry market share rankings of big data hardware, software, and professional services, and then covers the world’s top big data product line and service types of the major big data companies. Then the lectures focused on how big data analysis is possible based on the world’s most popular three big data technologies Hadoop, Spark, and Storm. The last part focuses on providing experience on one of the most famous and widely used big data statistical analysis systems in the world, the IBM SPSS Statistics. This course was designed to prepare you to be more successful in businesses strategic planning in the upcoming big data era. Welcome to the amazing Big Data world!

Enroll now

What's inside

Syllabus

Big Data Rankings & Products
The first module “Big Data Rankings & Products” focuses on the relation and market shares of big data hardware, software, and professional services. This information provides an insight to how future industry, products, services, schools, and government organizations will be influenced by big data technology. To have a deeper view into the world’s top big data products line and service types, the lecture provides an overview on the major big data company, which include IBM, SAP, Oracle, HPE, Splunk, Dell, Teradata, Microsoft, Cisco, and AWS. In order to understand the power of big data technology, the difference of big data analysis compared to traditional data analysis is explained. This is followed by a lecture on the 4 V big challenges of big data technology, which deal with issues in the volume, variety, velocity, and veracity of the massive data. Based on this introduction information, big data technology used in adding global insights on investments, help locate new stores and factories, and run real-time recommendation systems by Wal-Mart, Amazon, and Citibank is introduced.
Read more
Big Data & Hadoop
The second module “Big Data & Hadoop” focuses on the characteristics and operations of Hadoop, which is the original big data system that was used by Google. The lectures explain the functionality of MapReduce, HDFS (Hadoop Distributed FileSystem), and the processing of data blocks. These functions are executed on a cluster of nodes that are assigned the role of NameNode or DataNodes, where the data processing is conducted by the JobTracker and TaskTrackers, which are explained in the lectures. In addition, the characteristics of metadata types and the differences in the data analysis processes of Hadoop and SQL (Structured Query Language) are explained. Then the Hadoop Release Series is introduced which include the descriptions of Hadoop YARN (Yet Another Resource Negotiator), HDFS Federation, and HDFS HA (High Availability) big data technology.
Spark
The third module “Spark” focuses on the operations and characteristics of Spark, which is currently the most popular big data technology in the world. The lecture first covers the differences in data analysis characteristics of Spark and Hadoop, then goes into the features of Spark big data processing based on the RDD (Resilient Distributed Datasets), Spark Core, Spark SQL, Spark Streaming, MLlib (Machine Learning Library), and GraphX core units. Details of the features of Spark DAG (Directed Acyclic Graph) stages and pipeline processes that are formed based on Spark transformations and actions are explained. Especially, the definition and advantages of lazy transformations and DAG operations are described along with the characteristics of Spark variables and serialization. In addition, the process of Spark cluster operations based on Mesos, Standalone, and YARN are introduced.
Spark ML & Streaming
The fourth module “Spark ML & Streaming” focuses on how Spark ML (Machine Learning) works and how Spark streaming operations are conducted. The Spark ML algorithms include featurization, pipelines, persistence, and utilities which operate on the RDDs (Resilient Distributed Datasets) to extract information form the massive datasets. The lectures explain the characteristics of the DataFrame-based API, which is the primary ML API in the spark.ml package. Spark ML basic statistics algorithms based on correlation and hypothesis testing (P-value) are first introduced followed by the Spark ML classification and regression algorithms based on linear models, naive Bayes, and decision tree techniques. Then the characteristics of Spark streaming, streaming input and output, as well as streaming receiver types (which include basic, custom, and advanced) are explained, followed by how the Spark Streaming process and DStream (Discretized Stream) enable big data streaming operations for real-time and near-real-time applications.
Storm
The fifth module “Storm” focuses on the characteristics and operations of Storm big data systems. The lecture first covers the differences in data analysis characteristics of Storm, Spark, and Hadoop technology. Then the features of Storm big data processing based on the nimbus, spouts, and bolts are described followed by the Storm streams, supervisor, and ZooKeeper details. Further details on Storm reliable and unreliable spouts and bolts are provided followed by the advantages of Storm DAG (Directed Acyclic Graph) and data stream queue management. In addition, the advantages of using Storm based fast real-time applications, which include real-time analytics, online ML (Machine Learning), continuous computation, DRPC (Distributed Remote Procedure Call), and ETL (Extract, Transform, Load) are introduced.
IBM SPSS Statistics Project
The sixth and last module “IBM SPSS Statistics Project” focuses on providing experience on one of the most famous and widely used big data statistical analysis systems in the world. First, the lecture starts with how to setup and use IBM SPSS Statistics, and continues on to describe how IBM SPSS Statistics can be used to gain corporate data analysis experience. Then the data processing statistical results of two projects based on using the IBM SPSS Statistics big data system is conducted. The projects are conducted so the student can discover new ways to use, analyze, and draw charts of the relationship between datasets, and also compare the statistical results using IBM SPSS Statistics.

Good to know

Know what's good
, what to watch for
, and possible dealbreakers
Covers the core concepts and tools used in the big data industry
Emphasizes the practical application of big data technologies, making it relevant for professionals in various industries
Taught by experienced instructors with expertise in big data, ensuring high-quality content
The course is project-based, providing hands-on experience in working with big data tools
Leverages industry-leading technologies such as Hadoop, Spark, and IBM SPSS Statistics
Requires a strong foundation in programming and data analysis concepts

Save this course

Save Big Data Emerging Technologies to your list so you can find it easily later:
Save

Reviews summary

Well-received big data course

Learners largely praise this well-structured course on Big Data concepts and technologies, calling it informative, engaging, and accessible for beginners. They especially appreciate the clear explanations, practical examples, and enthusiastic instructor. However, some mention that the difficulty level can be challenging, and the final project could be improved. Overall, learners highly recommend this course for anyone seeking a comprehensive introduction to Big Data technologies.
Course is packed with up-to-date information on Big Data technologies.
"I have learned so much about the Big Data technologies in this course."
"Gives you a wide perspective regarding the Big Data"
"Amazing introduction course. I want to learn more about this fascinating area."
Course is well-suited for those new to Big Data.
"It is designed well for new learners on this topics. "
"This course gives a very good exposure to basics of Big data."
"The course was a very good intro in Big Data Technologies."
Instructor is knowledgeable, passionate, and engaging.
"Great course and excellent instructor."
"The lecturer explained everthing in details which is more understandable. Thank you I real enjoy the course."
"P​rofessor knows EVERYTHING, nevertheless showing a fine humble attitude during the lectures, providing extensive references for further studies at the end of the videos."
Instructor presents complex Big Data concepts in a way that's easy to understand.
"The lecturer explained everthing in details which is more understandable."
"The course was insightful and well explained even for someone who is is not tech savvy as me."
"P​rofessor knows EVERYTHING, nevertheless showing a fine humble attitude during the lectures, providing extensive references for further studies at the end of the videos."
Final project instructions are unclear and the tool used is only free for a limited time.
"Some quizzes require watching lectures from future weeks, which break the correct pace"
"The final assignment is formulated somewhat ambiguously and feels disconnected from the rest of the course."
"For this reasons, i gave three stars only, even if the course quality and the material is very good, the assignment could be improved a lot and is decreasing my final evaluation."
Course can be demanding at times, requiring dedication and effort.
"It's nice course to big jump into bigdata"
"A lot to learn and remember but a great course overall!"
"The course was insightful and well explained even for someone who is is not tech savvy as me."

Activities

Be better prepared before your course. Deepen your understanding during and after it. Supplement your coursework and achieve mastery of the topics covered in Big Data Emerging Technologies with these activities:
Overview of big data systems
Review the basic concepts of big data systems to enhance understanding of the course content.
Browse courses on Big Data
Show steps
  • Read the course overview and syllabus.
  • Watch introductory videos on big data.
  • Complete practice exercises on big data fundamentals.
Study Group Discussions
Reinforce your understanding and challenge your perspectives by engaging in discussions with peers.
Show steps
  • Provide feedback and support to your peers.
  • Join an online study group or create your own.
  • Discuss course concepts, assignments, and industry trends.
Spark Exploration
Enhance your understanding of Spark and its applications through guided tutorials.
Browse courses on Spark
Show steps
  • Find online tutorials on Spark basics.
  • Follow along with the tutorials, completing exercises and assignments.
  • Apply what you've learned to analyze sample datasets.
Two other activities
Expand to see all activities and additional details
Show all five activities
Big Data Coding Challenges
Strengthen your coding skills in big data technologies through practice drills.
Browse courses on Spark
Show steps
  • Solve coding problems on platforms like LeetCode or HackerRank.
  • Participate in online coding competitions.
  • Implement big data algorithms and techniques in personal projects.
Real-Time Data Analysis Project
Apply your knowledge to a real-world big data project and build confidence.
Browse courses on Big Data Analytics
Show steps
  • Choose a dataset and define your project goals.
  • Design and implement your data pipeline using Storm or another technology.
  • Analyze your data and draw insights.
  • Present your findings and discuss your approach.

Career center

Learners who complete Big Data Emerging Technologies will develop knowledge and skills that may be useful to these careers:
Data Analyst
Data Analysts use advanced analytics tools to collect, clean, and analyze data to identify patterns and trends. They then use this information to make recommendations and solve business problems. Big Data is a rapidly growing field, and Data Analysts with skills in Big Data technologies are in high demand. This course provides a comprehensive overview of Big Data technologies and techniques, making it an ideal choice for those who want to become a Data Analyst.
Big Data Architect
Big Data Architects design and implement Big Data solutions. They work with businesses to understand their data needs and then design and build systems to meet those needs. This course provides a comprehensive overview of Big Data technologies and techniques, making it an ideal choice for those who want to become a Big Data Architect.
Data Scientist
Data Scientists use advanced analytics techniques to solve complex business problems. They work with large datasets to identify patterns and trends, and then develop models to predict future outcomes. Big Data is a rapidly growing field, and Data Scientists with skills in Big Data technologies are in high demand. This course provides a comprehensive overview of Big Data technologies and techniques, making it an ideal choice for those who want to become a Data Scientist.
Machine Learning Engineer
Machine Learning Engineers design and implement machine learning models. They work with businesses to understand their business needs and then develop models to solve those needs. Big Data is a rapidly growing field, and Machine Learning Engineers with skills in Big Data technologies are in high demand. This course provides a comprehensive overview of Big Data technologies and techniques, making it an ideal choice for those who want to become a Machine Learning Engineer.
Database Administrator
Database Administrators manage and maintain databases. They ensure that databases are running smoothly and that data is safe and secure. Big Data is a rapidly growing field, and Database Administrators with skills in Big Data technologies are in high demand. This course provides a comprehensive overview of Big Data technologies and techniques, making it an ideal choice for those who want to become a Database Administrator.
Software Engineer
Software Engineers design, develop, and maintain software applications. They work with businesses to understand their needs and then develop software solutions to meet those needs. Big Data is a rapidly growing field, and Software Engineers with skills in Big Data technologies are in high demand. This course provides a comprehensive overview of Big Data technologies and techniques, making it an ideal choice for those who want to become a Software Engineer.
Business Analyst
Business Analysts work with businesses to understand their needs and then develop solutions to meet those needs. They often use data analysis techniques to identify patterns and trends, and then develop recommendations to improve business performance. Big Data is a rapidly growing field, and Business Analysts with skills in Big Data technologies are in high demand. This course provides a comprehensive overview of Big Data technologies and techniques, making it an ideal choice for those who want to become a Business Analyst.
Data Engineer
Data Engineers design and build data pipelines. They work with businesses to understand their data needs and then design and build systems to collect, clean, and process data. Big Data is a rapidly growing field, and Data Engineers with skills in Big Data technologies are in high demand. This course provides a comprehensive overview of Big Data technologies and techniques, making it an ideal choice for those who want to become a Data Engineer.
Statistician
Statisticians use statistical methods to analyze data and draw conclusions. They work with businesses to understand their data needs and then develop statistical models to answer questions and make predictions. Big Data is a rapidly growing field, and Statisticians with skills in Big Data technologies are in high demand. This course provides a comprehensive overview of Big Data technologies and techniques, making it an ideal choice for those who want to become a Statistician.
Operations Research Analyst
Operations Research Analysts use mathematical and analytical techniques to solve complex business problems. They work with businesses to understand their needs and then develop models to optimize operations. Big Data is a rapidly growing field, and Operations Research Analysts with skills in Big Data technologies are in high demand. This course provides a comprehensive overview of Big Data technologies and techniques, making it an ideal choice for those who want to become an Operations Research Analyst.
Financial Analyst
Financial Analysts use financial data to make investment recommendations. They work with businesses to understand their financial needs and then develop models to predict future financial performance. Big Data is a rapidly growing field, and Financial Analysts with skills in Big Data technologies are in high demand. This course provides a comprehensive overview of Big Data technologies and techniques, making it an ideal choice for those who want to become a Financial Analyst.
Marketing Analyst
Marketing Analysts use data to understand customer behavior and develop marketing campaigns. They work with businesses to understand their marketing needs and then develop models to predict customer behavior. Big Data is a rapidly growing field, and Marketing Analysts with skills in Big Data technologies are in high demand. This course provides a comprehensive overview of Big Data technologies and techniques, making it an ideal choice for those who want to become a Marketing Analyst.
Risk Analyst
Risk Analysts use data to identify and assess risks. They work with businesses to understand their risk needs and then develop models to predict future risks. Big Data is a rapidly growing field, and Risk Analysts with skills in Big Data technologies are in high demand. This course provides a comprehensive overview of Big Data technologies and techniques, making it an ideal choice for those who want to become a Risk Analyst.
Quantitative Analyst
Quantitative Analysts use mathematical and statistical techniques to analyze data and make investment recommendations. They work with businesses to understand their investment needs and then develop models to predict future investment performance. Big Data is a rapidly growing field, and Quantitative Analysts with skills in Big Data technologies are in high demand. This course provides a comprehensive overview of Big Data technologies and techniques, making it an ideal choice for those who want to become a Quantitative Analyst.

Reading list

We've selected ten books that we think will supplement your learning. Use these to develop background knowledge, enrich your coursework, and gain a deeper understanding of the topics covered in Big Data Emerging Technologies.
Is the definitive reference for Apache Spark, the fast and general-purpose distributed computing engine. It includes topics such as streaming data, machine learning, and graph processing with Spark. If you are interested in Spark, then this must-read.
Focuses on big data architecture and provides the knowledge for you to build big data systems from scratch. Topics covered include real-time data processing with Apache Storm, distributed storage systems such as Apache HDFS, and stream processing systems like Apache Kafka.
Provides a Java-centric view of big data analytics. Topics covered include using Hadoop, Spark, and Storm. You can follow along with the code examples in the book to get hands-on experience with these technologies.
Widely used as a university textbook, this book is considered an advanced exploration of the Hadoop framework. It covers the internals of Hadoop, Hadoop data formats, and the Hadoop ecosystem, making it a great reference for advanced Hadoop knowledge.
While this book little dated, it provides a good foundation for big data and its components. Big data analytics, data science, and big data governance are discussed along with data lakes and data warehouses. Finally, big data security and privacy are discussed.
While this book focuses on R, it provides a great introduction to data science concepts and techniques. Topics covered include data wrangling, data visualization, and statistical modeling. If data science interests you, this great addition to this course with Spark having integration with R.
Provides a business-oriented view of data science. Topics covered include data mining, machine learning, and data visualization. It good choice for someone who is interested in using data science to solve business problems.
Provides a practical introduction to big data technologies. Topics covered include data storage, data processing, and data visualization. It good starting point for someone who is new to big data.
Written with the beginner in mind, this book provides an overview of data analytics. Topics covered include data collection, data cleaning, and data analysis. It good introduction to the field of data analytics.
Provides a comprehensive glossary of big data terms. It valuable resource for anyone who wants to learn more about big data.

Share

Help others find this course page by sharing it with your friends and followers:

Similar courses

Here are nine courses similar to Big Data Emerging Technologies.
Hadoop for .NET Developers
Apache Spark 2.0 with Java -Learn Spark from a Big Data...
Creating Your First Big Data Hadoop Cluster Using...
Mastering Amazon Redshift Development & Administration
Big Data: Executive Briefing
Deep Learning for Business
Developer Trends & What They Signal About The Future of...
Business Intelligence and Competitive Analysis
Managing Big Data with AWS Storage Options
Our mission

OpenCourser helps millions of learners each year. People visit us to learn workspace skills, ace their exams, and nurture their curiosity.

Our extensive catalog contains over 50,000 courses and twice as many books. Browse by search, by topic, or even by career interests. We'll match you to the right resources quickly.

Find this site helpful? Tell a friend about us.

Affiliate disclosure

We're supported by our community of learners. When you purchase or subscribe to courses and programs or purchase books, we may earn a commission from our partners.

Your purchases help us maintain our catalog and keep our servers humming without ads.

Thank you for supporting OpenCourser.

© 2016 - 2024 OpenCourser