We may earn an affiliate commission when you visit our partners.
Course image
Anne Rynearson, JJ Miclat, and Vaishali Agarwal

Enroll in our data infrastructure training course to learn about building data pipelines, producers, consumers, and more. Start your data journey with Udacity.

Prerequisite details

To optimize your success in this program, we've created a list of prerequisites and recommendations to help you prepare for the curriculum. Prior to enrolling, you should have the following knowledge:

  • Basic spreadsheet use
  • Tableau proficiency
  • Basic SQL

You will also need to be able to communicate fluently and professionally in written and spoken English.

Here's a deal for you

We found an offer that may be relevant to this course.
Save money when you learn. All coupon codes, vouchers, and discounts are applied automatically unless otherwise noted.

What's inside

Syllabus

This is the Introduction to the Data Infrastructure Course.
Introduction to the concept of Data Pipelines, need for a data pipeline, and components of a data pipeline.
Read more
About systems, services or users who consume data
This lesson is about systems or services that produce data.
Putting everything together to create the right data strategy for your situation.
Flyber has been growing and now it needs a scalable data strategy. It is your job to create that data strategy.

Good to know

Know what's good
, what to watch for
, and possible dealbreakers
Offers a comprehensive overview of the data infrastructure landscape
Appropriate for learners with basic spreadsheet use, Tableau proficiency, and basic SQL skills
Covers in-demand skills and knowledge in the field of data infrastructure
Taught by experienced instructors in the field
Requires learners to take prior courses as prerequisites
May require access to additional resources or software not readily available to all learners

Save this course

Save Establishing Data Infrastructure to your list so you can find it easily later:
Save

Activities

Be better prepared before your course. Deepen your understanding during and after it. Supplement your coursework and achieve mastery of the topics covered in Establishing Data Infrastructure with these activities:
Review basic spreadsheet use
Sharpen spreadsheet skills to streamline data manipulation tasks.
Browse courses on Spreadsheets
Show steps
  • Review the fundamentals of spreadsheet software
  • Practice creating and formatting spreadsheets
  • Explore formulas and functions for data analysis
Review SQL Fundamentals
Refresh your understanding of basic SQL commands and concepts to enhance your comprehension of data pipeline construction and management.
Browse courses on SQL
Show steps
  • Review syntax and commands for data selection, manipulation, and aggregation.
  • Practice writing SQL queries to extract and analyze data from sample datasets.
Organize a Study Group
Collaborate with peers to reinforce concepts, solve problems, and enhance your understanding of data pipelines.
Show steps
  • Identify a group of peers who share an interest in data pipelines.
  • Schedule regular meetings to discuss course materials, share knowledge, and work on projects.
  • Take turns leading discussions and presenting on specific topics related to data pipelines.
Five other activities
Expand to see all activities and additional details
Show all eight activities
Explore Data Pipelines with AWS Step Functions
Enhance your understanding of data pipeline architecture and implementation by following tutorials on AWS Step Functions.
Show steps
  • Review the basics of AWS Step Functions and its role in data pipelines.
  • Follow step-by-step tutorials to create and manage data pipelines using AWS Step Functions.
  • Experiment with different configurations and scenarios to gain hands-on experience.
Attend a Data Pipeline Workshop
Connect with experts and gain practical insights into data pipeline implementation by attending a workshop.
Show steps
  • Research and identify industry-leading data pipeline workshops or conferences.
  • Register and participate in the workshop, actively engaging with speakers and attendees.
  • Apply the knowledge and best practices gained from the workshop to enhance your data pipeline skills.
Data Manipulation Exercises
Reinforce your data manipulation skills by practicing with realistic data sets and exercises.
Browse courses on Data Manipulation
Show steps
  • Load and explore data sets using Python or other programming languages.
  • Perform data cleaning, transformation, and analysis tasks based on specific scenarios.
  • Evaluate and refine your data manipulation techniques to improve efficiency.
Design a Data Pipeline for a Real-World Scenario
Apply your learning by designing a comprehensive data pipeline for a specific business scenario.
Show steps
  • Identify a real-world business problem that can be solved with a data pipeline.
  • Research and gather data from various sources to support your pipeline design.
  • Plan the architecture and components of your data pipeline, including data ingestion, transformation, and analysis.
  • Implement your design using appropriate tools and technologies.
  • Evaluate the performance and effectiveness of your data pipeline.
Mentor a Junior Data Engineer
Deepen your understanding of data pipelines by mentoring a junior professional and sharing your knowledge.
Show steps
  • Find a junior data engineer who is seeking guidance and mentorship in data pipelines.
  • Offer regular guidance, support, and technical advice to your mentee.
  • Provide feedback and encouragement to foster their growth and development.

Career center

Learners who complete Establishing Data Infrastructure will develop knowledge and skills that may be useful to these careers:
Data Architect
A Data Architect designs and manages the overall data infrastructure for an organization. They work with business stakeholders to understand their data needs and develop a strategy for meeting those needs. This course provides a comprehensive overview of the principles and practices of data architecture, making it a valuable resource for those who want to pursue a career in this field.
Data Engineer
A Data Engineer is responsible for designing, building, and maintaining data pipelines that move data between different systems. They work closely with data scientists and other data professionals to ensure that the data is clean, accurate, and timely. This course provides a solid foundation in the concepts and tools used by Data Engineers, making it a valuable resource for those who want to pursue a career in this field.
Data Scientist
A Data Scientist uses data to solve business problems. They work with data engineers and other data professionals to access and analyze data, and then use that data to develop models and insights that can help businesses make better decisions. This course provides a foundation in the skills and tools used by Data Scientists, making it a valuable resource for those who want to pursue a career in this field.
Data Analyst
A Data Analyst uses data to analyze business problems and develop recommendations for improvement. They work with data engineers and other data professionals to access and analyze data. This course provides a foundation in the skills and tools used by Data Analysts, making it a valuable resource for those who want to pursue a career in this field.
Business Analyst
A Business Analyst works with business stakeholders to understand their needs and develop solutions to meet those needs. They often use data to analyze business problems and develop recommendations for improvement. This course provides a foundation in the skills and tools used by Business Analysts, making it a valuable resource for those who want to pursue a career in this field.
Database Administrator
A Database Administrator manages and maintains databases. They work with data engineers and other data professionals to ensure that data is stored and managed efficiently and securely. This course provides a foundation in the skills and tools used by Database Administrators, making it a valuable resource for those who want to pursue a career in this field.
Software Engineer
A Software Engineer designs, develops, and maintains software applications. They work with data engineers and other data professionals to integrate data into software applications. This course provides a foundation in the skills and tools used by Software Engineers, making it a valuable resource for those who want to pursue a career in this field.
Data Privacy Specialist
A Data Privacy Specialist is responsible for ensuring that an organization complies with data privacy laws and regulations. They work with data engineers and other data professionals to develop and implement data privacy policies and procedures. This course provides a foundation in the skills and tools used by Data Privacy Specialists, making it a valuable resource for those who want to pursue a career in this field.
Information Security Analyst
An Information Security Analyst is responsible for protecting an organization's data from unauthorized access, use, disclosure, disruption, modification, or destruction. They work with data engineers and other data professionals to implement and maintain security measures. This course provides a foundation in the skills and tools used by Information Security Analysts, making it a valuable resource for those who want to pursue a career in this field.
Product Manager
A Product Manager is responsible for the development and launch of new products. They work with data engineers and other data professionals to ensure that data is used to make informed decisions about product development. This course provides a foundation in the skills and tools used by Product Managers, making it a valuable resource for those who want to pursue a career in this field.
Consultant
A Consultant provides advice and guidance to organizations on a variety of topics, including data infrastructure. They work with data engineers and other data professionals to help organizations improve their data management practices. This course provides a foundation in the skills and tools used by Consultants, making it a valuable resource for those who want to pursue a career in this field.
Data Governance Specialist
A Data Governance Specialist is responsible for developing and implementing data governance policies and procedures. They work with data engineers and other data professionals to ensure that data is used in a consistent and ethical manner. This course provides a foundation in the skills and tools used by Data Governance Specialists, making it a valuable resource for those who want to pursue a career in this field.
Project Manager
A Project Manager is responsible for the planning, execution, and delivery of projects. They work with data engineers and other data professionals to ensure that data is used to track project progress and make informed decisions. This course provides a foundation in the skills and tools used by Project Managers, making it a valuable resource for those who want to pursue a career in this field.
Machine Learning Engineer
A Machine Learning Engineer develops and deploys machine learning models. They work with data engineers and other data professionals to access and prepare data for machine learning. This course provides a foundation in the skills and tools used by Machine Learning Engineers, making it a valuable resource for those who want to pursue a career in this field.
Data Visualization Specialist
A Data Visualization Specialist creates visualizations that communicate data insights to a variety of audiences. They work with data engineers and other data professionals to access and prepare data for visualization. This course provides a foundation in the skills and tools used by Data Visualization Specialists, making it a valuable resource for those who want to pursue a career in this field.

Reading list

We've selected ten books that we think will supplement your learning. Use these to develop background knowledge, enrich your coursework, and gain a deeper understanding of the topics covered in Establishing Data Infrastructure.
Covers similar topics like this course on building scalable data pipelines. Provides more real-world examples and use cases.
Provides a comprehensive overview of MapReduce, a powerful programming model for processing large datasets. It valuable resource for anyone who wants to learn more about MapReduce and how to use it to solve real-world problems.
Provides a comprehensive introduction to Hadoop, a popular framework for building data pipelines. Covers topics like Hadoop Distributed File System (HDFS), MapReduce, and YARN. Useful for gaining a foundational understanding of the technology used in many data pipelines.
Provides a comprehensive overview of Apache Spark, a popular open-source framework for processing big data. It valuable resource for anyone who wants to learn more about Spark and how to use it to solve real-world problems.
Provides a hands-on approach to data analytics, including its techniques, tools, and applications. It valuable resource for anyone who wants to learn more about this emerging field.
Provides a complete guide to Power BI, a popular data visualization tool. It valuable resource for anyone who wants to learn more about Power BI and how to use it to create effective data visualizations.
Provides a practical introduction to data visualization, including its techniques, tools, and applications. It valuable resource for anyone who wants to learn more about this essential topic.
Provides a practical guide to data science, including its techniques, tools, and applications. It valuable resource for anyone who wants to learn more about this emerging field.
Provides a comprehensive overview of data science, including its techniques, tools, and applications. It valuable resource for anyone who wants to learn more about this emerging field.

Share

Help others find this course page by sharing it with your friends and followers:

Similar courses

Here are nine courses similar to Establishing Data Infrastructure.
Designing and Implementing Azure Infrastructure to...
Digital Thread: Components
Creating a Student Profile in Google Docs
Pluralsight LIVE Europe Kickoff Video
Design and Document Data Flows with Microsoft Azure
Amazon Kinesis Deep Dive
Kafka: Build, Deploy, and Monitor Your First Real-world...
Introduction to Secure Networking
Microsoft Azure Management Tools and Security Solutions
Our mission

OpenCourser helps millions of learners each year. People visit us to learn workspace skills, ace their exams, and nurture their curiosity.

Our extensive catalog contains over 50,000 courses and twice as many books. Browse by search, by topic, or even by career interests. We'll match you to the right resources quickly.

Find this site helpful? Tell a friend about us.

Affiliate disclosure

We're supported by our community of learners. When you purchase or subscribe to courses and programs or purchase books, we may earn a commission from our partners.

Your purchases help us maintain our catalog and keep our servers humming without ads.

Thank you for supporting OpenCourser.

© 2016 - 2024 OpenCourser