AI Workflow: Business Priorities and Data Ingestion from Coursera

This is the first course of a six-part specialization. You are STRONGLY encouraged to complete these courses in order, as they are not independent courses but part of a workflow where each course builds on the previous ones.

This first course in the IBM AI Enterprise Workflow Certification specialization introduces you to the scope of the specialization and its prerequisites. Specifically, the courses in this specialization are meant for practicing data scientists who are knowledgeable about probability, statistics, linear algebra, and Python tooling for data science and machine learning. A hypothetical streaming media company will be introduced as your new client. You will be introduced to the concept of design thinking, IBM's framework for organizing large enterprise AI projects. You will also be introduced to the basics of scientific thinking, because the quality that distinguishes a seasoned data scientist from a beginner is creative, scientific thinking. Finally, you will start your work for the hypothetical media company by understanding the data they have, and by building a data ingestion pipeline using Python and Jupyter notebooks.

By the end of this course you should be able to:

1. Know the advantages of carrying out data science using a structured process

2. Describe how the stages of design thinking correspond to the AI enterprise workflow

3. Discuss several strategies used to prioritize business opportunities

4. Explain where data science and data engineering have the most overlap in the AI workflow

5. Explain the purpose of testing in data ingestion

6. Describe the use case for sparse matrices as a target destination for data ingestion

7. Know the initial steps that can be taken towards automation of data ingestion pipelines
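
Objective 6 can be made concrete with a small sketch. Assuming the streaming company's logs arrive as (user, title, minutes watched) records, most user/title pairs never occur, so storing only the entries that do occur is much cheaper than a dense table. The record layout and the function name `to_coo` are illustrative assumptions, not course material. The triplets it returns follow the coordinate layout that a library such as SciPy's `scipy.sparse.coo_matrix` accepts.

```python
from collections import defaultdict


def to_coo(events):
    """Aggregate (user, title, minutes) events into coordinate-format triplets.

    Returns (rows, cols, values, shape, user_index, title_index). Only
    user/title pairs that actually occur are stored.
    """
    user_index, title_index = {}, {}
    totals = defaultdict(float)
    for user, title, minutes in events:
        # Assign each new user/title the next free row/column number.
        r = user_index.setdefault(user, len(user_index))
        c = title_index.setdefault(title, len(title_index))
        # Repeated views of the same title by the same user are summed.
        totals[(r, c)] += minutes
    rows = [r for r, _ in totals]
    cols = [c for _, c in totals]
    values = list(totals.values())
    shape = (len(user_index), len(title_index))
    return rows, cols, values, shape, user_index, title_index


# Hypothetical sample records, for illustration only.
events = [
    ("u1", "t1", 30.0),
    ("u2", "t3", 12.5),
    ("u1", "t1", 15.0),
    ("u3", "t2", 45.0),
]
rows, cols, values, shape, users, titles = to_coo(events)
stored = len(values)
dense_cells = shape[0] * shape[1]
print(f"stored {stored} of {dense_cells} cells")
```

With thousands of users and titles the gap between stored entries and dense cells grows quickly, which is the usual argument for a sparse target.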

Who should take this course?

This course targets existing data science practitioners who have expertise in building machine learning models and want to deepen their skills in building and deploying AI in large enterprises. If you are an aspiring Data Scientist, this course is NOT for you, as you need real-world expertise to benefit from the content of these courses.

What skills should you have?

It is assumed you have the following background prior to starting this course:
Fundamental understanding of linear algebra
Understanding of sampling, probability theory, and probability distributions
Knowledge of descriptive and inferential statistical concepts
General understanding of machine learning techniques and best practices
Practiced understanding of Python and the packages commonly used in data science: NumPy, pandas, matplotlib, scikit-learn
Familiarity with IBM Watson Studio
Familiarity with the design thinking process

What's inside

Syllabus

IBM AI Enterprise Workflow Introduction

The goal of this first module is to introduce you to the overall specialization requirements, evaluate your understanding of some key prerequisite knowledge, and familiarize you with several process models commonly used today. In this course we will use the process of design thinking, but it is the consistent application of a process in practice that is important, not the exact process itself. There are a number of reasons for choosing the design thinking process, but the most important is that it is applied across disciplines, not only within data science.

Traffic lights

Read about what's good, what should give you pause, and possible dealbreakers.

Explores how to carry out data science using a structured process, which is standard in industry

Teaches how to apply a scientific thought process to understanding business use cases, which helps learners develop critical thinking skills

Provides a case study to help learners practice the process of ingesting data, which helps learners develop practical skills

Taught by Mark J Grover and Ray Lopez, who are recognized for their work in the field of data science

Examines the overlap between data science and data engineering, which is highly relevant to professionals working in these fields

Requires students to have extensive background knowledge in linear algebra, probability, statistics, machine learning, Python, and IBM Watson Studio, which may pose a barrier for some learners

Reviews summary

Enterprise AI workflow business context

According to learners, this course provides a strong foundation for understanding the enterprise AI workflow, particularly emphasizing the critical link between AI projects and business priorities. Students appreciated the introduction to frameworks like design thinking within the AI context and the focus on scientific thinking for real-world deployment. Many found it an excellent first course that effectively sets up the rest of the specialization. However, some reviewers noted that the data ingestion module felt too basic, potentially creating a mismatch with the course's stated high prerequisites for experienced data scientists. Overall, it's viewed as valuable for framing large-scale AI initiatives from a business perspective.

Valuable insights on design & scientific thinking.

"The design thinking framework was explained well in this context."

"Really appreciated the focus on business priorities and scientific thinking alongside the technical bits."

"Covered important concepts like design thinking in the AI context."

Strong start for the entire series.

"Sets up the specialization nicely."

"A strong start to the specialization."

"Good foundational course."

"It frames everything needed for the rest of the specialization."

Excellent overview of AI in business.

"Excellent introduction to the enterprise AI workflow perspective."

"Good overview, highlights important considerations for applying AI in a business setting."

"Perfect first course for understanding the scope and business context of large-scale AI projects."

"I appreciated the focus on business priorities... crucial for moving from model building to real-world deployment."

Stated prerequisites may not match depth.

"...Prerequisites feel a bit inconsistent with the module depths."

"Prerequisites are high for this basic content."

Technical parts can be too basic for some.

"Data ingestion part with Python/Pandas was a good refresh, although basic for experienced folks."

"The data ingestion module was a bit too basic for my background..."

"Data ingestion part was very simple."

"Expected more technical depth on AI workflows... Data ingestion was standard."

Activities

Be better prepared before your course. Deepen your understanding during and after it. Supplement your coursework and achieve mastery of the topics covered in AI Workflow: Business Priorities and Data Ingestion with these activities:

Review Python and Jupyter Notebooks

Reinforces understanding of key tools and environment used in the AI workflow.

Read through online tutorials and documentation on Python and Jupyter Notebooks.
Complete practice exercises or coding challenges related to Python.
Set up a Jupyter Notebook environment on your computer and experiment with basic commands.

Solve Probability and Statistics Practice Problems

Strengthens foundational knowledge and problem-solving skills essential for data science and the AI workflow.

Obtain practice problems from textbooks, online resources, or previous coursework.
Solve problems independently, focusing on understanding concepts and applying formulas.
Review solutions and identify areas for improvement.

Compile a Glossary of Key Terms and Concepts

Enhances understanding and retention of key technical terms and concepts throughout the course.

Create a document or spreadsheet to record key terms and their definitions.
Regularly add to the glossary while studying course materials, participating in discussions, or reading external resources.
Review the glossary periodically to reinforce understanding.

Join a study group

Connect with other students in the course to discuss the concepts, share insights, and work through challenges together.

Find a study group or create your own.
Meet regularly to discuss the course material.
Work together on assignments and projects.

Follow Tutorials on Design Thinking Process

Introduces the design thinking process and its application in the AI enterprise workflow.

Identify and access online tutorials or courses on design thinking.
Follow the tutorials step-by-step, actively engaging with the content.
Complete any exercises or assignments associated with the tutorials.

Complete Jupyter notebook exercises

Practice the hands-on skills of data ingestion using Jupyter notebooks to solidify your understanding of the concepts covered in the Data Ingestion module.

Review the Jupyter notebook provided in the course materials.
Follow the instructions in the notebook to complete the exercises.
Run the code in the notebook to verify your results.
Troubleshoot any errors that you encounter.

Explore data ingestion tools

Expand your knowledge of data ingestion techniques by exploring online tutorials and documentation for popular tools and frameworks.

Identify relevant data ingestion tools and frameworks.
Review tutorials and documentation to understand their capabilities.
Experiment with the tools and frameworks in a hands-on environment.

Join a Study Group for AI Enterprise Workflow

Fosters collaboration, knowledge sharing, and discussion of concepts related to the AI enterprise workflow.

Find or create a study group with peers enrolled in the course.
Set regular meeting times and discuss assigned materials, case studies, or practice problems.
Support each other through Q&A and sharing of resources.

Design a data ingestion pipeline

Apply the principles of design thinking to create a data ingestion pipeline that meets the specific requirements of the hypothetical streaming media company.

Identify the data sources and data types that need to be ingested.
Design the architecture of the data ingestion pipeline, including data transformation and cleansing processes.
Develop a testing plan to validate the accuracy and completeness of the ingested data.
Document the design and implementation of the data ingestion pipeline.
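
The testing step in the plan above might look like the following sketch. It assumes each ingested record is a dict with the hypothetical fields `customer_id`, `title_id`, and `minutes`; the field names and the specific rules are illustrative, not taken from the course.

```python
REQUIRED_FIELDS = ("customer_id", "title_id", "minutes")  # hypothetical schema


def validate_records(records):
    """Split records into valid rows and (index, reason) rejections."""
    valid, rejected = [], []
    seen = set()
    for i, rec in enumerate(records):
        # Completeness: every required field must be present and non-empty.
        missing = [f for f in REQUIRED_FIELDS if rec.get(f) in (None, "")]
        if missing:
            rejected.append((i, f"missing: {', '.join(missing)}"))
            continue
        # Accuracy: minutes must parse as a non-negative number.
        try:
            minutes = float(rec["minutes"])
        except (TypeError, ValueError):
            rejected.append((i, "minutes is not numeric"))
            continue
        if minutes < 0:
            rejected.append((i, "minutes is negative"))
            continue
        # Uniqueness: identical records are rejected after the first one.
        key = (rec["customer_id"], rec["title_id"], minutes)
        if key in seen:
            rejected.append((i, "duplicate record"))
            continue
        seen.add(key)
        valid.append({**rec, "minutes": minutes})
    return valid, rejected


# Hypothetical sample rows covering each rejection rule.
sample = [
    {"customer_id": "c1", "title_id": "t1", "minutes": "42"},
    {"customer_id": "c1", "title_id": "t1", "minutes": "42"},   # duplicate
    {"customer_id": "c2", "title_id": "", "minutes": "10"},     # missing title
    {"customer_id": "c3", "title_id": "t9", "minutes": "abc"},  # bad number
    {"customer_id": "c4", "title_id": "t2", "minutes": "-5"},   # negative
]
valid, rejected = validate_records(sample)

# Every input row is accounted for exactly once.
assert len(valid) + len(rejected) == len(sample)
for index, reason in rejected:
    print(index, reason)
```

Returning the rejections with a reason, rather than silently dropping rows, keeps the checks auditable and gives the documentation step something concrete to report.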

Build a Sample Data Ingestion Pipeline

Provides hands-on experience in building a data ingestion pipeline, bridging the gap between data science and data engineering.

Identify a small dataset and define the desired transformations.
Use Python and appropriate libraries to write code for data ingestion, cleaning, and transformation.
Test the pipeline for accuracy and efficiency.
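
A minimal sketch of such a pipeline follows, using only the Python standard library. The inline CSV, the column names, and the `ingest`, `clean`, and `transform` function names are hypothetical; a real pipeline would more likely read files with pandas, which the course lists as a prerequisite.

```python
import csv
import io
from collections import defaultdict

# Hypothetical raw input with typical defects: stray whitespace,
# inconsistent case, and a missing value.
RAW_CSV = """customer_id,country,minutes
c1, us ,30
c2,DE,
c1,US,20
c3,de,15
"""


def ingest(text):
    """Parse CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text)))


def clean(rows):
    """Normalize country codes and drop rows with no minutes value."""
    cleaned = []
    for row in rows:
        minutes = (row["minutes"] or "").strip()
        if not minutes:
            continue  # incomplete row: skip rather than guess a value
        cleaned.append({
            "customer_id": row["customer_id"].strip(),
            "country": row["country"].strip().upper(),
            "minutes": float(minutes),
        })
    return cleaned


def transform(rows):
    """Aggregate total minutes watched per country."""
    totals = defaultdict(float)
    for row in rows:
        totals[row["country"]] += row["minutes"]
    return dict(totals)


rows = ingest(RAW_CSV)
cleaned = clean(rows)
totals = transform(cleaned)

# A simple accuracy check against the small, known input.
assert totals == {"US": 50.0, "DE": 15.0}
print(totals)
```

Keeping each stage a separate function makes it possible to test the stages independently, which is one small step toward automating the pipeline later.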

Attend a Workshop on Data Science and AI in Enterprise

Provides exposure to industry experts, case studies, and best practices in data science and AI in an enterprise context.

Identify and register for relevant workshops in the field.
Attend the workshop and actively participate in sessions.
Engage with speakers, ask questions, and network with other attendees.

Contribute to Open-Source Projects Related to Data Science

Enhances practical skills and promotes collaboration within the data science community while reinforcing concepts learned in the course.

Identify open-source projects on platforms like GitHub that align with interests and skill level.
Review the project documentation and codebase.
Submit bug reports, feature requests, or code contributions to the project.

Career center

Learners who complete AI Workflow: Business Priorities and Data Ingestion will develop knowledge and skills that may be useful in these careers:

Data Engineer

Automating data ingestion pipelines is an essential skill set for Data Engineers, who design, construct, and manage data pipelines to ensure that data is available for consumption by business-critical applications. This course provides a detailed overview of the data ingestion process, with a focus on building and automating data ingestion pipelines. Learners will gain hands-on experience with Python and Jupyter notebooks to build their own data ingestion pipelines, which will be essential for success as a Data Engineer.

Data Analyst

In order to identify patterns, trends, and anomalies in data, Data Analysts must have a deep understanding of data ingestion and preparation techniques. This course provides a comprehensive introduction to data collection and data ingestion, as well as the importance of applying a scientific thought process to the task of understanding the business use case. These skills are critical for success in Data Analyst roles.

Data Architect

The design and implementation of data ingestion pipelines is a fundamental responsibility of Data Architects, who are responsible for ensuring that data is managed and used effectively within an organization. This course provides a solid conceptual understanding of data ingestion, as well as practical experience with building data ingestion pipelines using Python and Jupyter notebooks. This knowledge is essential for success in Data Architect roles.

Machine Learning Engineer

To ensure that machine learning models are trained on high-quality data, Machine Learning Engineers must have a solid understanding of data ingestion and preparation techniques. This course provides a comprehensive overview of data collection, data ingestion, and data cleaning, with a focus on the specific requirements of machine learning applications. These skills will help Machine Learning Engineers build and deploy successful machine learning models.

Cloud Architect

Cloud Architects design, build, and manage cloud-based solutions, which often involve the ingestion and processing of large amounts of data. This course provides a strong foundation in data ingestion and data engineering, with a focus on cloud-based technologies. These skills will help Cloud Architects design and implement scalable, reliable, and secure cloud-based solutions.

Software Engineer

For Software Engineers working in the field of data science or machine learning, a deep understanding of data ingestion and preparation techniques is essential. This course provides a comprehensive overview of data collection, data ingestion, and data cleaning, with a focus on the specific requirements of software development. These skills will help Software Engineers build and deploy data-driven applications.

Business Analyst

In order to understand the business requirements and translate them into technical specifications, Business Analysts must have a basic understanding of data ingestion and preparation techniques. This course provides an introduction to data collection, data ingestion, and data cleaning, with a focus on the business context. These skills will help Business Analysts bridge the gap between business and technical teams.

Product Manager

To understand the needs of users and stakeholders, Product Managers must have a basic understanding of data ingestion and preparation techniques. This course provides an introduction to data collection, data ingestion, and data cleaning, with a focus on the product development process. These skills will help Product Managers build and launch successful products.

Operations Research Analyst

To optimize business processes and make data-driven decisions, Operations Research Analysts must have a basic understanding of data ingestion and preparation techniques. This course provides an introduction to data collection, data ingestion, and data cleaning, with a focus on the principles of operations research. These skills will help Operations Research Analysts solve complex problems and improve decision-making.

Financial Analyst

To analyze financial data and make investment recommendations, Financial Analysts must have a basic understanding of data ingestion and preparation techniques. This course provides an introduction to data collection, data ingestion, and data cleaning, with a focus on the financial industry. These skills will help Financial Analysts make informed investment decisions.

Market Researcher

To understand consumer behavior and market trends, Market Researchers must have a basic understanding of data ingestion and preparation techniques. This course provides an introduction to data collection, data ingestion, and data cleaning, with a focus on market research. These skills will help Market Researchers conduct effective research and make data-driven recommendations.

UX Researcher

To design and evaluate user experiences, UX Researchers must have a basic understanding of data ingestion and preparation techniques. This course provides an introduction to data collection, data ingestion, and data cleaning, with a focus on user experience research. These skills will help UX Researchers gather and analyze user data to improve the design of products and services.

Data Scientist

Although this course is intended for practicing data scientists, it may also be beneficial for those aspiring to enter the field, particularly those with a strong background in computer science, mathematics, or statistics. The course provides a comprehensive overview of the AI enterprise workflow, with a focus on the role of data ingestion and data engineering. These skills will help aspiring Data Scientists build a solid foundation for success in the field.

Software Developer

Software Developers working on data-intensive applications may find this course helpful, as it provides a solid foundation in data ingestion and data engineering. The course covers the principles of data collection, data ingestion, and data cleaning, with a focus on Python and Jupyter notebooks. These skills will help Software Developers build and deploy scalable, reliable, and secure data-intensive applications.

Database Administrator

Database Administrators responsible for managing data ingestion and data pipelines may find this course helpful, as it provides a comprehensive overview of the AI enterprise workflow, with a focus on data engineering and data management. The course covers the principles of data collection, data ingestion, and data cleaning, with a focus on best practices for database management. These skills will help Database Administrators ensure the availability, integrity, and security of data.
