We may earn an affiliate commission when you visit our partners.
Course image

This is a hands-on, project-based course designed to help you master the foundations for unsupervised learning in Python.

Read more

This is a hands-on, project-based course designed to help you master the foundations for unsupervised learning in Python.

We’ll start by reviewing the data science workflow, discussing the techniques & applications of unsupervised learning, and walking through the data prep steps required for modeling. You’ll learn how to set the correct row granularity for modeling, apply feature engineering techniques, select relevant features, and scale your data using normalization and standardization.

From there we'll fit, tune, and interpret 3 popular clustering models using scikit-learn. We’ll start with K-Means Clustering, learn to interpret the output’s cluster centers, and use inertia plots to select the right number of clusters. Next, we’ll cover Hierarchical Clustering, where we’ll use dendrograms to identify clusters and cluster maps to interpret them. Finally, we’ll use DBSCAN to detect clusters and noise points and evaluate the models using their silhouette score.

We’ll also use DBSCAN and Isolation Forests for anomaly detection, a common application of unsupervised learning models for identifying outliers and anomalous patterns. You’ll learn to tune and interpret the results of each model and visualize the anomalies using pair plots.

Next, we’ll introduce the concept of dimensionality reduction, discuss its benefits for data science, and explore the stages in the data science workflow in which it can be applied. We’ll then cover two popular techniques: Principal Component Analysis, which is great for both feature extraction and data visualization, and t-SNE, which is ideal for data visualization.

Last but not least, we’ll introduce recommendation engines, and you'll practice creating both content-based and collaborative filtering recommenders using techniques such as Cosine Similarity and Singular Value Decomposition.

Throughout the course you'll play the role of an Associate Data Scientist for the HR Analytics team at a software company trying to increase employee retention. Using the skills you learn throughout the course, you'll use Python to segment the employees, visualize the clusters, and recommend next steps to increase retention.

COURSE OUTLINE:

  • Intro to Data Science

    • Introduce the fields of data science and machine learning, review essential skills, and introduce each phase of the data science workflow

  • Unsupervised Learning 101

    • Review the basics of unsupervised learning, including key concepts, types of techniques and applications, and its place in the data science workflow

  • Pre-Modeling Data Prep

    • Recap the data prep steps required to apply unsupervised learning models, including restructuring data, engineering & scaling features, and more

  • Clustering

    • Apply three different clustering techniques in Python and learn to interpret their results using metrics, visualizations, and domain expertise

  • Anomaly Detection

    • Understand where anomaly detection fits in the data science workflow, and apply techniques like Isolation Forests and DBSCAN in Python

  • Dimensionality Reduction

    • Use techniques like Principal Component Analysis (PCA) and t-SNE in Python to reduce the number of features in a data set without losing information

  • Recommenders

    • Recognize the variety of approaches for creating recommenders, then apply unsupervised learning techniques in Python, including Cosine Similarity and Singular Vector Decomposition (SVD)

Ready to dive in? Join today and get immediate5 hours of high-quality video

  • 22 homework assignments

  • 7 quizzes

  • 3 projects

  • Data Science in Python: Unsupervised Learning ebook (350+ pages)

  • Downloadable project files & solutions

  • Expert support and Q&A forum

  • 30-day Udemy satisfaction guarantee

  • If you're an aspiring or seasoned data scientist looking for a practical overview of unsupervised learning techniques in Python with a focus on interpretation, this is the course for you.

    Happy learning.

    -Alice Zhao (Python Expert & Data Science Instructor, Maven Analytics)

    Enroll now

    Good to know

    Know what's good
    , what to watch for
    , and possible dealbreakers
    Develops skills for unsupervised learning in Python, which is highly relevant and in-demand in industry
    Taught by qualified instructors who are experts in data science and unsupervised learning
    Emphasizes hands-on learning with practical assignments and projects
    Covers a comprehensive range of topics in unsupervised learning, including clustering, anomaly detection, dimensionality reduction, and recommendation engines
    Suitable for aspiring and seasoned data scientists seeking to enhance their skills in unsupervised learning

    Save this course

    Save Data Science in Python: Unsupervised Learning to your list so you can find it easily later:
    Save

    Activities

    Be better prepared before your course. Deepen your understanding during and after it. Supplement your coursework and achieve mastery of the topics covered in Data Science in Python: Unsupervised Learning with these activities:
    Organize and review course notes and materials
    Strengthen your understanding of course content by reviewing and organizing your notes, assignments, and materials regularly.
    Show steps
    • Gather all course notes, assignments, and materials.
    • Organize the materials into a logical structure using folders or a note-taking app.
    • Review the materials regularly to reinforce your understanding and identify areas for further study.
    Review Python basics
    Review fundamentals of Python syntax such as data types, control flow, and functions to strengthen your foundation.
    Browse courses on Python Basics
    Show steps
    • Go through online tutorials or documentation on Python basics.
    • Complete practice exercises or coding challenges to reinforce your understanding.
    Read 'Unsupervised Learning' by Jake VanderPlas
    Gain a comprehensive understanding of unsupervised learning concepts and techniques by reading an authoritative book on the subject.
    Show steps
    • Obtain a copy of 'Unsupervised Learning' by Jake VanderPlas.
    • Read through the book, taking notes and highlighting important concepts.
    • Complete the exercises and practice problems provided in the book to reinforce your understanding.
    Three other activities
    Expand to see all activities and additional details
    Show all six activities
    Join study groups or online forums
    Enhance your learning by collaborating with peers, discussing course concepts, and sharing insights.
    Show steps
    • Identify study groups or online forums related to the course material.
    • Actively participate in discussions, asking questions and sharing your own perspectives.
    • Collaborate on projects or assignments to gain diverse perspectives and improve your understanding.
    Follow tutorials on clustering algorithms
    Deepen your understanding of clustering techniques such as K-Means, hierarchical clustering, and DBSCAN through guided tutorials.
    Browse courses on Clustering Algorithms
    Show steps
    • Identify online resources or courses that provide step-by-step tutorials on clustering algorithms.
    • Follow the tutorials, implementing the algorithms in Python.
    • Experiment with different parameters and datasets to observe the impact on clustering results.
    Solve coding exercises on unsupervised learning
    Sharpen your coding skills by solving practice problems and exercises focused on unsupervised learning techniques.
    Browse courses on Unsupervised Learning
    Show steps
    • Find online platforms or coding challenge websites that offer unsupervised learning exercises.
    • Attempt to solve the exercises, debugging and refining your code as needed.
    • Review solutions or compare your approach with others to identify areas for improvement.

    Career center

    Learners who complete Data Science in Python: Unsupervised Learning will develop knowledge and skills that may be useful to these careers:
    Data Scientist
    Data Scientists are responsible for collecting, analyzing, and interpreting data to help businesses make informed decisions. This course may be useful for aspiring Data Scientists as it provides a foundation in unsupervised learning techniques, which are essential for identifying patterns and trends in data. The hands-on projects and real-world case studies will also help learners develop the practical skills needed to succeed in this role.
    Machine Learning Engineer
    Machine Learning Engineers design, develop, and deploy machine learning models to solve business problems. This course may be useful for aspiring Machine Learning Engineers as it provides a foundation in unsupervised learning techniques, which are essential for building and evaluating machine learning models. The hands-on projects and real-world case studies will also help learners develop the practical skills needed to succeed in this role.
    Data Analyst
    Data Analysts collect, analyze, and interpret data to help businesses understand their customers and make informed decisions. This course may be useful for aspiring Data Analysts as it provides a foundation in unsupervised learning techniques, which are essential for identifying patterns and trends in data. The hands-on projects and real-world case studies will also help learners develop the practical skills needed to succeed in this role.
    Software Engineer
    Software Engineers design, develop, and maintain software applications. This course may be useful for aspiring Software Engineers as it provides a foundation in unsupervised learning techniques, which can be used to improve the performance and efficiency of software applications. The hands-on projects and real-world case studies will also help learners develop the practical skills needed to succeed in this role.
    Business Analyst
    Business Analysts help businesses understand their customers and make informed decisions. This course may be useful for aspiring Business Analysts as it provides a foundation in unsupervised learning techniques, which can be used to identify patterns and trends in data. The hands-on projects and real-world case studies will also help learners develop the practical skills needed to succeed in this role.
    Product Manager
    Product Managers are responsible for the development and launch of new products. This course may be useful for aspiring Product Managers as it provides a foundation in unsupervised learning techniques, which can be used to identify customer needs and develop products that meet those needs. The hands-on projects and real-world case studies will also help learners develop the practical skills needed to succeed in this role.
    Marketing Analyst
    Marketing Analysts help businesses understand their customers and make informed decisions. This course may be useful for aspiring Marketing Analysts as it provides a foundation in unsupervised learning techniques, which can be used to identify patterns and trends in data. The hands-on projects and real-world case studies will also help learners develop the practical skills needed to succeed in this role.
    Financial Analyst
    Financial Analysts help businesses make informed decisions about investments and financial planning. This course may be useful for aspiring Financial Analysts as it provides a foundation in unsupervised learning techniques, which can be used to identify patterns and trends in financial data. The hands-on projects and real-world case studies will also help learners develop the practical skills needed to succeed in this role.
    Operations Research Analyst
    Operations Research Analysts help businesses make informed decisions about operations and logistics. This course may be useful for aspiring Operations Research Analysts as it provides a foundation in unsupervised learning techniques, which can be used to identify patterns and trends in data. The hands-on projects and real-world case studies will also help learners develop the practical skills needed to succeed in this role.
    Statistician
    Statisticians collect, analyze, and interpret data to help businesses and organizations make informed decisions. This course may be useful for aspiring Statisticians as it provides a foundation in unsupervised learning techniques, which are essential for identifying patterns and trends in data. The hands-on projects and real-world case studies will also help learners develop the practical skills needed to succeed in this role.
    Data Engineer
    Data Engineers design, build, and maintain data pipelines and infrastructure. This course may be useful for aspiring Data Engineers as it provides a foundation in unsupervised learning techniques, which can be used to improve the performance and efficiency of data pipelines. The hands-on projects and real-world case studies will also help learners develop the practical skills needed to succeed in this role.
    Quantitative Analyst
    Quantitative Analysts use mathematical and statistical models to help businesses make informed decisions. This course may be useful for aspiring Quantitative Analysts as it provides a foundation in unsupervised learning techniques, which are essential for building and evaluating mathematical and statistical models. The hands-on projects and real-world case studies will also help learners develop the practical skills needed to succeed in this role.
    Actuary
    Actuaries use mathematical and statistical models to assess risk and uncertainty. This course may be useful for aspiring Actuaries as it provides a foundation in unsupervised learning techniques, which are essential for building and evaluating mathematical and statistical models. The hands-on projects and real-world case studies will also help learners develop the practical skills needed to succeed in this role.
    Risk Analyst
    Risk Analysts help businesses identify and manage risks. This course may be useful for aspiring Risk Analysts as it provides a foundation in unsupervised learning techniques, which are essential for identifying and assessing risks. The hands-on projects and real-world case studies will also help learners develop the practical skills needed to succeed in this role.

    Reading list

    We've selected eight books that we think will supplement your learning. Use these to develop background knowledge, enrich your coursework, and gain a deeper understanding of the topics covered in Data Science in Python: Unsupervised Learning.
    Provides a comprehensive overview of unsupervised learning algorithms and techniques. It covers a wide range of topics, from clustering and dimensionality reduction to anomaly detection and recommender systems.
    Provides a comprehensive overview of data science using Python, with a focus on practical implementation. It covers a wide range of topics, from data cleaning and preparation to data analysis and visualization.
    Provides a comprehensive overview of data mining, with a focus on practical implementation using Python. It covers a wide range of topics, from data cleaning and preparation to data analysis and visualization.
    Provides a comprehensive overview of deep learning using Python, with a focus on practical implementation. It covers a wide range of topics, from deep neural networks and convolutional neural networks to recurrent neural networks and reinforcement learning.
    Provides a comprehensive overview of machine learning algorithms and techniques, with a focus on practical implementation using Python libraries such as Scikit-learn, Keras, and TensorFlow. It includes hands-on exercises to help readers apply what they learn.
    Provides a comprehensive overview of data science from the ground up, with a focus on practical implementation using Python. It covers a wide range of topics, from data cleaning and preparation to data analysis and machine learning.

    Share

    Help others find this course page by sharing it with your friends and followers:

    Similar courses

    Here are nine courses similar to Data Science in Python: Unsupervised Learning.
    Clustering Analysis
    Most relevant
    Unsupervised Machine Learning
    Most relevant
    Building Unsupervised Learning Models with TensorFlow
    Most relevant
    Machine Learning with Python
    Most relevant
    Unlocking the Secrets of Data: Unsupervised Learning with...
    Most relevant
    Applied Machine Learning in Python
    Most relevant
    Unsupervised Learning and Its Applications in Marketing
    Most relevant
    Building Machine Learning Models in Python with scikit...
    Most relevant
    Implementing Machine Learning Workflow with Weka
    Most relevant
    Our mission

    OpenCourser helps millions of learners each year. People visit us to learn workspace skills, ace their exams, and nurture their curiosity.

    Our extensive catalog contains over 50,000 courses and twice as many books. Browse by search, by topic, or even by career interests. We'll match you to the right resources quickly.

    Find this site helpful? Tell a friend about us.

    Affiliate disclosure

    We're supported by our community of learners. When you purchase or subscribe to courses and programs or purchase books, we may earn a commission from our partners.

    Your purchases help us maintain our catalog and keep our servers humming without ads.

    Thank you for supporting OpenCourser.

    © 2016 - 2024 OpenCourser