Open Source Models with Hugging Face

Younes Belkada, Marc Sun , and Maria Khalusova

The availability of models and their weights for anyone to download enables a broader range of developers to innovate and create.

In this course, you’ll select open source models from Hugging Face Hub to perform NLP, audio, image and multimodal tasks using the Hugging Face transformers library. Easily package your code into a user-friendly app that you can run on the cloud using Gradio and Hugging Face Spaces.

You will:

1. Use the transformers library to turn a small language model into a chatbot capable of multi-turn conversations to answer follow-up questions.

Here's a deal for you

We found an offer that may be relevant to this course.

Save money when you learn. All coupon codes, vouchers, and discounts are applied automatically unless otherwise noted.

Valid until April 15

Coursera Plus Sale

Get unlimited access to expert-led courses that give you job-ready certificates with instructors from Google, IBM, and more.

Take

25%

off

What's inside

Syllabus

Open Source Models with Hugging Face

The availability of models and their weights for anyone to download enables a broader range of developers to innovate and create. In this course, you’ll select open source models from Hugging Face Hub to perform NLP, audio, image and multimodal tasks using the Hugging Face transformers library. Easily package your code into a user-friendly app that you can run on the cloud using Gradio and Hugging Face Spaces. You will: (1) Use the transformers library to turn a small language model into a chatbot capable of multi-turn conversations to answer follow-up questions. (2) Translate between languages, summarize documents, and measure the similarity between two pieces of text, which can be used for search and retrieval. (3) Convert audio to text with Automatic Speech Recognition (ASR), and convert text to audio using Text to Speech (TTS). (4) Perform zero-shot audio classification, to classify audio without fine-tuning the model. (5) Generate an audio narration describing an image by combining object detection and text-to-speech models. (6) Identify objects or regions in an image by prompting a zero-shot image segmentation model with points to identify the object that you want to select. (7) Implement visual question answering, image search, image captioning and other multimodal tasks. (8) Share your AI app using Gradio and Hugging Face Spaces to run your applications in a user-friendly interface on the cloud or as an API. The course will provide you with the building blocks that you can combine into a pipeline to build your AI-enabled applications!

Save this course

Save Open Source Models with Hugging Face to your list so you can find it easily later:

Save

Activities

Be better prepared before your course. Deepen your understanding during and after it. Supplement your coursework and achieve mastery of the topics covered in Open Source Models with Hugging Face with these activities:

Review fundamentals of signal processing for audio applications

Show steps

A strong foundation in signal processing is essential for audio-related tasks. This activity refreshes your knowledge, ensuring a solid understanding.

Browse courses on Signal Processing

Show steps

Read through the Hugging Face documentation on audio processing
Review basic concepts of signal processing, such as sampling, quantization, and filtering

Practice language translation using the transformers library

Show steps

Hands-on practice with the transformers library helps solidify your understanding of language translation techniques.

Show steps

Translate a short paragraph from English to French in Python using the transformers library
Explore different translation models and compare their accuracy

Build an image classification model with Hugging Face Hub

Show steps

This activity provides practical experience in model deployment and serving, enhancing your understanding of the model deployment process.

Show steps