Pretraining LLMs

Lucy Park and Sung Kim

In Pretraining LLMs you’ll explore the first step of training large language models using a technique called pretraining. You’ll learn the essential steps to pretrain an LLM, understand the associated costs, and discover how starting with smaller, existing open source models can be more cost-effective.

Here's a deal for you

We found an offer that may be relevant to this course.

Save money when you learn. All coupon codes, vouchers, and discounts are applied automatically unless otherwise noted.

Valid until April 15

Coursera Plus Sale

Get unlimited access to expert-led courses that give you job-ready certificates with instructors from Google, IBM, and more.

Take

25%

off

What's inside

Syllabus

Pretraining LLMs

In Pretraining LLMs you’ll explore the first step of training large language models using a technique called pretraining. You’ll learn the essential steps to pretrain an LLM, understand the associated costs, and discover how starting with smaller, existing open source models can be more cost-effective.Pretraining involves teaching an LLM to predict the next token using vast text datasets, resulting in a base model, and this base model requires further fine-tuning for optimal performance and safety. In this course, you’ll learn to pretrain a model from scratch and also to take a model that’s already been pretrained and continue the pretraining process on your own data. In detail: 1. Explore scenarios where pretraining is the optimal choice for model performance. Compare text generation across different versions of the same model to understand the performance differences between base, fine-tuned, and specialized pre-trained models. 2. Learn how to create a high-quality training dataset using web text and existing datasets, which is crucial for effective model pretraining. 3. Prepare your cleaned dataset for training. Learn how to package your training data for use with the Hugging Face library. 4. Explore ways to configure and initialize a model for training and see how these choices impact the speed of pretraining. 5. Learn how to configure and execute a training run, enabling you to train your own model. 6. Learn how to assess your trained model’s performance and explore common evaluation strategies for LLMs, including important benchmark tasks used to compare different models’ performance. After taking this course, you’ll be equipped with the skills to pretrain a model—from data preparation and model configuration to performance evaluation.

Save this course

Save Pretraining LLMs to your list so you can find it easily later:

Save

Activities

Be better prepared before your course. Deepen your understanding during and after it. Supplement your coursework and achieve mastery of the topics covered in Pretraining LLMs with these activities:

Python Refresher

Show steps

Brush up on your Python skills before starting the course to ensure a strong foundation and minimize any potential obstacles in your learning journey.

Browse courses on Python

Show steps

Review basic syntax and data structures
Practice coding exercises
Complete online tutorials or refresher courses
Build a small project to apply your skills

Subject Matter Experts

Show steps

Seek out subject matter experts beyond your instructor to gain additional insights and perspectives on the course material, broadening your understanding and expanding your professional network.

Browse courses on Mentorship

Show steps

Identify potential mentors in the field
Reach out and introduce yourself
Schedule meetings or discussions
Seek guidance and advice

Peer-to-Peer Training

Show steps

Engage in peer-to-peer training by mentoring other students in the class, reinforcing your understanding of the course material and developing your leadership and communication skills.

Browse courses on Communication

Show steps

Identify a student who could benefit from your support
Schedule regular sessions to review course material
Provide constructive feedback and guidance
Create practice exercises and assignments

Five other activities

Expand to see all activities and additional details

Show all eight activities

GPT-3 Response Generator

Show steps

Build a GPT-3 based response generator using Hugging Face to practice your text generation skills and improve the quality of your responses in the course.

Browse courses on Hugging Face

Show steps

Install Hugging Face and transformers library
Load a pre-trained text generator model from Hugging Face
Write a function to generate text using the model
Create a simple web app that uses your function to generate responses

Fine-tuning for Specific Tasks

Show steps

Follow guided tutorials to fine-tune pre-trained models for specific tasks, allowing you to apply the concepts learned in the course and enhance your problem-solving skills.

Browse courses on Fine-tuning

Show steps

Identify a specific task and dataset
Select an appropriate pre-trained model
Fine-tune the model on your task
Evaluate the performance of the fine-tuned model

Pretrained Model Performance Evaluation

Show steps

Engage in practice drills to evaluate the performance of different pre-trained models on various benchmark tasks to reinforce your understanding of model evaluation techniques and gain practical experience.

Browse courses on Model Evaluation

Show steps

Create a dataset for model evaluation
Select relevant evaluation metrics
Implement model evaluation code
Evaluate and compare the performance of multiple models
Analyze the results and identify areas for improvement

State-of-the-Art Pretraining Techniques

Show steps

Compile resources and articles on state-of-the-art pre-training techniques to expand your knowledge and stay updated with the latest advancements in the field of Large Language Models.

Show steps

Conduct a literature search for relevant papers and articles
Summarize the key findings and insights
Organize and categorize the resources
Create a presentation or document to share your findings

Contribute to Hugging Face

Show steps

Engage with the open-source community by contributing to Hugging Face, the leading platform for Large Language Models, to gain practical experience and deepen your understanding of the ecosystem.

Browse courses on Open Source

Show steps