７ステップで作る日本語GPTスクラッチ開発ハンズオン (Build a Japanese GPT from Scratch in 7 Steps: Hands-On) from Udemy

Note: On 2024/2/27, a hands-on session was added on running the code for Google's Gemma locally (the demonstration uses Colab).

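No news has emerged of a team in Japan proposing a new Attention-based pretrained model architecture and surpassing the state of the art with it. Most Japanese-language models instead build on pretrained Transformer-based language models, such as "GPT-NeoX" and "LLaMa2", that overseas organizations have already developed, and then apply post-training (fine-tuning or RLHF) in Japanese.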

One reason is the shortage of compute resources. Another is that very few people in Japan can implement a Transformer-based language model in the first place.

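As competition in LLMs is expected to grow ever fiercer, I believe that focusing on the underlying technology, the Transformer, and being able to implement it from scratch and pretrain it (next-token, or next-character, prediction) lays a foundation for exciting technology to come out of Japan again. That is why I created "７ステップで作る日本語GPTスクラッチ開発ハンズオン", packing in the knowledge and skills from my many years of active research and development in deep learning and natural language processing.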

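This course uses Google Colab, so you do not have to worry about tedious setup or difficult environment configuration, and it walks you through seven steps that demystify the magic of ChatGPT, currently the center of attention. As a bonus, there is a hands-on session that calls quantized versions of Xwin-LM-13b and Elyza-llama2 from Google Colab to generate source code automatically, plus a hands-on session that runs "Zephyr 7b Alpha".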

Step 1: Set up Google Colaboratory. Start by setting up Google Colab, with no tedious environment configuration to worry about. Because it is easy to get started, the barrier to learning is lower.

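Step 2: Web-scrape Natsume Soseki's "I Am a Cat" (吾輩は猫である) from Aozora Bunko with BeautifulSoup. Work with real text data and learn the basics of natural language processing. You fetch the text with BeautifulSoup in preparation for the next step.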
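To make the scraping step concrete, here is a minimal sketch of pulling body text out of an Aozora Bunko style page. It is not the course's code: it swaps in the standard library's `html.parser` for BeautifulSoup so it runs without extra installs, it parses an inline sample instead of fetching a URL, and the `main_text` class name and ruby markup in `SAMPLE_HTML` are assumptions about the page structure.

```python
from html.parser import HTMLParser

# Hypothetical fragment imitating the markup of an Aozora Bunko XHTML page.
SAMPLE_HTML = """
<html><body>
<div class="metadata">底本：「夏目漱石全集」</div>
<div class="main_text">
<ruby><rb>吾輩</rb><rp>（</rp><rt>わがはい</rt><rp>）</rp></ruby>は猫である。<br />
名前はまだ無い。
</div>
</body></html>
"""


class MainTextExtractor(HTMLParser):
    """Collect text inside <div class="main_text">, skipping ruby readings."""

    SKIPPED_TAGS = {"rt", "rp"}  # furigana readings and their parentheses

    def __init__(self):
        super().__init__()
        self.div_depth = 0   # nesting depth inside the main_text div
        self.skip_depth = 0  # greater than 0 while inside <rt> or <rp>
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag == "div":
            if self.div_depth > 0:
                self.div_depth += 1
            elif ("class", "main_text") in attrs:
                self.div_depth = 1
        elif tag in self.SKIPPED_TAGS and self.div_depth > 0:
            self.skip_depth += 1

    def handle_endtag(self, tag):
        if tag == "div" and self.div_depth > 0:
            self.div_depth -= 1
        elif tag in self.SKIPPED_TAGS and self.skip_depth > 0:
            self.skip_depth -= 1

    def handle_data(self, data):
        if self.div_depth > 0 and self.skip_depth == 0:
            self.chunks.append(data)


def extract_main_text(html: str) -> str:
    parser = MainTextExtractor()
    parser.feed(html)
    # Drop line breaks and surrounding whitespace left over from the markup.
    return "".join(chunk.strip() for chunk in parser.chunks)


text = extract_main_text(SAMPLE_HTML)
print(text)
```

With BeautifulSoup the same idea would likely be a lookup of the body element followed by text extraction. The point of the sketch is only that the readings in `<rt>` and `<rp>` need to be dropped before the text is usable for training.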

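Step 3: Natural language processing (regular expressions, morphological analysis, encoding, decoding, and training-data creation). Master regular expressions and morphological analysis through text preprocessing. You clean the text data into a suitable form and tokenize it.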
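The preprocessing pipeline in this step can be sketched roughly as follows. This is an illustration under assumptions, not the course's implementation: the sample string and the annotation patterns (`《...》` readings and `［＃...］` editor notes, as used in Aozora Bunko plain-text files) are assumed, and tokens are single characters because the course's morphological analyzer is not named here.

```python
import re

# Hypothetical raw text with Aozora-style annotations.
raw = "吾輩《わがはい》は猫である。［＃改ページ］名前はまだ無い。"

# Regular expression step: remove 《...》 readings and ［＃...］ editor notes.
cleaned = re.sub(r"《[^》]*》|［＃[^］]*］", "", raw)

# Character-level vocabulary: every distinct character gets an integer id.
chars = sorted(set(cleaned))
stoi = {ch: i for i, ch in enumerate(chars)}
itos = {i: ch for ch, i in stoi.items()}


def encode(s):
    """Text -> list of token ids."""
    return [stoi[c] for c in s]


def decode(ids):
    """List of token ids -> text."""
    return "".join(itos[i] for i in ids)


def make_examples(token_ids, block_size):
    """Build (context, target) pairs; each target is the context shifted by one."""
    return [
        (token_ids[i:i + block_size], token_ids[i + 1:i + block_size + 1])
        for i in range(len(token_ids) - block_size)
    ]


ids = encode(cleaned)
examples = make_examples(ids, block_size=4)
context, target = examples[0]
print(decode(context), "->", decode(target))
```

Shifting the target by one position is what turns plain text into next-token prediction data: at every position the model is asked for the token that follows.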

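Step 4: Implementing and explaining a Bigram (2-gram) language model. This step implements a basic language model. Through the Bigram model you come to understand how words follow one another and meet the basic principles of text generation.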
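As a rough illustration of what a bigram model captures, the sketch below estimates next-character probabilities by counting. This is a swapped-in technique: the syllabus says the course defines its Bigram model with PyTorch's nn.Module and trains it, whereas this version uses plain counts so it runs with the standard library only. The corpus string and function names are assumptions.

```python
import random
from collections import Counter, defaultdict

corpus = "吾輩は猫である。名前はまだ無い。"  # hypothetical toy corpus

# Count how often each character is followed by each other character.
counts = defaultdict(Counter)
for current, following in zip(corpus, corpus[1:]):
    counts[current][following] += 1


def next_char_probs(ch):
    """Estimated probability of each character that can follow ch."""
    total = sum(counts[ch].values())
    return {nxt: n / total for nxt, n in counts[ch].items()}


def generate(start, length, seed=0):
    """Sample up to `length` further characters, one bigram step at a time."""
    rng = random.Random(seed)
    out = [start]
    for _ in range(length):
        followers = counts.get(out[-1])
        if not followers:
            break  # the last character was never seen with a successor
        candidates, weights = zip(*followers.items())
        out.append(rng.choices(candidates, weights=weights)[0])
    return "".join(out)


print(next_char_probs("は"))
print(generate("吾", 8))
```

The model looks only at the previous character, which is why generated text drifts quickly. Attention, introduced in the later steps, is what lets the prediction depend on the whole preceding context.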

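Step 5: Mathematical tricks for computing Attention (matrix computation and triangular matrices). To understand the Attention mechanism, part of the magic of the Transformer model, you learn about matrix computation and torch.tril().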
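The triangular-matrix trick can be shown in a few lines. The sketch below is written in plain Python so it runs without PyTorch; `tril_ones` stands in for what `torch.tril(torch.ones(n, n))` produces, and the toy input and helper names are assumptions. It shows that multiplying by a row-normalized lower-triangular matrix gives every time step the average of itself and all earlier steps, with no explicit loop over prefixes.

```python
def tril_ones(n):
    """n x n lower-triangular matrix of ones."""
    return [[1.0 if col <= row else 0.0 for col in range(n)] for row in range(n)]


def normalize_rows(matrix):
    """Scale each row so that it sums to 1."""
    return [[value / sum(row) for value in row] for row in matrix]


def matmul(a, b):
    """Plain matrix product of two lists of lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


# Toy input: T = 3 time steps, C = 2 channels.
x = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]

# Row t of `weights` spreads equal weight over steps 0..t and zero over the future.
weights = normalize_rows(tril_ones(len(x)))
averaged = matmul(weights, x)

# Reference computation with an explicit loop over prefixes.
looped = []
for t in range(len(x)):
    prefix = x[: t + 1]
    looped.append([sum(column) / len(prefix) for column in zip(*prefix)])

print(weights)
print(averaged)
```

Self-attention keeps this structure but replaces the uniform weights with weights computed from the data, which is why the same masking pattern appears again in the next step.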

Step 6: Implementing Self-Attention. Implement the Self-Attention mechanism and understand how it captures relationships between tokens. This is the core of the Transformer.
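A single causal self-attention head can be sketched as below. This is a minimal illustration in plain Python rather than the course's PyTorch code: the toy input and the identity projection matrices are assumptions chosen so the numbers are easy to follow, and a trained model would learn `w_q`, `w_k`, and `w_v`.

```python
import math


def matmul(a, b):
    """Plain matrix product of two lists of lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(matrix):
    return [list(col) for col in zip(*matrix)]


def softmax(row):
    """Numerically stable softmax; -inf entries receive weight 0."""
    peak = max(row)
    exps = [math.exp(value - peak) for value in row]
    total = sum(exps)
    return [e / total for e in exps]


def causal_self_attention(x, w_q, w_k, w_v):
    """Scaled dot-product attention where step i only sees steps 0..i."""
    q, k, v = matmul(x, w_q), matmul(x, w_k), matmul(x, w_v)
    scale = math.sqrt(len(q[0]))
    scores = matmul(q, transpose(k))  # scores[i][j] = q_i . k_j
    masked = [
        [s / scale if j <= i else float("-inf") for j, s in enumerate(row)]
        for i, row in enumerate(scores)
    ]
    weights = [softmax(row) for row in masked]
    return matmul(weights, v), weights


# Toy input: 3 tokens with 2-dimensional embeddings (hypothetical values).
x = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
identity = [[1.0, 0.0], [0.0, 1.0]]  # stand-in for learned projections
output, attention = causal_self_attention(x, identity, identity, identity)

for row in attention:
    print([round(w, 3) for w in row])
```

The first token can attend only to itself, so its attention row is all weight on position 0, and every entry above the diagonal is zero. That is the data-dependent version of the triangular averaging from Step 5.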

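Step 7: Multi-Head Attention, Positional Encoding, and implementing, training, and running inference with GPT. In the final step you combine Multi-Head Attention and Positional Encoding to implement, train, and run inference with a GPT model. You come away understanding what is inside ChatGPT and able to build a text generation model yourself.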
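For the Positional Encoding part of this step, one common formulation is the fixed sinusoidal encoding from the original Transformer paper, sketched below. Whether the course uses this variant or learned position embeddings is not stated here, so treat the choice as an assumption. The function name and toy sizes are also illustrative.

```python
import math


def positional_encoding(seq_len, d_model):
    """Sinusoidal encoding: sin on even dimensions, cos on odd dimensions."""
    pe = [[0.0] * d_model for _ in range(seq_len)]
    for pos in range(seq_len):
        for i in range(0, d_model, 2):
            # i is the even dimension index, so the exponent is i / d_model.
            angle = pos / (10000 ** (i / d_model))
            pe[pos][i] = math.sin(angle)
            if i + 1 < d_model:
                pe[pos][i + 1] = math.cos(angle)
    return pe


def add_positions(token_embeddings):
    """Add position information to a (seq_len x d_model) list of embeddings."""
    pe = positional_encoding(len(token_embeddings), len(token_embeddings[0]))
    return [
        [value + offset for value, offset in zip(emb_row, pe_row)]
        for emb_row, pe_row in zip(token_embeddings, pe)
    ]


pe = positional_encoding(seq_len=3, d_model=4)
for row in pe:
    print([round(value, 4) for value in row])
```

Attention by itself treats its inputs as an unordered set, so without some signal like this added to the token embeddings the model could not tell "猫は吾輩である" from "吾輩は猫である".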

(Bonus) Working with "TheBloke/Xwin-LM-70B-V0.1-GPTQ"

You call Xwin-LM through the Hugging Face library and generate source code from Japanese prompts in a Colab environment.

(Bonus) Working with "ELYZA-japanese-Llama-2-7b-fast-instruct-GPTQ-calib-ja-2k" and "HuggingFaceH4/zephyr-7b-alpha"

You call "ELYZA-japanese-Llama-2-7b-fast-instruct-GPTQ-calib-ja-2k" and "HuggingFaceH4/zephyr-7b-alpha" through the Hugging Face library and generate stories and source code in a Colab environment.

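(Bonus) Implementing a RAG system that makes full use of OpenAI's API update (2024/1/25). A 20-minute walkthrough of an implementation that answers questions from your company's own text using RAG (Retrieval-Augmented Generation) with text-embedding-3-small and gpt-4-0125-preview.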
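The retrieval half of a RAG system can be sketched without calling any API. In the example below the embedding vectors are made-up three-dimensional toy values; in the setup described above they would presumably come from text-embedding-3-small, and the assembled prompt would be sent to gpt-4-0125-preview. The documents, function names, and prompt wording are all assumptions for illustration.

```python
import math


def cosine_similarity(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Hypothetical in-house passages with made-up embedding vectors.
documents = {
    "Refunds are accepted within 30 days of purchase.": [0.9, 0.1, 0.0],
    "Support is available on weekdays from 9:00 to 17:00.": [0.1, 0.9, 0.1],
    "The office is located in Tokyo.": [0.0, 0.2, 0.9],
}


def retrieve(query_vector, k=1):
    """Return the k passages whose vectors are most similar to the query."""
    ranked = sorted(
        documents,
        key=lambda passage: cosine_similarity(query_vector, documents[passage]),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(question, passages):
    """Put the retrieved passages in front of the question for the chat model."""
    context = "\n".join(f"- {passage}" for passage in passages)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}"
    )


query_vector = [0.8, 0.2, 0.1]  # hypothetical embedding of the question
top = retrieve(query_vector, k=1)
prompt = build_prompt("Can I get a refund?", top)
print(prompt)
```

The design point is that the language model never searches the documents itself: similarity search picks the passages, and the model only reads what was placed in the prompt.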

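Through easy learning in a Google Colab environment, this course gives you a deep understanding of the core concepts of the Transformer and a way into the world of natural language processing. It is recommended for everyone, from programming beginners to experienced developers.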

Take a look at "７ステップで作る日本語GPTスクラッチ開発ハンズオン" on Udemy and experience the excitement of implementing a Transformer and of natural language processing!

What's inside

Syllabus

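Introduction

Set up Google Colaboratory and become able to run Python.

Create a new notebook in Google Colaboratory.

Become able to obtain Japanese data by web scraping.

Web scraping with BeautifulSoup

Learn the basics of natural language processing through preprocessing text data.

Preprocess text using regular expressions, morphological analysis, and katakana-hiragana conversion.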

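Encoding

Decoding

Splitting into training and test data

Converting to training data for a large language model (LLM) and enabling the GPU

Become able to define and train a Bigram model using PyTorch's nn.Module.

Implementing the Bigram model

Training the Bigram model

Training the Bigram model, part 2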

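Understand the implementation ideas (mathematical tricks) behind computing Attention.

Attention preparation: explaining language models

Speeding up computation with matrix operations

Step 6: Aim to understand the idea behind Self-Attention and become able to implement it.

Implementing Self-Attention

Positional Encoding and Multi-Head Attention

Multi-Head Attention, Positional Encoding, and implementing, training, and running inference with GPT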

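Become able to train and run inference with a GPT that uses morphological analysis to generate Natsume Soseki-style text at the level of word surface forms.

Word-based (surface form) GPT

Try pretraining on surface forms using kunishou/databricks-dolly-15k-ja.

The historical background of language models and why Positional Encoding is needed

Walkthrough of the Positional Encoding source code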

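Try accurate, lightweight language models derived from LLaMa2.

Xwin-LM

A hands-on session that runs a GPTQ-quantized Elyza-japanese-Llama-2-7b-fast-instruct model and the Zephyr-7b model on the free tier of Google Colaboratory.

Note: Operation was confirmed as of 2023/10/21, but please be aware that it is not guaranteed. Depending on the combination of versions, it may stop working in the future.

Mixtral 8x7b on Colab Free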

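Understand how to build a custom GPT from scratch using RAG with the latest OpenAI models.

OpenAI 2024/1/25 Update and RAG

Explains how to use and create HuggingChat Assistants.

HuggingChat Assistants and GPTs

Try out Mistral-Next with Gradio.

Trying Mistral-Next with Gradio

Become able to run Google Gemma locally inside Google Colab.

Google Gemma

Get a feel for how Llama3, Command R+, and Grok each respond to CoT (chain-of-thought) prompting.

Does CoT work on open-source language models?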

Good to know

Know what's good, what to watch for, and possible dealbreakers

Teaches learners about the core mathematical concepts behind Attention in Neural Networks, which may be foundational to many kinds of modeling

Explores innovative practices in the use of transformer-based language models, which are more computationally efficient than other current natural language processing models

Builds a solid knowledge base for learners by teaching them to understand the fundamentals of natural language processing and how to apply the transformer model in practice

Provides useful examples that walk learners through the process of creating advanced text generation models

Covers a broad range of topics related to text generation, including how to use the HuggingFace library, implement a GPT model, and generate text

Activities

Be better prepared before your course. Deepen your understanding during and after it. Supplement your coursework and achieve mastery of the topics covered in ７ステップで作る日本語GPTスクラッチ開発ハンズオン with these activities:

Practice writing code

Practice writing and debugging code to strengthen your understanding of core concepts.

Find coding exercises or problems online
Attempt to solve the exercises on your own
Review solutions and debug your code
Repeat the process with more exercises

Career center

Learners who complete ７ステップで作る日本語GPTスクラッチ開発ハンズオン will develop knowledge and skills that may be useful to these careers:

Natural Language Processing Scientist

Natural Language Processing (NLP) Scientists leverage deep learning, statistical algorithms, and linguistics to develop processes for data cleaning, feature engineering, and model building. The concepts and techniques you learn in this hands-on course will equip you with a strong foundation in NLP concepts, providing you with the knowledge and skills required to build advanced language-based solutions.

Linguist

Linguists study the structure and meaning of language. They analyze how languages are used and develop theories to explain how they change over time. The course will help linguists gain experience with natural language processing and machine learning, which are essential skills for analyzing language data.

Data Scientist

Data Scientists use scientific methods, processes, algorithms, and systems to extract knowledge and insights from data. This course will provide you with a solid foundation in natural language processing and deep learning, which are essential skills for data scientists who work with text and language data.

Machine Learning Engineer

Machine Learning Engineers design, develop, and maintain machine learning models. This course will introduce you to the theory and practice of natural language processing, providing you with the skills you need to build and deploy machine learning models that can understand and generate human language.

Software Engineer

Software Engineers design, develop, and maintain software systems. This course will provide you with the skills you need to build and deploy natural language processing systems, which are in high demand across a variety of industries.

NLP Engineer

NLP Engineers specialize in natural language processing. They develop and implement NLP models and systems. They also work on research and development of new NLP technologies. This course will provide you with the skills and knowledge you need to become an NLP Engineer.

AI Researcher

AI Researchers conduct research in the field of artificial intelligence. They develop new theories and algorithms for solving problems in AI. This course will provide you with the skills and knowledge you need to become an AI Researcher who specializes in natural language processing.

Data Analyst

Data Analysts collect, clean, and analyze data to identify patterns and trends. This course will provide you with the skills you need to work with natural language data, which is a growing area of focus for data analysts.

Information Architect

Information Architects design and organize information systems. They work with users to understand their needs and then create systems that are easy to use and understand. This course will provide you with the skills you need to work with natural language data, which is a critical part of many information systems.

User Experience Designer

User Experience Designers design and evaluate user interfaces. They work to make sure that users have a positive experience when interacting with products and services. This course will provide you with the skills you need to work with natural language data, which is a critical part of many user interfaces.

Content Strategist

Content Strategists plan and create content for websites, blogs, and other digital platforms. They work to ensure that content is relevant, engaging, and effective. This course will provide you with the skills you need to work with natural language data, which is a critical part of content strategy.

Technical Writer

Technical Writers create documentation for software, hardware, and other technical products. They work to ensure that documentation is clear, accurate, and easy to understand. This course will provide you with the skills you need to work with natural language data, which is a critical part of technical writing.

Marketing Manager

Marketing Managers plan and execute marketing campaigns. They work to promote products and services to target audiences. This course may provide you with some of the skills you need to work with natural language data, which is a growing area of focus for marketing managers.

Sales Manager

Sales Managers lead and motivate sales teams. They work to achieve sales goals and objectives. This course may provide you with some of the skills you need to work with natural language data, which is a growing area of focus for sales managers.

Customer Success Manager

Customer Success Managers work with customers to ensure that they are satisfied with products and services. They work to resolve issues and build relationships with customers. This course may provide you with some of the skills you need to work with natural language data, which is a growing area of focus for customer success managers.
