Unity ML-Agents | Pretrain an LLM from Scratch with Sentence Transformers | Part 21c
*Welcome back to our Tau LLM series!*
In this episode, we're diving into our fourth training attempt, known as **Series D**. Here's what we have planned:
**Series D Training Overview**: We'll be conducting a total of **50 million training steps**. The training will be broken down as follows:
**First 10 Million Steps**: Focused on the first column of our output vector. This serves as a warm-up phase.
**Second 10 Million Steps**: Training on the first and second columns, with their errors averaged together. Each column's error is the difference between its expected and actual values.
**Third 10 Million Steps**: Training on the first three columns, with their errors averaged in the same manner.
**Subsequent Steps**: We'll continue this pattern, progressively including more columns and averaging them together.
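To make the schedule concrete, here's a minimal sketch of how the Series D curriculum could map a training step to the number of active output columns. The function name `active_columns` and the `steps_per_stage` parameter are hypothetical, chosen purely for illustration; they are not part of the actual project code.

```python
def active_columns(step: int, steps_per_stage: int = 10_000_000) -> int:
    """Number of output-vector columns included in training at a given step.

    Hypothetical helper illustrating the Series D curriculum: the first
    10M steps train column 0 only, the next 10M add column 1, and so on.
    """
    return step // steps_per_stage + 1
```

For example, step 5,000,000 falls in the warm-up phase (one column), while step 15,000,000 falls in the second run (two columns).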
**Current Progress**: We've successfully completed the first 10 million steps on the first column. Now, we'll proceed with the second run of Series D, focusing on the first and second columns averaged together.
**Training Methodology**: Our approach involves calculating, for each included column, the difference between the expected and actual values, then averaging those differences across the active columns. This method helps us fine-tune the model's accuracy and performance.
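A minimal sketch of that error calculation, assuming the signed difference described above (the real training code may use an absolute or squared error instead; `averaged_error` is a hypothetical name for illustration):

```python
def averaged_error(expected: list[float], actual: list[float], n_columns: int) -> float:
    """Average of (expected - actual) over the first n_columns of the output vector.

    Hypothetical sketch of the Series D column-averaging step, not the
    project's actual implementation.
    """
    diffs = [e - a for e, a in zip(expected[:n_columns], actual[:n_columns])]
    return sum(diffs) / len(diffs)
```

With two active columns, `averaged_error([1.0, 2.0, 3.0], [0.5, 1.0, 3.0], 2)` averages the per-column differences 0.5 and 1.0, giving 0.75.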
Join us as we continue to refine our LLM with these advanced training techniques. Whether you're new to machine learning or an experienced practitioner, this episode offers valuable insights into the intricacies of training large language models.
Stay tuned and let's get started!