Unity ML-Agents | Pretrain an LLM from Scratch with Sentence Transformers | Part 21d

Channel: p3nGu1nZz
Published on 2024-09-25 ● Video Link: https://www.youtube.com/watch?v=SQCzKWSgb8k





*Welcome back to our Tau LLM series! 🌟*

In this episode, we're diving into our fourth training attempt, known as **Series D**. Here's what we have planned:

**Series D Training Overview**: We'll run a total of **50 million training steps**, broken down as follows:
**First 10 Million Steps**: Focused on the first column of our output vector. This serves as a warm-up phase.
**Second 10 Million Steps**: Training on the first and second columns together. For each column we take the difference between the expected and actual values, then average those differences across the two columns.
**Third 10 Million Steps**: Training on the first three columns, averaged together in the same manner.
**Subsequent Steps**: We'll continue this pattern, progressively including more columns and averaging them together.
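The schedule above can be sketched in a few lines of Python. This is only an illustration, not the series' actual training code: the names `PHASE_STEPS`, `TOTAL_STEPS`, `active_columns`, and `schedule` are hypothetical, and the sketch assumes that every phase, including the later ones, lasts 10 million steps and adds exactly one column.

```python
# Hypothetical sketch of the Series D schedule described above.
# Assumption: every phase lasts 10M steps and adds one more column.
PHASE_STEPS = 10_000_000
TOTAL_STEPS = 50_000_000


def active_columns(step: int) -> int:
    """Return how many leading output columns are trained at this step."""
    if not 0 <= step < TOTAL_STEPS:
        raise ValueError(f"step must be in [0, {TOTAL_STEPS})")
    # Phase 0 uses 1 column, phase 1 uses 2 columns, and so on.
    return step // PHASE_STEPS + 1


def schedule() -> list[tuple[int, int, int]]:
    """List (first_step, end_step_exclusive, column_count) for each phase."""
    phases = TOTAL_STEPS // PHASE_STEPS
    return [(p * PHASE_STEPS, (p + 1) * PHASE_STEPS, p + 1) for p in range(phases)]


if __name__ == "__main__":
    for first, end, cols in schedule():
        print(f"steps {first:>11,} to {end:>11,}: first {cols} column(s)")
```

Under these assumptions the 50 million steps split into five phases, so the final phase would train on the first five columns.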

**Current Progress**: We've successfully completed the first 10 million steps on the first column. Now, we'll proceed with the second run of Series D, focusing on the first and second columns averaged together.

**Training Methodology**: For each column, we calculate the difference between the expected value and the actual value (expected minus actual). These per-column differences are the signal we use to improve the model's accuracy.
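As a rough illustration of the per-column difference and its average over the active columns, here is a minimal Python sketch. The function names `column_errors` and `averaged_error` are hypothetical. Taking the absolute value before averaging is an assumption made here so that positive and negative differences do not cancel; the episode description does not say how the signs are handled or how the result feeds into training.

```python
# Hypothetical sketch: per-column "expected minus actual" differences,
# averaged over the first num_columns columns of the output vector.
from typing import Sequence


def column_errors(
    expected: Sequence[float], actual: Sequence[float], num_columns: int
) -> list[float]:
    """Return expected - actual for each of the first num_columns columns."""
    if len(expected) != len(actual):
        raise ValueError("expected and actual must have the same length")
    if not 1 <= num_columns <= len(expected):
        raise ValueError("num_columns must be between 1 and the vector length")
    return [e - a for e, a in zip(expected[:num_columns], actual[:num_columns])]


def averaged_error(
    expected: Sequence[float], actual: Sequence[float], num_columns: int
) -> float:
    """Average the absolute per-column differences (absolute value is an assumption)."""
    diffs = column_errors(expected, actual, num_columns)
    return sum(abs(d) for d in diffs) / len(diffs)


if __name__ == "__main__":
    expected = [1.0, 0.5, 0.0]
    actual = [0.5, 1.0, 0.0]
    print(column_errors(expected, actual, 2))
    print(averaged_error(expected, actual, 2))
```

With one active column this reduces to the single-column difference used in the warm-up phase.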

Join us as we continue to refine our LLM with these advanced training techniques. Whether you're new to machine learning or an experienced practitioner, this episode offers valuable insights into the intricacies of training large language models.

Stay tuned and let's get started! 🚀



