Unity ML-Agents | Pretrain an LLM from Scratch with Sentence Transformers | Part 15d
*Welcome back to our Tau LLM series!*
In this episode, we're taking our project to the next level with some exciting new developments. Our highlights include:
**Data File De-duplication**: We've automated the de-duplication process for any data file loaded into our database, ensuring cleaner and more efficient training data.
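For a sense of what that could look like, here's a minimal hash-based sketch in Python. The JSONL layout and function name are illustrative assumptions, not the project's actual code:

```python
import hashlib
import json

def dedupe_records(path: str) -> list[dict]:
    """Load a JSONL data file and drop exact-duplicate records.

    Sketch only: assumes one JSON object per line; the real pipeline
    may key on a specific field instead of the whole record.
    """
    seen: set[str] = set()
    unique: list[dict] = []
    with open(path, encoding="utf-8") as f:
        for line in f:
            record = json.loads(line)
            # Hash a canonical form of the record so key order doesn't matter.
            digest = hashlib.sha256(
                json.dumps(record, sort_keys=True).encode("utf-8")
            ).hexdigest()
            if digest not in seen:
                seen.add(digest)
                unique.append(record)
    return unique
```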
**Ophrase Python Module**: We've completed our ophrase module, which uses Ollama to generate multiple paraphrases of a given sentence, enhancing our dataset's diversity.
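As a rough illustration of the approach, a paraphrase call against a local Ollama server might look like the sketch below. The prompt wording, model name, and output parsing are assumptions here; the real ophrase module may differ:

```python
import ollama  # official Ollama Python client; assumes a local server

def paraphrase(sentence: str, n: int = 3, model: str = "llama3") -> list[str]:
    """Ask a local Ollama model for n paraphrases of a sentence."""
    prompt = (
        f"Give {n} distinct paraphrases of the following sentence, "
        f"one per line, with no numbering:\n{sentence}"
    )
    response = ollama.chat(
        model=model,
        messages=[{"role": "user", "content": prompt}],
    )
    # Split the model's reply into one paraphrase per line.
    lines = response["message"]["content"].strip().splitlines()
    return [line.strip() for line in lines if line.strip()][:n]
```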
**New Python Module for Responses**: Today, we'll implement a new module that generates responses to our paraphrased sentences, expanding our dataset from 1,000 to 9,000 records with the goal of reducing entropy and training loss.
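A hedged sketch of the response side, again via the Ollama client: the helper names and prompt handling are hypothetical, and the ninefold growth assumes each of the 1,000 source sentences ends up with nine (prompt, response) variants:

```python
import ollama  # assumes a local Ollama server, as above

def generate_response(prompt: str, model: str = "llama3") -> str:
    """Generate one response for a (paraphrased) sentence."""
    reply = ollama.chat(
        model=model,
        messages=[{"role": "user", "content": prompt}],
    )
    return reply["message"]["content"].strip()

def build_pairs(variants: list[str], model: str = "llama3") -> list[dict]:
    # One (prompt, response) record per variant; applied to every
    # source sentence, this fan-out is what grows the dataset.
    return [
        {"prompt": v, "response": generate_response(v, model)}
        for v in variants
    ]
```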
**Encoder Deduplication**: We'll also introduce a de-duplication step for our encoder: before generating or storing an embedding, it will check whether one already exists in the database, preventing duplicate entries and keeping our index count lean.
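Here's one way such a guard could look with sentence-transformers, keying on a hash of the exact text. The model name and the in-memory "database" are stand-ins for the project's real storage:

```python
import hashlib
from sentence_transformers import SentenceTransformer

class DedupingEncoder:
    """Encode sentences, skipping any text that was already embedded."""

    def __init__(self, model_name: str = "all-MiniLM-L6-v2"):
        self.model = SentenceTransformer(model_name)
        self.index: dict[str, list[float]] = {}  # stands in for the database

    def add(self, text: str) -> bool:
        # Key on a hash of the exact text so the check is cheap.
        key = hashlib.sha256(text.encode("utf-8")).hexdigest()
        if key in self.index:
            return False  # duplicate: no new embedding generated
        self.index[key] = self.model.encode(text).tolist()
        return True
```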
**Upcoming Training**: If all goes well, we'll generate new embeddings for our expanded dataset and hopefully begin training later today or tomorrow.
Join us as we continue to build, debug, and optimize our LLM project step by step. Whether you're a beginner or an experienced developer, this episode offers valuable insights into developing, testing, and enhancing an LLM using custom tools and techniques.
Stay tuned and let's get started!