Unity ML-Agents | Pretrain an LLM from Scratch with Sentence Transformers | Part 15c

Channel: p3nGu1nZz
Subscribers: 373
Published on 2024-09-07 ● Video Link: https://www.youtube.com/watch?v=4MS9RvEviow



49 views


*Welcome back to our Tau LLM series! 🌟*

In this episode, we're taking our project to the next level with some exciting new developments. Our highlights include:

**Data File De-duplication**: We've automated the de-duplication process for any data file loaded into our database, ensuring cleaner and more efficient training data.
**Ophrase Python Module**: We've successfully completed our ophrase module, which generates multiple paraphrases from a given sentence using Ollama, enhancing our dataset diversity.
**New Python Module for Responses**: Today, we'll implement a new module that generates responses or answers to our paraphrased sentences. This should expand our dataset from 1,000 to 9,000 records, with the aim of lowering entropy and loss during training.
**Encoder Deduplication**: We'll also add a deduplication step to our encoder. Before generating an embedding or adding it to the database, the encoder will check whether that embedding already exists, which prevents duplicate entries and keeps the index from growing unnecessarily.
**Upcoming Training**: If all goes well, we'll generate new embeddings for our expanded dataset and hopefully begin training later today or tomorrow.
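
The two de-duplication steps in the highlights above could be sketched roughly as follows. This is a minimal illustration, not the code from the video: `normalize`, `dedupe_texts`, `EmbeddingStore`, and `toy_encode` are hypothetical names, the "database" is a plain in-memory dictionary, and the encoder is a placeholder function rather than a real Sentence Transformers model. It also assumes that two sentences count as duplicates when they match after lowercasing and collapsing whitespace, which may differ from the rule the project actually uses.

```python
import hashlib


def normalize(text):
    """Lowercase and collapse whitespace so trivial variants compare equal."""
    return " ".join(text.split()).lower()


def dedupe_texts(texts):
    """Return texts with duplicates removed, keeping the first occurrence of each."""
    seen = set()
    unique = []
    for text in texts:
        key = normalize(text)
        if key not in seen:
            seen.add(key)
            unique.append(text)
    return unique


class EmbeddingStore:
    """Hypothetical in-memory stand-in for the embedding database."""

    def __init__(self, encode_fn):
        self._encode = encode_fn  # assumption: any callable mapping str -> vector
        self._vectors = {}        # content hash -> embedding
        self.encode_calls = 0     # counts how often the encoder actually ran

    @staticmethod
    def key_for(text):
        """Stable key derived from the normalized text."""
        return hashlib.sha256(normalize(text).encode("utf-8")).hexdigest()

    def get_or_add(self, text):
        """Encode and store the text only if no embedding exists for its key."""
        key = self.key_for(text)
        if key not in self._vectors:
            self._vectors[key] = self._encode(text)
            self.encode_calls += 1
        return self._vectors[key]

    def __len__(self):
        return len(self._vectors)


def toy_encode(text):
    """Placeholder encoder (NOT a real model): two cheap numeric features."""
    return [float(len(text)), float(sum(map(ord, text)) % 97)]
```

To try it, build a store with `EmbeddingStore(toy_encode)` and call `get_or_add` on each sentence. Repeated sentences reuse the stored vector, so `encode_calls` and `len(store)` only grow for new content. With a real model, the `encode` method of a loaded Sentence Transformers model could be passed in place of `toy_encode`.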

Join us as we continue to build, debug, and optimize our LLM project step by step. Whether you're a beginner or an experienced developer, this episode offers valuable insights into developing, testing, and enhancing an LLM using custom tools and techniques.

Stay tuned and let's get started! 🚀




Other Videos By p3nGu1nZz


2024-09-15 Unity ML-Agents | Pretrain an LLM from Scratch with Sentence Transformers | Part 20c
2024-09-14 Unity ML-Agents | Pretrain an LLM from Scratch with Sentence Transformers | Part 20
2024-09-14 Unity ML-Agents | Pretrain an LLM from Scratch with Sentence Transformers | Part 20b
2024-09-12 Unity ML-Agents | Pretrain an LLM from Scratch with Sentence Transformers | Part 19
2024-09-11 Unity ML-Agents | Pretrain an LLM from Scratch with Sentence Transformers | Part 18
2024-09-10 Unity ML-Agents | Pretrain an LLM from Scratch with Sentence Transformers | Part 16
2024-09-10 Unity ML-Agents | Pretrain an LLM from Scratch with Sentence Transformers | Part 17
2024-09-09 Unity ML-Agents | Pretrain an LLM from Scratch with Sentence Transformers | Part 15f
2024-09-09 Unity ML-Agents | Pretrain an LLM from Scratch with Sentence Transformers | Part 15e
2024-09-08 Unity ML-Agents | Pretrain an LLM from Scratch with Sentence Transformers | Part 15d
2024-09-07 Unity ML-Agents | Pretrain an LLM from Scratch with Sentence Transformers | Part 15c
2024-09-05 Unity ML-Agents | Pretrain an LLM from Scratch with Sentence Transformers | Part 15
2024-09-05 Unity ML-Agents | Pretrain an LLM from Scratch with Sentence Transformers | Part 15b
2024-09-04 Unity ML-Agents | Pretrain an LLM from Scratch with Sentence Transformers | Part 14
2024-09-03 Unity ML-Agents | Pretrain an LLM from Scratch with Sentence Transformers | Part 13
2024-08-30 Unity ML-Agents | Pretrain an LLM from Scratch with Sentence Transformers | Part 12
2024-08-28 Unity ML-Agents | Pretrain an LLM from Scratch with Sentence Transformers | Part 11
2024-08-28 Unity ML-Agents | Pretrain an LLM from Scratch with Sentence Transformers | Part 10
2024-08-26 Unity ML-Agents | Pretrain an LLM from Scratch with Sentence Transformers | Part 9
2024-08-24 Unity ML-Agents | Pretrain an LLM from Scratch with Sentence Transformers | Part 8
2024-08-22 Unity ML-Agents | Pretrain an LLM from Scratch with Sentence Transformers | Part 7