Are long context LLMs the death of RAG?

Video Link: https://www.youtube.com/watch?v=Ng-EnWrwsAg



Duration: 1:45


AF: You said people are saying that with long-context LLMs, RAG is not that interesting. I want to go back to that and underline it. Why do you disagree?

AM: I think some people misunderstand what RAG is good for when they say that. With Gemini and its really long context window, I can throw all the text I want at it and it can reference it inside its own context window, so why do I need RAG? For me, RAG is not just about giving an answer based on that information. Having a data set that you manage separately is quite important from an audit point of view. In our line of work in financial services, it's important to see the whole lineage of the data you've used to come to an answer. Fundamentally, the data is sacrosanct.

I think RAG is going to stay for a long time. With large context window models, your RAG systems can probably become more powerful: they can do more things at one time, or solve slightly more complex problems.
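The lineage point above could be sketched in code. This is a minimal, hypothetical illustration (the `Chunk` class, document IDs, and helper names are all invented, not from the conversation): each retrieved chunk carries provenance metadata, so every generated answer can be traced back to the exact documents and versions used.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone

# Hypothetical sketch: retrieved chunks carry lineage metadata so that
# answers built from them remain auditable after the fact.
@dataclass
class Chunk:
    text: str
    source_doc: str  # e.g. a document ID in the separately managed data set
    version: str     # which revision of the document was indexed
    retrieved_at: str = field(
        default_factory=lambda: datetime.now(timezone.utc).isoformat()
    )

def build_audit_trail(chunks):
    """Return the lineage record to store alongside the generated answer."""
    return [
        {"source": c.source_doc, "version": c.version, "retrieved_at": c.retrieved_at}
        for c in chunks
    ]

chunks = [
    Chunk("Q3 revenue rose 4%.", source_doc="10-Q/2024-Q3", version="v2"),
    Chunk("Guidance unchanged.", source_doc="earnings-call/2024-Q3", version="v1"),
]
trail = build_audit_trail(chunks)
print([t["source"] for t in trail])  # which documents fed this answer
```

Stuffing the same text into a long context window gives you the answer, but not this record of where the answer came from.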

AF: Maybe we need a new term for it, because RAG has moved on from how it was originally proposed. At the retriever stage there are so many things we're layering in these days: retrieval ranking, privacy controls, PII handling, access control, security controls, domain knowledge. We're doing so many things there that a long context window is not a solution to.

Even with a 4000-token context window, you see a lot of "lost in the middle" type problems. When you fill up the context, models are not particularly good at figuring out exactly where to pay attention.
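The layers listed above can be sketched as a pipeline. This is a toy illustration, not anyone's production system: the documents, ACL roles, PII rule, and ranking function are all invented to show how access control, PII handling, and retrieval ranking stack in front of the LLM call.

```python
import re

# Toy corpus: each document carries an access-control list (a set of roles).
DOCS = [
    {"id": "d1", "text": "Client John Doe, SSN 123-45-6789, holds bond X.", "acl": {"analyst"}},
    {"id": "d2", "text": "Bond X matures in 2030.", "acl": {"analyst", "intern"}},
    {"id": "d3", "text": "Internal memo on bond X pricing.", "acl": {"analyst"}},
]

def access_filter(docs, role):
    # Access control: drop documents the caller's role may not see.
    return [d for d in docs if role in d["acl"]]

def redact_pii(text):
    # PII handling (toy rule): mask anything shaped like a US SSN.
    return re.sub(r"\b\d{3}-\d{2}-\d{4}\b", "[REDACTED]", text)

def rank(docs, query):
    # Retrieval ranking (toy rule): score by term overlap with the query.
    terms = set(query.lower().split())
    return sorted(docs, key=lambda d: -len(terms & set(d["text"].lower().split())))

def retrieve(query, role, k=2):
    """Access control, then ranking, then redaction, before any LLM sees text."""
    visible = access_filter(DOCS, role)
    ranked = rank(visible, query)[:k]
    return [redact_pii(d["text"]) for d in ranked]

print(retrieve("bond X", role="intern"))  # the intern only ever sees d2
```

None of these controls exist if you skip retrieval and dump the whole corpus into a long context window: every caller would see every document, PII included.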

AM: I think everything you've said about those other elements is super important when it comes to enterprise-scale stuff. In all of this architecture, there are a lot of things I want to see stick around for a long time. If I just replace this with "talk to an LLM", I lose so much control. I lose so much auditability. I lose so much security.







Tags:
deep learning
machine learning