What is the relationship between LLMs and multi-modality?

Video Link: https://www.youtube.com/watch?v=O_G5WJPekTc
Duration: 3:12


Check out my essays: https://aisc.substack.com/
OR book me to talk: https://calendly.com/amirfzpr
OR subscribe to our event calendar: https://lu.ma/aisc-llm-school

AF: One of the interesting strategies Cohere is using, versus other providers, is specializing a bunch of different models for different tasks: you have a re-ranker, you have a few other things you mentioned. In a lot of real-world systems, as you presented, you want to use other tools because they're specialized and potentially much better at a particular task than LLMs would ever be. So, expand on that philosophy.

JA: This is one of the things that sets Cohere apart: the focus on practical applications right now instead of chasing AGI or trying to create superhuman intelligence. A lot of it comes down to: how can we build the best AI systems to empower the next generation of software systems? To us, that breaks down into two families of models. One is search and retrieval models; search and retrieval has been one of the deepest areas of computer science, a massive and fascinating research area that continues to be improved. We have teams who are specifically focused on search and retrieval.
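To make the retrieve-and-rerank idea concrete, here is a minimal sketch using Cohere's Python SDK. The API key, document list, and model name are illustrative placeholders, not anything discussed in the talk; check the current Cohere docs for available rerank models.

```python
import cohere

co = cohere.Client("YOUR_API_KEY")  # placeholder key

# Candidate passages, e.g. the output of a first-stage vector
# or keyword search over internal documents.
documents = [
    "Our Q3 revenue grew 12% year over year.",
    "The onboarding guide covers SSO configuration.",
    "Re-rankers score query-document pairs directly.",
]

# The re-ranker scores each (query, document) pair and returns
# the passages ordered by relevance to the query.
results = co.rerank(
    model="rerank-english-v3.0",  # assumed model name
    query="How do re-rankers improve retrieval quality?",
    documents=documents,
    top_n=2,
)

for r in results.results:
    print(r.index, round(r.relevance_score, 3))
```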

Companies want the ability to chat with their data, or to make sense of their internal data, even through private deployments. They want the model to come to their data, which is another focus area.

I also speak with a lot of developers who ask: can I do this with a language model? Yes, you can send that problem to a trillion-parameter model, but a lot of the time it will be solved better by a 300-million-parameter model that's specifically geared for the use case. It does it at much better latency, and you don't have to shard the deployment across 100 GPUs.
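As a sketch of the small-specialized-model point: a compact encoder fine-tuned for one task runs with low latency on a single machine. The model below is an illustrative stand-in from the Hugging Face hub, not one mentioned in the conversation.

```python
from transformers import pipeline

# A distilled encoder (~66M parameters) fine-tuned for sentiment
# classification. No sharding across GPUs; it runs fine on CPU.
classifier = pipeline(
    "text-classification",
    model="distilbert-base-uncased-finetuned-sst-2-english",
)

print(classifier("The new release fixed our latency problems."))
# e.g. [{'label': 'POSITIVE', 'score': 0.99...}]
```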

So the major focus here is just being practical: picking the best tool for the problem, and efficiency. Yes, you can build massive models that are general problem solvers, but to deploy things into production you need to think about cost, latency, how many GPUs you need, and memory footprint.

AF: And robustness. You could over-engineer a huge system that isn't fine-tuned to do anything specifically, and then you would complain: why is it hallucinating all these things?

What you presented was focused mostly on what happens to text, but the majority of the available data is structured or in other modalities. How does what you spoke about apply to those? Language and text are one portion of the data. That portion is largely untapped, but we also have a lot of systems already set up to handle those other types of data. So how do all these different worlds talk to each other?

JA: I'm excited about multi-modal embedding models. That's an area that can fit other modalities into vector search as it exists today. One modality that is relevant to all of this, and builds on text, is code. Once you improve the code generation capability, you improve things like tool use, reasoning, and the model's ability to become an operator of these tools. I would rank that as the second most important modality.
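A minimal sketch of how multi-modal embeddings slot into existing vector search: a CLIP-style model from sentence-transformers maps images and text into the same vector space, so ordinary cosine-similarity search applies unchanged. The model name and image path are illustrative assumptions.

```python
from PIL import Image
from sentence_transformers import SentenceTransformer, util

# A CLIP-style model that embeds both images and text into one space.
model = SentenceTransformer("clip-ViT-B-32")

# Embed a text query and an image the same way you would embed documents.
text_emb = model.encode("a cat sleeping on a laptop")
img_emb = model.encode(Image.open("photo.jpg"))  # hypothetical local file

# Ordinary cosine similarity: the vector-search machinery is unchanged.
print(util.cos_sim(text_emb, img_emb))
```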

Beyond that, the relevant modalities depend on the use case. If you're working in media, it's going to be video and audio. If you're working in music, audio and waveforms are probably the modalities relevant to you.


Tags:
deep learning
machine learning