Build RAG-based large language model applications with Ray on Google Kubernetes Engine
Video Link: https://www.youtube.com/watch?v=YU8yIyuFeJo
Large Language Models (LLMs) have changed the way we interact with information. A base LLM is only aware of the information it was trained on. Retrieval augmented generation (RAG) addresses this limitation by supplying context from additional data sources at query time. In this session, we’ll build a RAG-based LLM application that incorporates external data sources to augment an open-source (OSS) LLM. We’ll show how to scale the workload with distributed Kubernetes compute, and showcase a chatbot agent that gives factual answers.
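The core RAG loop described above (retrieve relevant documents, then augment the LLM prompt with them) can be sketched in a few lines. This is a hypothetical minimal illustration, not code from the session: it uses a toy bag-of-words similarity in place of a real embedding model, and the document snippets and function names are invented for the example.

```python
import math
import re
from collections import Counter

def embed(text: str) -> Counter:
    # Toy bag-of-words "embedding"; a real RAG system would use a
    # neural embedding model served at scale (e.g. with Ray).
    return Counter(re.findall(r"\w+", text.lower()))

def cosine(a: Counter, b: Counter) -> float:
    # Cosine similarity between two sparse term-count vectors.
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query: str, docs: list[str], k: int = 2) -> list[str]:
    # Rank documents by similarity to the query and keep the top k.
    q = embed(query)
    return sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)[:k]

def build_prompt(query: str, docs: list[str]) -> str:
    # Augment the prompt with retrieved context before calling the LLM.
    context = "\n".join(f"- {d}" for d in retrieve(query, docs))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"

# Hypothetical document store for illustration.
docs = [
    "GKE is Google Kubernetes Engine, a managed Kubernetes service.",
    "Ray is an open-source framework for scaling Python workloads.",
    "RAG augments an LLM prompt with retrieved documents.",
]
print(build_prompt("What is Ray?", docs))
```

In production, the retrieval step would query a vector database of precomputed embeddings, and the final prompt would be sent to the LLM; the sketch only shows the prompt-construction half of the pipeline.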
Speakers: Kai-Hsun Chen, Winston Chiang
Watch more:
All sessions from Google Cloud Next → https://goo.gle/next24
#GoogleCloudNext
DEV100