Build RAG-based large language model applications with Ray on Google Kubernetes Engine

Channel:
Subscribers:
276,000
Published on ● Video Link: https://www.youtube.com/watch?v=YU8yIyuFeJo



Duration: 26:14
243 views
0


Large Language Models (LLMs) have changed the way we interact with information. A base LLM is only aware of the information it was trained on. Retrieval augmented generation (RAG) can address this issue by providing context of additional data sources. In this session, we’ll build a RAG-based LLM application that incorporates external data sources to augment an OSS LLM. We’ll show how to scale the workload with distributed kubernetes compute, and showcase a chatbot agent that gives factual answers.

Speakers: Kai-Hsun Chen , Winston Chiang

Watch more:
All sessions from Google Cloud Next → https://goo.gle/next24

#GoogleCloudNext

DEV100




Other Videos By Google Cloud


2024-07-01AI for media: How Paramount+ uses artificial intelligence to streamline and personalize video
2024-07-01Five ways AI-assisted API automation can supercharge your platform engineering
2024-07-01From prototype to production: generative AI with Vertex AI
2024-07-01Making education more personal and accessible
2024-07-01The biology generative AI solution for pharma research and development
2024-07-01Talk with your business data using generative AI
2024-07-01Document understanding with Vertex AI
2024-07-01Founder series: Dario Amodei CEO & Co-Founder Anthropic & Elad Gil ENTR, Investor, Startup Helper
2024-07-01Integrate Gemini for Google Cloud with your applications and data
2024-07-01Founder series panel: State of startups
2024-07-01Build RAG-based large language model applications with Ray on Google Kubernetes Engine
2024-07-01Fighting piracy with Globo: protect media streams in real time with CDN
2024-07-01Shifting left, delivering right: Insights from Datadog's software delivery journey
2024-07-01Build conversational experiences in a few clicks with generative AI
2024-07-01Bayer and Insmed: Revolutionizing Healthcare with AI
2024-07-01C-Suite dilemma: Generative AI and its applications
2024-07-01Founder series panel: The top five things startup leaders need to know about generative AI
2024-07-01A guide for enterprises: How to implement generative AI applications
2024-07-01Impacting Global Marketing Efficiency in 30 Days: Bosch's GenAI Fast Track
2024-07-01Supercharge your development workflow with Gemini
2024-07-01Scaling AI in a highly regulated industry, powered by vertex AI