Build a Gen-AI Application Across Multiple AWS Instances with OPEA | AI with Guy

Subscribers:
256,000
Published on ● Video Link: https://www.youtube.com/watch?v=ukHHMnqFrns



Duration: 0:00
209 views
9


Build a Gen-AI Application Across Multiple AWS Instances with OPEA

This video demonstrates how to build a real, production-grade retrieval-augmented generation (RAG) system using the OPEA framework on multiple AWS instances. Learn how to orchestrate distributed Gen-AI workloads that scale using Intel Gaudi accelerators and cloud-native architecture.

What you’ll learn:
– How to set up a multi-server Gen-AI application using OPEA
– How to deploy a RAG pipeline across AWS EC2 instances
– How to leverage Intel Gaudi for efficient large language model inference
– How to connect components like vector databases, OpenAI API, and vLLM
– How to optimize Gen-AI performance using open-source tools and Intel software

This walkthrough is ideal for developers, ML engineers, and architects looking to deploy cost-effective and scalable AI infrastructure in the cloud.

Resources:
https://opea.dev/
https://opea-project.github.io/latest/GenAIExamples/ChatQnA/README.html
https://cloud.intel.com/

#OPEA #GenAI #RAG #AWS #IntelGaudi #LLM #AIInfrastructure #OpenAI #vLLM #IntelSoftware #MachineLearning #AIatScaleAbout

Intel Software:
Intel® Developer Zone is committed to empowering and assisting software developers in creating applications for Intel hardware and software products. The Intel Software YouTube channel is an excellent resource for those seeking to enhance their knowledge. Our channel provides the latest news, helpful tips, and engaging product demos from Intel and our numerous industry partners. Our videos cover various topics; you can explore them further by following the links.

Connect with Intel Software:
INTEL SOFTWARE WEBSITE:https://intel.ly/2KeP1hDD
INTEL SOFTWARE on FACEBOOK:http://bit.ly/2z8MPFFF
INTEL SOFTWARE on TWITTER:http://bit.ly/2zahGSnn
INTEL SOFTWARE GITHUB:http://bit.ly/2zaih6zz
INTEL DEVELOPER ZONE LINKEDIN:http://bit.ly/2z979qss
INTEL DEVELOPER ZONE INSTAGRAM:http://bit.ly/2z9Xsbyy
INTEL GAME DEV TWITCH:http://bit.ly/2BkNshuu

#intelsoftware
Build a Gen-AI Application Across Multiple AWS Instances with OPEA | AI with Guy | Intel Software




Other Videos By Intel Software


2025-06-13Run Ollama + Web-UI on Your AI PC | AI With Guy
2025-06-13Get a GPU VM in One Click | AI With Guy
2025-06-11The Heart of HPC Today: Heterogeneous Computing | Intel Software
2025-06-10FAMU-FSU: Up-to-the-Minute Lessons in AI with the help of the Educator Program by Intel
2025-06-10Cornell University: A Support System to Optimize Curriculum, Course Materials and Student Engagement
2025-06-10Cal Poly: Breaking New Ground in Programming Curriculum, Without Reinventing the Wheel
2025-06-10RAG Pipeline Using Standard Libraries and OPEA | AI with Guy |
2025-06-09Run PyTorch 2.7 on Intel GPUs: A Step-by-Step Setup | AI with Guy
2025-06-06GPU Coding Using Triton Compiler | AI with Guy
2025-06-05vLLM Server Using OpenAI API on Gaudi 3 | AI with Guy
2025-06-04Build a Gen-AI Application Across Multiple AWS Instances with OPEA | AI with Guy
2025-06-03OPEA vs. NVIDIA NIM: What’s Best for Your GenAI Deployment?
2025-05-28PyTorch Export Quantization with Intel GPUs | Intel Software
2025-05-23Unlocking Gen AI: From Experimentation to Production with Red Hat & Intel | Intel Software
2025-05-23Overcoming Deployment Challenges: Scaling AI in Edge Computing w/ Red Hat AI & Intel Edge Platforms
2025-05-23Discover AI Innovations at Red Hat Summit with Intel: RHEL AI, OpenShift AI & Edge AI
2025-05-22Explore OpenVINO Model Hub – Instantly Compare AI Model Performance Across Devices | AI with Guy
2025-05-22Build a RAG Chatbot with OPEA on AWS | AI with Guy | Intel Software
2025-05-20Enterprise AI Inference with Intel: Bill Pearson on Infrastructure & Standards | Intel Software
2025-05-20Automatically Quantize LLMs with AutoRound | Intel Software
2025-05-13Deploy Compiled PyTorch Models on Intel GPUs with AOTInductor | Intel Software