Building a LLM Judge with Weights & Biases

Subscribers:
118,000
Published on ● Video Link: https://www.youtube.com/watch?v=zaNR3WaPTfo



Duration: 0:00
267 views
0


Evaluating LLM outputs accurately is critical to being able to iterate quickly on a LLM system. Human annotations can be slow and expensive and using LLMs instead promises to solve this. However, aligning a LLM Judge with human judgements is often hard with many implementation details to consider. In this workshop we will explore:\nEvaluating specialized LLMs using Weave\nProductionizing the latest LLM-as-a-judge research\nImproving on your existing judge\nBuilding annotation UIs\n\n#MicrosoftReactor\n\n[eventID:23760]




Other Videos By Microsoft Reactor


2024-10-30Segurança de Infra em Nuvem
2024-10-29Harnessing AI for Business Growth: Practical Strategies for Entrepreneurs
2024-10-29BAM Skilling: Microsoft Azure AI Fundamentals - Parte 2
2024-10-29Azure product retirements: unplugged (Option 2)
2024-10-29Semantic Kernel Office Hours for US/EMEA - October 30th, 2024
2024-10-29Petoi Robot Dog and Azure AI Model Inference API Integration
2024-10-29.NET Microservices in Azure Container Apps
2024-10-29Inspire-se com a experiência de Matheus competindo na Imagine Cup! Os benefícios são infinitos
2024-10-29[粵語] Learn Azure in Hong Kong | AI @ Industries
2024-10-29The Reliable Web App Pattern for Java with Enhanced Security and Scalability on Azure
2024-10-28Building a LLM Judge with Weights & Biases
2024-10-28Microsoft 365 Para Elas: próximos passos em seu plano de estudos!
2024-10-28Princípios básicos e conceitos de nuvem
2024-10-28Caminhos para uma Carreira em Cloud para Mulheres
2024-10-27Build and Extend copilot using Copilot Studio | #MVPConnect
2024-10-27Lançamento do Programa BAM Skilling + Microsoft Azure AI Fundamentals - Parte 1
2024-10-27Copilot for OneDrive:文件交互的新浪潮 | Global Season of AI:Copilot for Microsoft365 + LowCode [1]
2024-10-27Copilot for Word:解锁文档处理新途径 | Global Season of AI:Copilot for Microsoft365 + LowCode [3]
2024-10-27Copilot Studio:打造个性化的 AI 助手 | Global Season of AI:Copilot for Microsoft365 + LowCode [2]
2024-10-27Tips & tricks for Microsoft Fabric DP-600 exam day
2024-10-2704 Copilot for Power Platform:Copilot for Power BI 应用尝鲜