Transform Podcasts into Text with IBM Granite 3.3 & watsonx.ai!

Channel:
Subscribers:
73,500
Published on ● Video Link: https://www.youtube.com/watch?v=9EYOv93Yf54



Duration: 0:00
221 views
9


Discover how to convert spoken content into written form using IBM's latest AI Speech Model. In this tutorial, you'll learn to transcribe a podcast from YouTube using the open-source IBM Granite 3.3 speech model and summarize it with the Granite-3.3-8B-Instruct large language model (LLM) within a watsonx.ai notebook.

What You'll Learn:

— Setting up your environment using the Jupyter Notebook
— Installing and importing necessary libraries like torchaudio, moviepy, and transformers
— Downloading and processing podcast audio from YouTube
— Converting audio files for model inference and generating transcripts using Granite-speech-3.3-8b
— Summarizing transcripts using the Granite instruct model

Whether you're an AI enthusiast, developer, or researcher, this tutorial equips you with the tools to enhance your AI models' capabilities in processing and summarizing audio content.

🔗 Check out the full tutorial here: https://www.ibm.com/think/tutorials/automatic-speech-recognition-podcast-transcript-granite-watsonx-ai

🔗 Check out the Github notebook: https://github.com/IBM/ibmdotcom-tutorials/blob/main/generative-ai/granite-speech-3.3-8b.ipynb

🔗 Check out the models here:
https://huggingface.co/ibm-granite/granite-speech-3.3-8b
https://huggingface.co/ibm-granite/granite-3.3-8b-instruct


For more technical tutorials, articles, and learning resources, check out IBM Developer: https://ibm.biz/ibm-developer-yt

____________________________________________

Subscribe to see more developer content: https://ibm.biz/ibm-developer-yt-subscribe

Join the IBM TechXchange Community: https://ibm.biz/techxchange-community

Follow IBM Developer on LinkedIn: https://ibm.biz/ibm-developer-linkedin-yt

#AutomaticSpeechRecognition
#ASR
#Granite3.3
#watsonx.ai
#IBMDeveloper
#Developer
#Coding




Other Videos By IBM Developer


2025-04-18Diving deeper into common Granite 3.3 questions from r/LocalLLaMa launch post
2025-04-17Transform Podcasts into Text with IBM Granite 3.3 & watsonx.ai!
2025-04-17Master Chain-of-Thought Reasoning with IBM Granite 3.3-8b Instruct!
2025-04-09IBM Launches Llama 4 on Day 0! Exclusive watsonx.ai Team Conversation + Live Demo
2025-04-07Solve your complex IT Automations with low code/no code drag-and-drop options | S11 | TechCon 2025
2025-04-07Apply GitOps principles to tailor DataPower operations for a modern enterprise | G22 | TechCon 2025
2025-04-07EASeJ - A Cloud-managed Java application platform | D21 | TechCon 2025
2025-04-03Accelerate Application Migration & Modernization with Layer 7 Connectivity | S13 | TechCon 2025
2025-04-03Beyond Hours: The Science of Dual Capacity Measurement | T34 | TechCon 2025
2025-04-03Kubernetes Cost Optimization: Real-Time Visibility for Smarter Cloud Spend | T33 | TechCon 2025
2025-04-03AI increases Omni-Channel Sales & Profitability with Sterling Order Management | T31 | TechCon 2025
2025-04-03Unified observability and cloud cost management | T24 | TechCon 2025
2025-04-03Optimize Client Value for IT Financial Management with Strategic Portfolio Mgmt | T23 | TechCon 2025
2025-04-03Unlocking Resilience with IBM Concert + Instana | T22 | TechCon 2025
2025-04-03Maximo + AI infused visual inspection and defect detection | T21 | TechCon 2025
2025-04-03Unify Operations across components, tools, and teams | T14 | TechCon 2025
2025-04-03Optimizing IT Operations with IBM Turbonomic—Unlocking Efficiency & Performance | T13 | TechCon 2025
2025-04-03Automate Application Resilience and Risk with AI and Unified Observability | T12 | TechCon 2025
2025-04-03Cloud-Native Observability and AI-Driver Remediation with Instana | T11 | TechCon 2025
2025-04-03Beyond the Silo: Integrating IT Ops, Cybersecurity, and Observability | S34 | TechCon 2025
2025-04-03Hashi Corp - Security Lifecycle Management | S33 | TechCon 2025