Trillium TPU, built to power the future of AI

Published on 2024-10-29 ● Video Link: https://www.youtube.com/watch?v=RjRQ1DYnuJA





To deliver the next frontier of models and enable you to do the same, we’re excited to announce Trillium, our sixth-generation TPU, the most performant and most energy-efficient TPU to date.

More than a decade ago, Google recognized the need for a first-of-its-kind chip for machine learning. In 2013, we began work on the world’s first purpose-built AI accelerator, TPU v1, followed by the first Cloud TPU in 2017. Without TPUs, many of Google’s most popular services (such as real-time voice search, photo object recognition, and interactive language translation), along with state-of-the-art foundation models such as Gemini, Imagen, and Gemma, would not be possible.

Trillium TPUs achieve an impressive 4.7X increase in peak compute performance per chip compared to TPU v5e. We doubled the High Bandwidth Memory (HBM) capacity and bandwidth, and also doubled the Interchip Interconnect (ICI) bandwidth over TPU v5e. Additionally, Trillium is equipped with third-generation SparseCore, a specialized accelerator for processing ultra-large embeddings common in advanced ranking and recommendation workloads. Trillium TPUs make it possible to train the next wave of foundation models faster and serve those models with reduced latency and lower cost. Critically, our sixth-generation TPUs are also our most sustainable: Trillium TPUs are over 67% more energy-efficient than TPU v5e.
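
To make the relative figures above concrete, here is a minimal sketch of the arithmetic. It assumes a TPU v5e baseline normalized to 1.0 (not real hardware specifications), treats "over 67%" as a lower bound of 1.67x, and interprets energy efficiency as work done per unit of energy. The names `TRILLIUM_VS_V5E`, `relative_energy_for_same_work`, and `relative_peak_time` are hypothetical and used only for illustration.

```python
# Illustrative arithmetic only. The multipliers are the relative figures quoted
# in the text, applied to a normalized TPU v5e baseline of 1.0 (an assumption,
# not a hardware specification).
V5E_BASELINE = 1.0

TRILLIUM_VS_V5E = {
    "peak_compute_per_chip": 4.7,
    "hbm_capacity": 2.0,
    "hbm_bandwidth": 2.0,
    "ici_bandwidth": 2.0,
    "energy_efficiency": 1.67,  # "over 67% more", treated here as a lower bound
}


def relative_energy_for_same_work(efficiency_gain: float) -> float:
    """Energy for a fixed amount of work, relative to the baseline.

    Assumes efficiency means work per unit of energy.
    """
    return V5E_BASELINE / efficiency_gain


def relative_peak_time(compute_gain: float) -> float:
    """Time for a fixed, purely compute-bound job at peak throughput.

    Real workloads rarely sustain peak compute, so this is a best case.
    """
    return V5E_BASELINE / compute_gain


energy = relative_energy_for_same_work(TRILLIUM_VS_V5E["energy_efficiency"])
peak_time = relative_peak_time(TRILLIUM_VS_V5E["peak_compute_per_chip"])

print(f"Energy for same work: {energy:.2f}x baseline ({1 - energy:.0%} less)")
print(f"Best-case compute-bound time: {peak_time:.2f}x baseline")
```

Under these assumptions, a 67% efficiency gain corresponds to roughly 40% less energy for the same work, not 67% less, because the gain applies to work per unit of energy rather than to energy consumed.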

https://cloud.google.com/blog/products/compute/trillium-sixth-generation-tpu-is-in-preview



