From transformer to Terraform: Cohere’s large language model (LLM) takes flight

Channel:
Subscribers:
273,000
Published on ● Video Link: https://www.youtube.com/watch?v=k0QBgQJ-78A



Duration: 30:10
66 views
0


In the race to AI, it’s more crucial than ever to have strong, dependable infrastructure to build on. Since its founding in 2019, Cohere has experienced hypergrowth by empowering developers and enterprises to build amazing products while leveraging AI. Making these capabilities available to the world requires a tremendous amount of scale, both in Cohere’s LLMs and the underlying mass computational power. Come to this session featuring Sudip Roy, Director of Engineering at Cohere and Craig Alleva, Director of Customer Engineering, Google Cloud to discuss the passage to scale: from massive training on TPUs, to creating and operating a performant language model ecosystem served by Kubernetes and GPUs, and the technical and team choices made along the way.

Speakers: Sudip Roy, Craig Alleva

Watch more:
All sessions from Google Cloud Next → https://goo.gle/next23

#GoogleCloudNext




Other Videos By Google Cloud


2023-12-11Supercharging the public sector with AI
2023-12-11Transforming analytics and driving innovation with SAP and Google Cloud
2023-12-11How Goldman Sachs applies many layers of defense to secure container apps
2023-12-11Generative AI and search: Better together to unlock enterprise data
2023-12-11No app needed: How Volkswagen developed and delivered immersive experiences
2023-12-11Foundation models powering relevancy and ranking in retail search
2023-12-11How reverse mentoring can break down barriers and create a more inclusive workplace
2023-12-11Network security fundamentals: Creating layered network defenses with built-in tools
2023-12-11Identity Federation on Google Cloud with Goldman Sachs
2023-12-11Building belonging in the workplace
2023-12-11From transformer to Terraform: Cohere’s large language model (LLM) takes flight
2023-12-11Learn how CERC has disrupted the exchange receivables market with new Spanner innovations
2023-12-11The future of telecommunications: How data, AI, and networks are changing the game
2023-12-11How Ordaōs Bio takes advantage of generative AI on Google Kubernetes Engine
2023-12-11A new era for managed detection and response: Accenture MxDR powered by Google Chronicle
2023-12-11Generative AI for enterprises: What is it? How does it impact businesses? How do I incorporate it?
2023-12-11Data and AI Cloud for Marketing: Your marketing, multiplied by Google Cloud
2023-12-11Accenture and Google partner on next-generation cybersecurity
2023-12-11Intelligent interactions: How the best sales and contact center teams will use generative AI
2023-12-11Prevent cloud compromises: Learn how Uber discovers cyber risks and remediates threats
2023-12-11The modern customer journey and the rise of data-driven marketing