[ML News] Jamba, CMD-R+, and other new models (yes, I know this is like a week behind 🙃)

Subscribers:
251,000
Published on ● Video Link: https://www.youtube.com/watch?v=Kk8YhCpo1b8



Duration: 27:31
24,470 views
1,019


A flurry of new models continues to appear.

Links:
Homepage: https://ykilcher.com
Merch: https://ykilcher.com/merch
YouTube: https://www.youtube.com/c/yannickilcher
Twitter: https://twitter.com/ykilcher
Discord: https://ykilcher.com/discord
LinkedIn: https://www.linkedin.com/in/ykilcher

If you want to support me, the best thing to do is to share out the content :)

If you want to support me financially (completely optional and voluntary, but a lot of people have asked for this):
SubscribeStar: https://www.subscribestar.com/yannickilcher
Patreon: https://www.patreon.com/yannickilcher
Bitcoin (BTC): bc1q49lsw3q325tr58ygf8sudx2dqfguclvngvy2cq
Ethereum (ETH): 0x7ad3513E3B8f66799f507Aa7874b1B0eBC7F85e2
Litecoin (LTC): LQW2TRyKYetVC8WjFkhpPhtpbDM4Vw7r9m
Monero (XMR): 4ACL8AGrEo5hAir8A9CeVrW8pEauWvnp1WnSDZxW7tziCDLhZAGsgzhRQABDnFy8yuM9fWJDviJPHKRjV4FWt19CJZN9D4n




Other Videos By Yannic Kilcher


2 days agoTransformerFAM: Feedback attention is working memory
3 days ago[ML News] Devin exposed | NeurIPS track for high school students
6 days agoLeave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
2024-04-23[ML News] Llama 3 changes the game
2024-04-17Hugging Face got hacked
2024-04-15[ML News] Microsoft to spend 100 BILLION DOLLARS on supercomputer (& more industry news)
2024-04-13[ML News] Jamba, CMD-R+, and other new models (yes, I know this is like a week behind 🙃)
2024-04-08Flow Matching for Generative Modeling (Paper Explained)
2024-04-06Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping (Searchformer)
2024-03-26[ML News] Grok-1 open-sourced | Nvidia GTC | OpenAI leaks model names | AI Act
2024-03-17[ML News] Devin AI Software Engineer | GPT-4.5-Turbo LEAKED | US Gov't Report: Total Extinction
2024-03-10[ML News] Elon sues OpenAI | Mistral Large | More Gemini Drama
2024-03-07On Claude 3
2024-03-05No, Anthropic's Claude 3 is NOT sentient
2024-03-01[ML News] Groq, Gemma, Sora, Gemini, and Air Canada's chatbot troubles
2024-02-22Gemini has a Diversity Problem
2024-02-19V-JEPA: Revisiting Feature Prediction for Learning Visual Representations from Video (Explained)
2024-02-18What a day in AI! (Sora, Gemini 1.5, V-JEPA, and lots of news)
2024-02-04Lumiere: A Space-Time Diffusion Model for Video Generation (Paper Explained)
2024-01-21AlphaGeometry: Solving olympiad geometry without human demonstrations (Paper Explained)
2024-01-13Mixtral of Experts (Paper Explained)



Tags:
deep learning
machine learning
arxiv
explained
neural networks
ai
artificial intelligence
paper