[ML News] Jamba, CMD-R+, and other new models (yes, I know this is like a week behind 🙃)
A flurry of new models continues to appear.
Links:
Homepage: https://ykilcher.com
Merch: https://ykilcher.com/merch
YouTube: https://www.youtube.com/c/yannickilcher
Twitter: https://twitter.com/ykilcher
Discord: https://ykilcher.com/discord
LinkedIn: https://www.linkedin.com/in/ykilcher
If you want to support me, the best thing to do is to share the content :)
If you want to support me financially (completely optional and voluntary, but a lot of people have asked for this):
SubscribeStar: https://www.subscribestar.com/yannickilcher
Patreon: https://www.patreon.com/yannickilcher
Bitcoin (BTC): bc1q49lsw3q325tr58ygf8sudx2dqfguclvngvy2cq
Ethereum (ETH): 0x7ad3513E3B8f66799f507Aa7874b1B0eBC7F85e2
Litecoin (LTC): LQW2TRyKYetVC8WjFkhpPhtpbDM4Vw7r9m
Monero (XMR): 4ACL8AGrEo5hAir8A9CeVrW8pEauWvnp1WnSDZxW7tziCDLhZAGsgzhRQABDnFy8yuM9fWJDviJPHKRjV4FWt19CJZN9D4n
Other Videos By Yannic Kilcher:
2024-06-01 xLSTM: Extended Long Short-Term Memory
2024-05-21 [ML News] OpenAI is in hot waters (GPT-4o, Ilya Leaving, Scarlett Johansson legal action)
2024-05-01 ORPO: Monolithic Preference Optimization without Reference Model (Paper Explained)
2024-04-30 [ML News] Chips, Robots, and Models
2024-04-28 TransformerFAM: Feedback attention is working memory
2024-04-27 [ML News] Devin exposed | NeurIPS track for high school students
2024-04-24 Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
2024-04-23 [ML News] Llama 3 changes the game
2024-04-17 Hugging Face got hacked
2024-04-15 [ML News] Microsoft to spend 100 BILLION DOLLARS on supercomputer (& more industry news)
2024-04-13 [ML News] Jamba, CMD-R+, and other new models (yes, I know this is like a week behind 🙃)
2024-04-08 Flow Matching for Generative Modeling (Paper Explained)
2024-04-06 Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping (Searchformer)
2024-03-26 [ML News] Grok-1 open-sourced | Nvidia GTC | OpenAI leaks model names | AI Act
2024-03-17 [ML News] Devin AI Software Engineer | GPT-4.5-Turbo LEAKED | US Gov't Report: Total Extinction
2024-03-10 [ML News] Elon sues OpenAI | Mistral Large | More Gemini Drama
2024-03-07 On Claude 3
2024-03-05 No, Anthropic's Claude 3 is NOT sentient
2024-03-01 [ML News] Groq, Gemma, Sora, Gemini, and Air Canada's chatbot troubles
2024-02-22 Gemini has a Diversity Problem
2024-02-19 V-JEPA: Revisiting Feature Prediction for Learning Visual Representations from Video (Explained)
Tags: deep learning
machine learning
arxiv
explained
neural networks
ai
artificial intelligence
paper