Two AI Researchers - Ravid Shwartz Ziv, and Allen Roush, discuss the latest trends, news, and research within Generative AI, LLMs, GPUs, and Cloud Systems.
The player is loading ...
Stefano Ermon on Diffusion LLMs, Mercury & Why the Future of AI Won't Be Autoregressive
New to The Information Bottleneck?
Here are some great episodes to start with.Or, check out episodes by topic.
In this episode, we talk with Stefano Ermon, Stanford professor, co-founder & CEO of Inception AI, and co-inventor of DDIM, FlashAttention, DPO, and score-based/diffusion models, about why diffusion-based language models may...
Naomi Saphra, Kempner Research Fellow at Harvard and incoming Assistant Professor at Boston University, joins us to explain why you can't do interpretability without understanding training dynamics, in the same way you can't...
Stefano Soatto, VP for AI at AWS and Professor at UCLA, joins us to explore how the agentic era fundamentally redefines machine learning, from static train-and-test models to dynamic, interactive control systems. This shift u...
Tanishq Abraham , CEO and co-founder of Sophont.ai , joins us to talk about building foundation models specifically for medicine. Sophont is trying to be something like an OpenAI or Anthropic but for healthcare - training mo...
Anastasios Angelopoulos , Co-Founder and CEO of Arena AI (formerly LMArena), joins us to talk about why static benchmarks are failing, how human preference data actually works under the hood, and what it takes to be the "gold...
Fred Sala, Assistant Professor at UW-Madison and Chief Scientist at Snorkel AI, joins us to talk about why personalization might be the next frontier for LLMs, why data still matters more than architecture, and how weak super...