Two AI Researchers - Ravid Shwartz Ziv, and Allen Roush, discuss the latest trends, news, and research within Generative AI, LLMs, GPUs, and Cloud Systems.
The player is loading ...
EP26: Measuring Intelligence in the Wild - Arena and the Future of AI Evaluation
New to The Information Bottleneck?
Here are some great episodes to start with.Or, check out episodes by topic.
Anastasios Angelopoulos , Co-Founder and CEO of Arena AI (formerly LMArena), joins us to talk about why static benchmarks are failing, how human preference data actually works under the hood, and what it takes to be the "gold...
Fred Sala, Assistant Professor at UW-Madison and Chief Scientist at Snorkel AI, joins us to talk about why personalization might be the next frontier for LLMs, why data still matters more than architecture, and how weak super...
Bayan Bruss, VP of Applied AI at Capital One, joins us to talk about building AI systems that can make autonomous financial decisions, and why money might be the hardest problem in machine learning. Bayan leads Capital One's ...
David Mezzetti , creator of TextAI, joins us to talk about building open source AI frameworks as a solo developer - and why local-first AI still matters in the age of API-everything. David's path from running a 50-person IT c...
Cody Blakeney from Datology AI joins us to talk about data curation - the unglamorous but critical work of figuring out what to actually train models on. Cody's path from writing CUDA kernels to spending his days staring at w...
Guest: Niloofar Mireshghallah (Incoming Assistant Professor at CMU, Member of Technical Staff at Humans and AI) In this episode, we dive into AI privacy, frontier model capabilities, and why academia still matters. We kick of...