AI Papers by Henri Nguembi

Claude Henri Nguembi · rss.com

We use NotebookLM to explain the latest and most important AI papers. Our two hosts explain complex matters in a simple and fun way.

Episodes

Introduction to Reinforcement Learning

In this episode we explore Reinforcement Learning, an AI framework used in systems such as ChatGPT. Reinforcement Learning, a subfield of Artificial Intelligence, is a method by which machines learn optimal decision-making through trial and error, receiving rewards or penalties for their actions. This beginner-friendly introduction covers fundamentals such as basic terminology (agents, environments, and rewards) alongside core concepts like the Markov Decision Process. The text furt...

Apr 20, 2025 · 22 min

DeepSeek-R1: Reasoning LLMs via Reinforcement Learning

We talk about DeepSeek-R1, a novel language model with enhanced reasoning capabilities achieved through reinforcement learning (RL). The researchers explored training methodologies, including DeepSeek-R1-Zero, which uniquely uses large-scale RL without initial supervised fine-tuning (SFT), demonstrating emergent reasoning behaviors. To improve readability and further boost performance, DeepSeek-R1 incorporates a multi-stage training process with cold-start data before RL and achieves res...

Apr 02, 2025 · 31 min

Biology of a Large Language Model

In this first episode we dive into a paper from Anthropic called Biology of a Large Language Model, in which the authors present a detailed investigation into the inner workings of the large language model Claude 3.5 Haiku, employing a methodology centered around attribution graphs to understand how it processes information and generates responses. Through various case studies, the authors explore phenomena such as multi-step reasoning, planning in poetry generation, and multilingual understand...

Mar 31, 2025 · 27 min