AI Models Break New Ground, Human Feedback Shapes Video Generation, and Open-Source Projects Challenge Tech Giants

AI Papers Podcast

Dec 09, 2024•10 min

Episode description

Today's tech landscape sees a dramatic shift as artificial intelligence reaches new milestones in understanding and creating content, with open-source projects increasingly rivaling commercial giants. At the heart of these developments is a growing focus on human preferences and feedback, suggesting a future where AI systems become more attuned to human needs while remaining accessible to the broader research community.

Links to all the papers we discussed: Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling; LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment; MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale
