AI Models Break New Ground, Human Feedback Shapes Video Generation, and Open-Source Projects Challenge Tech Giants - podcast episode cover

AI Models Break New Ground, Human Feedback Shapes Video Generation, and Open-Source Projects Challenge Tech Giants

Dec 09, 202410 min
--:--
--:--
Download Metacast podcast app
Listen to this episode in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episode description

Today's tech landscape sees a dramatic shift as artificial intelligence reaches new milestones in understanding and creating content, with open-source projects increasingly rivaling commercial giants. At the heart of these developments is a growing focus on human preferences and feedback, suggesting a future where AI systems become more attuned to human needs while remaining accessible to the broader research community. Links to all the papers we discussed: Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling, Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling, LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment, LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment, MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale, MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

For the best experience, listen in Metacast app for iOS or Android