LlamaGen's Image Revolution, Husky: The Multi-Step Reasoner, Vript's Video Breakthrough, VALL-E 2 Achieves Human Parity
Jun 14, 2024•11 min•Ep. 49
Episode description
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation
Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning
Vript: A Video Is Worth Thousands of Words
Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis
VALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text to Speech Synthesizers
