Transformers and State-Space Models Unite, Multi-modal LLM Benchmark, Perplexity in Data Pruning, Advancing 4D Content Generation
Jun 05, 2024 • 10 min • Ep. 42
Episode description
Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality
Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models
Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling
4Diffusion: Multi-view Video Diffusion Model for 4D Generation
