Evaluating LLMs with Diverse Models, Novel Robotic Skills Framework, Editing 3D Graphics with VLMs
Apr 30, 2024•11 min•Ep. 19
Episode description
Replacing Judges with Juries: Evaluating LLM Generations with a Panel of
Diverse Models
LEGENT: Open Platform for Embodied Agents
Ag2Manip: Learning Novel Manipulation Skills with Agent-Agnostic Visual
and Action Representations
Kangaroo: Lossless Self-Speculative Decoding via Double Early Exiting
BlenderAlchemy: Editing 3D Graphics with Vision-Language Models
For the best experience, listen in Metacast app for iOS or Android
