Value-Guided Search for Efficient Chain-of-Thought Reasoning

May 29, 2025 · 18 min

Episode description

This paper introduces Value-Guided Search (VGS), a method for improving the reasoning performance and efficiency of large language models (LLMs) on complex tasks such as competition math. Unlike prior methods that rely on fine-grained, step-by-step feedback, VGS uses a token-level value model trained on large datasets of reasoning traces. The value model guides a block-wise search: it scores candidate continuations one block at a time and keeps the most promising ones, rather than evaluating every individual step. The paper reports that VGS improves performance and reduces the compute required compared with existing techniques such as majority voting or search guided by process reward models. The authors also release their dataset, model, and codebase to support future research.
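
To make the block-wise idea concrete, here is a minimal Python sketch of a beam-style search guided by a value function. It is an illustration under stated assumptions, not the authors' released implementation: the names `blockwise_value_search`, `propose_blocks`, `value_fn`, and `is_finished` are hypothetical, and the toy proposer and scorer stand in for an LLM sampler and a trained value model.

```python
from typing import Callable, List


def blockwise_value_search(
    prompt: str,
    propose_blocks: Callable[[str], List[str]],  # hypothetical: samples candidate next blocks
    value_fn: Callable[[str], float],            # hypothetical: scores a partial trace
    is_finished: Callable[[str], bool],          # hypothetical: detects a completed trace
    beam_width: int = 2,
    max_blocks: int = 8,
) -> str:
    """Extend traces one block at a time, keeping the highest-valued candidates."""
    beams = [prompt]
    for _ in range(max_blocks):
        candidates = []
        for partial in beams:
            if is_finished(partial):
                # Finished traces are carried forward unchanged.
                candidates.append(partial)
                continue
            for block in propose_blocks(partial):
                candidates.append(partial + block)
        # Score whole blocks, not individual steps, and keep the top beam_width.
        candidates.sort(key=value_fn, reverse=True)
        beams = candidates[:beam_width]
        if all(is_finished(b) for b in beams):
            break
    return max(beams, key=value_fn)


# Toy stand-ins (assumptions for illustration only).
TARGET = "step1 step2 answer."


def toy_propose(partial: str) -> List[str]:
    # A real system would sample blocks of tokens from an LLM here.
    return ["step1 ", "step2 ", "answer.", "oops "]


def toy_value(text: str) -> float:
    # Reward characters that match the target prefix, penalize the rest.
    matched = 0
    for a, b in zip(text, TARGET):
        if a != b:
            break
        matched += 1
    return float(matched - (len(text) - matched))


def toy_finished(text: str) -> bool:
    return text.endswith(".")


result = blockwise_value_search("", toy_propose, toy_value, toy_finished)
```

In this sketch the value function is consulted once per block rather than once per reasoning step, which is the efficiency trade-off the description refers to. How blocks are sized and how final answers are aggregated are left out here.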
