Tina: Tiny LoRA Reasoning Models

Apr 25, 2025 · 16 min

Episode description

We discuss Tina, a family of efficient reasoning models built by applying Low-Rank Adaptation (LoRA) during reinforcement learning to a small 1.5B-parameter language model. The approach shows that strong reasoning performance, competitive with larger models, can be reached at significantly lower computational cost. The authors evaluate this minimalist strategy across a range of reasoning tasks and ablation studies, and hypothesize that LoRA facilitates rapid adaptation to the structural format of reasoning. Ultimately, Tina aims to democratize the development of reasoning models by demonstrating a highly cost-effective and reproducible methodology; all code and models are open-sourced.
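
For listeners unfamiliar with LoRA, the idea can be sketched in a few lines of plain Python. This is a generic illustration of low-rank adaptation, not the authors' training code: the helper names (`matvec`, `lora_forward`, `trainable_params`), the toy dimensions, and the `alpha` scaling value are all assumptions chosen for the example. The frozen weight matrix `W` is left untouched, and only two small matrices `A` and `B` would be trained.

```python
import random

def matvec(m, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, v)) for row in m]

def lora_forward(W, A, B, x, alpha):
    """Compute W x + (alpha / rank) * B (A x); W stays frozen."""
    rank = len(A)
    base = matvec(W, x)
    update = matvec(B, matvec(A, x))
    scale = alpha / rank
    return [b + scale * u for b, u in zip(base, update)]

def trainable_params(d_out, d_in, rank):
    """Return (full fine-tuning, LoRA) trainable counts for one matrix."""
    return d_out * d_in, rank * (d_in + d_out)

# Toy sizes, chosen only for illustration.
random.seed(0)
d_out, d_in, rank = 4, 6, 2
W = [[random.gauss(0, 1) for _ in range(d_in)] for _ in range(d_out)]
A = [[random.gauss(0, 0.01) for _ in range(d_in)] for _ in range(rank)]
B = [[0.0] * rank for _ in range(d_out)]  # zero init: no change at step 0
x = [1.0] * d_in

y = lora_forward(W, A, B, x, alpha=4.0)
full, lora = trainable_params(2048, 2048, 16)  # illustrative layer size
```

Because `B` starts at zero, the adapted layer initially reproduces the base model's output exactly. For the illustrative 2048 x 2048 layer at rank 16, the trainable count falls from 4,194,304 to 65,536, which is the kind of saving that makes the low-cost training discussed in the episode plausible.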
