Exploring the Latent Capacity of LLMs for One-Step Text Generation - podcast episode cover

Exploring the Latent Capacity of LLMs for One-Step Text Generation

May 29, 2025•21 min•Ep. 819
--:--
--:--
Download Metacast podcast app
Listen to this episode in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episode description

🤗 Upvotes: 40 | cs.CL, cs.AI, cs.LG

Authors:
Gleb Mezentsev, Ivan Oseledets

Title:
Exploring the Latent Capacity of LLMs for One-Step Text Generation

Arxiv:
http://arxiv.org/abs/2505.21189v1

Abstract:
A recent study showed that large language models (LLMs) can reconstruct surprisingly long texts - up to thousands of tokens - via autoregressive generation from just one specially trained input embedding. In this work, we explore whether such reconstruction is possible without autoregression. We show that frozen LLMs can generate hundreds of accurate tokens in just one forward pass, when provided with only two learned embeddings. This reveals a surprising and underexplored capability of LLMs - multi-token generation without iterative decoding. We investigate the behaviour of these embeddings and provide insight into the type of information they encode. We also empirically show that although these representations are not unique for a given text, they form connected and local regions in embedding space - a property that suggests the potential of learning a dedicated encoder into that space.

For the best experience, listen in Metacast app for iOS or Android