This paper was accepted at the Workshop on Latent & Implicit Thinking – Going Beyond CoT Reasoning 2026 at ICLR.
Autoregressive language models trained with next-token prediction generate text by sampling one discrete token at a time. Although highly scalable, this objective forces the model to commit at every step, preventing it from exploring or reflecting upon multiple plausible continuations. Moreover, compute allocation across tokens is uniform: every token is produced by a single forward pass, potentially limiting the model’s expressiveness in cases where difficult tokens inherently require more compute. Towards addressing these limitations, we introduce latent lookahead, a training method that enables models to “think” before generating: at selected positions in the sequence, before committing to the next token, the model performs a multi-step lookahead in latent space. More precisely, instead of sampling future tokens, we leverage the network’s latent space by recursively feeding its hidden states back into the context for τ steps, investing additional compute in predicting that token. This produces τ latent predictions that are supervised against the next τ ground-truth tokens, encouraging the model to “look ahead” and refine its prediction. We show that latent lookahead significantly outperforms both autoregressive and non-autoregressive baselines on planning tasks such as maze solving, Sudoku, and ProsQA, where foresight is essential.
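To make the objective concrete, the following is a minimal toy sketch of the latent-lookahead loss described above: from the current context, the model takes τ latent steps by feeding its hidden state back into itself (no token sampling in between), and each latent step is supervised against the corresponding ground-truth token. All names, the single recurrent layer, and the mean-pooled context embedding are illustrative assumptions; the paper's actual architecture is a transformer whose hidden states are fed back into the context.

```python
import numpy as np

# Toy stand-in for the network: embedding, one recurrent map, and a logit head.
# These matrices and the pooling choice are hypothetical, for illustration only.
rng = np.random.default_rng(0)
d, vocab = 8, 5
W_emb = rng.normal(size=(vocab, d))          # token embedding table
W_rec = rng.normal(size=(d, d)) * 0.1        # stand-in for the network's recurrence
W_out = rng.normal(size=(d, vocab))          # unembedding / logit head

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def latent_lookahead_loss(context_ids, targets, tau):
    """Take tau latent steps from the current context, supervising each
    latent prediction against the next tau ground-truth tokens."""
    # Initial hidden state from the embedded context (mean-pooled; toy choice).
    h = W_emb[context_ids].mean(axis=0)
    loss = 0.0
    for k in range(tau):
        h = np.tanh(h @ W_rec)               # feed the latent state back in
        p = softmax(h @ W_out)               # latent prediction for token t+k
        loss += -np.log(p[targets[k]])       # cross-entropy vs. ground truth
    return loss / tau

# Example: tau = 3 latent steps before committing to the next token.
loss = latent_lookahead_loss(context_ids=[1, 3], targets=[2, 4, 0], tau=3)
print(f"mean lookahead loss: {loss:.3f}")
```

At inference, the same τ latent steps are taken before the next token is emitted, which is how the method allocates extra compute to positions that need it.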
** Work done while at Apple

