Deep Papers
Deep Papers
Arize AI
Training Large Language Models to Reason in Continuous Latent Space
24 minutes Posted Jan 14, 2025 at 6:00 pm.
0:00
24:58
Download MP3
Show notes
LLMs have typically been restricted to reason in the "language space," where chain-of-thought (CoT) is used to solve complex reasoning problems. But a new paper argues that language space may not always be the best for reasoning. In this paper read, we cover an exciting new technique from a team at Meta called Chain of Continuous Thought—also known as "Coconut." In the paper, "Training Large Language Models to Reason in a Continuous Latent Space" explores the potential of allowing LLMs to rea...