Deep Papers
Arize AI
LLM Interpretability and Sparse Autoencoders: Research from OpenAI and Anthropic
44 minutes. Posted Jun 14, 2024 at 6:00 pm.
Show notes
It’s been an exciting couple of weeks for GenAI! Join us as we take a closer look at the latest research from OpenAI and Anthropic. We’re excited to chat about this significant step forward in understanding how LLMs work and its implications for a deeper understanding of the neural activity of language models. Both papers focus on the sparse autoencoder, an unsupervised approach for extracting interpretabl...