Deep Papers
Deep Papers
Arize AI
Google's NotebookLM and the Future of AI-Generated Audio
43 minutes Posted Oct 15, 2024 at 1:00 am.
0:00
43:28
Download MP3
Show notes
This week, Aman Khan and Harrison Chu explore NotebookLM’s unique features, including its ability to generate realistic-sounding podcast episodes from text (but this podcast is very real!). They dive into some technical underpinnings of the product, specifically the SoundStorm model used for generating high-quality audio, and how it leverages a hierarchical vector quantization approach (RVQ) to maintain consistency in speaker voice and tone throughout long audio durations. The discussion...