NotebookLM Integration
Understanding Aparture's NotebookLM integration and podcast generation.
What is NotebookLM?
NotebookLM is Google's AI-powered research assistant that can:
- Synthesize information across multiple documents
- Generate conversational audio overviews (podcasts)
- Create natural dialogue between two AI hosts
- Produce high-quality audio summaries
Aparture integrates with NotebookLM to transform your daily paper analysis into engaging podcasts.
The Workflow
Manual Workflow (Web Interface)
- Run Aparture analysis - Generate report
- Generate NotebookLM document - Click "Generate NotebookLM Document"
- Download document - Get
.mdfile - Upload to NotebookLM - Visit notebooklm.google.com, create notebook, upload file
- Generate audio - Click "Generate" in Audio Overview section
- Download podcast - Wait 10-20 minutes, download
.m4afile
Automated Workflow (CLI)
npm run analyzeComplete automation:
- Generates analysis report
- Creates NotebookLM document
- Uploads to NotebookLM automatically
- Generates podcast
- Downloads audio file
See CLI Automation for details.
First-time setup
CLI automation requires interactive Google login on first run. Subsequent runs are fully automated.
NotebookLM Document Structure
The NotebookLM document is specifically structured for audio generation:
Overview Section
# Research Highlights: Computer Science (cs.LG, cs.AI)
Today's analysis examined 47 papers in machine learning and AI.
Several exciting developments emerged across transformer efficiency,
Bayesian methods, and interpretability research.Purpose: Set context and frame the discussion
Characteristics:
- Conversational tone
- High-level summary
- Engaging opening
Major Themes
## Major Themes
### Transformer Efficiency
Multiple papers tackle the computational challenges of transformers.
Smith et al. introduce "Sparse Hierarchical Attention" which reduces
complexity from O(n²) to O(n log n) while maintaining accuracy...
### Bayesian Deep Learning
There's renewed interest in Bayesian approaches. Jones et al. present
a scalable method for uncertainty estimation in neural networks...Purpose: Organize papers into coherent themes
Characteristics:
- Thematic grouping
- Clear section headers
- Connected narrative
Deep Dives
## Deep Dive: Sparse Hierarchical Attention
This paper by Smith et al. addresses a fundamental challenge in AI:
transformers are computationally expensive on long sequences.
**The Problem:** Standard attention requires O(n²) operations, making
it impractical for documents beyond a few thousand tokens.
**The Approach:** The authors use hierarchical clustering to create
sparse attention patterns that approximate full attention.
**The Results:** Three times faster training with minimal loss in
accuracy. This opens doors to processing much longer documents.
**Why It Matters:** Could enable new applications in document
understanding, code analysis, and scientific literature review.Purpose: Provide detailed analysis of top papers
Characteristics:
- Problem-Solution-Impact structure
- Concrete details
- Context and implications
- Audio-friendly narrative
Connections and Insights
## Connecting the Dots
Several papers share common threads:
- Both Smith et al. and Chen et al. use hierarchical structures to
reduce computational complexity
- Jones et al.'s Bayesian methods could be applied to the attention
mechanisms in Lee et al.'s work
- The interpretability focus in Brown et al. aligns with growing
concerns about black-box models
These connections suggest a broader shift toward efficient,
interpretable, and uncertainty-aware AI systems.Purpose: Synthesize insights across papers
Characteristics:
- Cross-paper connections
- Broader implications
- Forward-looking perspective
Podcast Customization
Duration Settings
Configure podcast length in Aparture settings:
- 5 minutes - Quick highlights
- 10 minutes - Standard summary
- 15 minutes - Detailed overview (default)
- 20 minutes - Comprehensive discussion
- 25 minutes - Extended analysis
- 30 minutes - Deep dive
Recommendation: 15-20 minutes for daily listening
Custom Prompts
Aparture uses custom prompts to guide podcast generation. Prompts are defined in NOTEBOOKLM_PROMPTS.md:
5-minute prompt (excerpt):
Focus on the top 3-5 most significant papers only. Keep technical
depth appropriate for audio but maintain accuracy. Each paper should
get 30-60 seconds of discussion maximum...15-minute prompt (excerpt):
Provide a balanced overview covering:
- 5-7 top papers with moderate technical depth
- Key methodological approaches
- Implications and connections between papers
Target 2-3 minutes per major paper...30-minute prompt (excerpt):
Create a comprehensive discussion covering:
- Detailed analysis of top papers
- Technical methodology discussion
- Broader context and related work
- Connections across papers
Aim for 3-5 minutes per major paper...Customizing Prompts
Edit NOTEBOOKLM_PROMPTS.md to customize podcast style:
Example customizations:
- Add domain-specific context
- Adjust technical depth
- Emphasize certain aspects (methodology, results, implications)
- Change conversation style
- Add humor or personality
Iterative refinement
Generate a few podcasts with default prompts, then adjust based on what you want more/less of.
Audio Quality
NotebookLM Audio Features
Voice characteristics:
- Two AI hosts (one male, one female by default)
- Natural conversation flow
- Appropriate intonation and emphasis
- Technical terms pronounced correctly (usually)
Audio format:
- File format: M4A (AAC encoding)
- Bitrate: ~128 kbps
- Sample rate: 44.1 kHz
- File size: ~1-3 MB per minute
Common Issues
Mispronunciations:
- Novel technical terms
- Author names (especially non-English)
- Acronyms
Solution: Edit NotebookLM document to include phonetic hints or expand acronyms
Awkward transitions:
- Abrupt topic changes
- Repetitive phrases
Solution: Improve document structure with better thematic organization
Too technical or too simple:
- Doesn't match your background
Solution: Adjust custom prompts to target your knowledge level
Use Cases
Daily Commute
Scenario: 30-minute drive, want to stay current
Configuration:
- 20-25 minute podcast
- Balanced technical depth
- Focus on top papers
Workflow:
- Run
npm run analyzebefore bed - Podcast ready in the morning
- Listen during commute
- Read full papers that interest you
Research Group Meetings
Scenario: Weekly paper discussions
Configuration:
- 15-20 minute podcast
- Higher technical depth
- Include methodology details
Workflow:
- Weekly analysis run
- Share podcast with group
- Everyone listens before meeting
- Discussion focuses on implications
Interdisciplinary Exploration
Scenario: Discovering connections across fields
Configuration:
- 25-30 minute podcast
- Multiple diverse categories
- Emphasize connections and themes
Workflow:
- Select broad category set
- Generate comprehensive podcast
- Listen for unexpected connections
- Deep-dive into promising areas
Literature Review Preparation
Scenario: Starting new research direction
Configuration:
- 30-minute podcast
- Deep technical detail
- Include future directions
Workflow:
- Run analysis on target categories
- Generate detailed podcast
- Listen multiple times
- Read full PDFs of key papers
- Repeat weekly to build knowledge
Automation Details
CLI Automation Process
When you run npm run analyze:
- Analysis completes - Report generated
- NotebookLM document created - Structured for audio
- Browser automation starts
- Opens Chrome with persistent profile
- Navigates to notebooklm.google.com
- Authenticates (first run only)
- File upload
- Creates new notebook
- Uploads analysis report
- Uploads NotebookLM document
- Podcast generation
- Opens customization menu
- Enters duration-specific prompt
- Clicks "Generate"
- Wait for completion (10-20 minutes)
- Polls every 30 seconds
- Refreshes page to check status
- Download audio
- Finds three-dot menu
- Clicks Download
- Saves to
reports/
- Cleanup
- Takes screenshots
- Closes browser
- Reports success
Authentication Caching
First run:
- Browser opens to Google login
- You manually sign in and grant permissions
- Session cached in
temp/notebooklm-profile/
Subsequent runs:
- Uses cached session
- No manual interaction needed
- Fully automated
Session expiration:
- Google sessions last ~weeks to months
- If expired, you'll need to re-authenticate
- CLI will detect and prompt you
Error Handling
Upload failures:
- Retries up to 3 times
- Takes screenshots for debugging
- Detailed error messages
Generation timeouts:
- 30-minute default timeout
- Configurable in code
- Logs elapsed time regularly
Download failures:
- Retries menu navigation
- Screenshots current state
- Suggests manual download if failed
Cost Considerations
NotebookLM Costs
Good news: NotebookLM is currently free to use.
Google provides:
- Unlimited notebooks
- Unlimited source uploads
- Unlimited audio generation
- No credit card required
Limitations:
- 50 sources per notebook (plenty for daily use)
- Rate limiting (rarely hit with daily runs)
- Subject to change (Google may introduce pricing)
Aparture Costs
NotebookLM document generation:
- Uses Claude Opus 4.1 (fixed)
- Typical cost: $0.20-0.50 per document
- Depends on paper count and detail level
Included in daily costs:
- Balanced config: $3.95/day includes NotebookLM document
- Budget config: $2.05/day includes NotebookLM document
Comparison to Alternatives
Manual Reading
Pros of podcasts:
- Multitask-friendly
- Lower cognitive load
- Good for initial filtering
- Identifies papers for deep reading
Cons of podcasts:
- Less detailed than reading
- Can't skim/skip
- No direct access to figures
- May miss nuances
Best practice: Use podcasts for discovery, read PDFs for deep understanding
Other Audio Services
NotebookLM vs. podcast apps:
- NotebookLM: Custom content, your specific interests
- Podcast apps: General audience, broader topics
- NotebookLM: Free, unlimited
- Podcast apps: Often paid, limited selection
NotebookLM vs. text-to-speech:
- NotebookLM: Conversational, synthesized
- TTS: Literal reading of papers
- NotebookLM: Engaging, contextual
- TTS: Dry, hard to follow