NotebookLM Integration

Understanding Aparture's NotebookLM integration and podcast generation.

What is NotebookLM?

NotebookLM is Google's AI-powered research assistant that can:

Synthesize information across multiple documents
Generate conversational audio overviews (podcasts)
Create natural dialogue between two AI hosts
Produce high-quality audio summaries

Aparture integrates with NotebookLM to transform your daily paper analysis into engaging podcasts.

The Workflow

Manual Workflow (Web Interface)

Run Aparture analysis - Generate report
Generate NotebookLM document - Click "Generate NotebookLM Document"
Download document - Get .md file
Upload to NotebookLM - Visit notebooklm.google.com, create notebook, upload file
Generate audio - Click "Generate" in Audio Overview section
Download podcast - Wait 10-20 minutes, download .m4a file

Automated Workflow (CLI)

bash

npm run analyze

Complete automation:

Generates analysis report
Creates NotebookLM document
Uploads to NotebookLM automatically
Generates podcast
Downloads audio file

See CLI Automation for details.

First-time setup

CLI automation requires interactive Google login on first run. Subsequent runs are fully automated.

NotebookLM Document Structure

The NotebookLM document is specifically structured for audio generation:

Overview Section

markdown

# Research Highlights: Computer Science (cs.LG, cs.AI)

Today's analysis examined 47 papers in machine learning and AI.
Several exciting developments emerged across transformer efficiency,
Bayesian methods, and interpretability research.

Purpose: Set context and frame the discussion

Characteristics:

Conversational tone
High-level summary
Engaging opening

Major Themes

markdown

## Major Themes

### Transformer Efficiency

Multiple papers tackle the computational challenges of transformers.
Smith et al. introduce "Sparse Hierarchical Attention" which reduces
complexity from O(n²) to O(n log n) while maintaining accuracy...

### Bayesian Deep Learning

There's renewed interest in Bayesian approaches. Jones et al. present
a scalable method for uncertainty estimation in neural networks...

Purpose: Organize papers into coherent themes

Characteristics:

Thematic grouping
Clear section headers
Connected narrative

Deep Dives

markdown

## Deep Dive: Sparse Hierarchical Attention

This paper by Smith et al. addresses a fundamental challenge in AI:
transformers are computationally expensive on long sequences.

**The Problem:** Standard attention requires O(n²) operations, making
it impractical for documents beyond a few thousand tokens.

**The Approach:** The authors use hierarchical clustering to create
sparse attention patterns that approximate full attention.

**The Results:** Three times faster training with minimal loss in
accuracy. This opens doors to processing much longer documents.

**Why It Matters:** Could enable new applications in document
understanding, code analysis, and scientific literature review.

Purpose: Provide detailed analysis of top papers

Characteristics:

Problem-Solution-Impact structure
Concrete details
Context and implications
Audio-friendly narrative

Connections and Insights

markdown

## Connecting the Dots

Several papers share common threads:

- Both Smith et al. and Chen et al. use hierarchical structures to
  reduce computational complexity
- Jones et al.'s Bayesian methods could be applied to the attention
  mechanisms in Lee et al.'s work
- The interpretability focus in Brown et al. aligns with growing
  concerns about black-box models

These connections suggest a broader shift toward efficient,
interpretable, and uncertainty-aware AI systems.

Purpose: Synthesize insights across papers

Characteristics:

Cross-paper connections
Broader implications
Forward-looking perspective

Podcast Customization

Duration Settings

Configure podcast length in Aparture settings:

5 minutes - Quick highlights
10 minutes - Standard summary
15 minutes - Detailed overview (default)
20 minutes - Comprehensive discussion
25 minutes - Extended analysis
30 minutes - Deep dive

Recommendation: 15-20 minutes for daily listening

Custom Prompts

Aparture uses custom prompts to guide podcast generation. Prompts are defined in NOTEBOOKLM_PROMPTS.md:

5-minute prompt (excerpt):

Focus on the top 3-5 most significant papers only. Keep technical
depth appropriate for audio but maintain accuracy. Each paper should
get 30-60 seconds of discussion maximum...

15-minute prompt (excerpt):

Provide a balanced overview covering:
- 5-7 top papers with moderate technical depth
- Key methodological approaches
- Implications and connections between papers
Target 2-3 minutes per major paper...

30-minute prompt (excerpt):

Create a comprehensive discussion covering:
- Detailed analysis of top papers
- Technical methodology discussion
- Broader context and related work
- Connections across papers
Aim for 3-5 minutes per major paper...

Customizing Prompts

Edit NOTEBOOKLM_PROMPTS.md to customize podcast style:

Example customizations:

Add domain-specific context
Adjust technical depth
Emphasize certain aspects (methodology, results, implications)
Change conversation style
Add humor or personality

Iterative refinement

Generate a few podcasts with default prompts, then adjust based on what you want more/less of.

Audio Quality

NotebookLM Audio Features

Voice characteristics:

Two AI hosts (one male, one female by default)
Natural conversation flow
Appropriate intonation and emphasis
Technical terms pronounced correctly (usually)

Audio format:

File format: M4A (AAC encoding)
Bitrate: ~128 kbps
Sample rate: 44.1 kHz
File size: ~1-3 MB per minute

Common Issues

Mispronunciations:

Novel technical terms
Author names (especially non-English)
Acronyms

Solution: Edit NotebookLM document to include phonetic hints or expand acronyms

Awkward transitions:

Abrupt topic changes
Repetitive phrases

Solution: Improve document structure with better thematic organization

Too technical or too simple:

Doesn't match your background

Solution: Adjust custom prompts to target your knowledge level

Use Cases

Daily Commute

Scenario: 30-minute drive, want to stay current

Configuration:

20-25 minute podcast
Balanced technical depth
Focus on top papers

Workflow:

Run npm run analyze before bed
Podcast ready in the morning
Listen during commute
Read full papers that interest you

Research Group Meetings

Scenario: Weekly paper discussions

Configuration:

15-20 minute podcast
Higher technical depth
Include methodology details

Workflow:

Weekly analysis run
Share podcast with group
Everyone listens before meeting
Discussion focuses on implications

Interdisciplinary Exploration

Scenario: Discovering connections across fields

Configuration:

25-30 minute podcast
Multiple diverse categories
Emphasize connections and themes

Workflow:

Select broad category set
Generate comprehensive podcast
Listen for unexpected connections
Deep-dive into promising areas

Literature Review Preparation

Scenario: Starting new research direction

Configuration:

30-minute podcast
Deep technical detail
Include future directions

Workflow:

Run analysis on target categories
Generate detailed podcast
Listen multiple times
Read full PDFs of key papers
Repeat weekly to build knowledge

Automation Details

CLI Automation Process

When you run npm run analyze:

Analysis completes - Report generated
NotebookLM document created - Structured for audio
Browser automation starts
- Opens Chrome with persistent profile
- Navigates to notebooklm.google.com
- Authenticates (first run only)
File upload
- Creates new notebook
- Uploads analysis report
- Uploads NotebookLM document
Podcast generation
- Opens customization menu
- Enters duration-specific prompt
- Clicks "Generate"
Wait for completion (10-20 minutes)
- Polls every 30 seconds
- Refreshes page to check status
Download audio
- Finds three-dot menu
- Clicks Download
- Saves to reports/
Cleanup
- Takes screenshots
- Closes browser
- Reports success

Authentication Caching

First run:

Browser opens to Google login
You manually sign in and grant permissions
Session cached in temp/notebooklm-profile/

Subsequent runs:

Uses cached session
No manual interaction needed
Fully automated

Session expiration:

Google sessions last ~weeks to months
If expired, you'll need to re-authenticate
CLI will detect and prompt you

Error Handling

Upload failures:

Retries up to 3 times
Takes screenshots for debugging
Detailed error messages

Generation timeouts:

30-minute default timeout
Configurable in code
Logs elapsed time regularly

Download failures:

Retries menu navigation
Screenshots current state
Suggests manual download if failed

Cost Considerations

NotebookLM Costs

Good news: NotebookLM is currently free to use.

Google provides:

Unlimited notebooks
Unlimited source uploads
Unlimited audio generation
No credit card required

Limitations:

50 sources per notebook (plenty for daily use)
Rate limiting (rarely hit with daily runs)
Subject to change (Google may introduce pricing)

Aparture Costs

NotebookLM document generation:

Uses Claude Opus 4.1 (fixed)
Typical cost: $0.20-0.50 per document
Depends on paper count and detail level

Included in daily costs:

Balanced config: $3.95/day includes NotebookLM document
Budget config: $2.05/day includes NotebookLM document

Comparison to Alternatives

Manual Reading

Pros of podcasts:

Multitask-friendly
Lower cognitive load
Good for initial filtering
Identifies papers for deep reading

Cons of podcasts:

Less detailed than reading
Can't skim/skip
No direct access to figures
May miss nuances

Best practice: Use podcasts for discovery, read PDFs for deep understanding

Other Audio Services

NotebookLM vs. podcast apps:

NotebookLM: Custom content, your specific interests
Podcast apps: General audience, broader topics
NotebookLM: Free, unlimited
Podcast apps: Often paid, limited selection

NotebookLM vs. text-to-speech:

NotebookLM: Conversational, synthesized
TTS: Literal reading of papers
NotebookLM: Engaging, contextual
TTS: Dry, hard to follow

NotebookLM Integration ​

What is NotebookLM? ​

The Workflow ​

Manual Workflow (Web Interface) ​

Automated Workflow (CLI) ​

NotebookLM Document Structure ​

Overview Section ​

Major Themes ​

Deep Dives ​

Connections and Insights ​

Podcast Customization ​

Duration Settings ​

Custom Prompts ​

Customizing Prompts ​

Audio Quality ​

NotebookLM Audio Features ​

Common Issues ​

Use Cases ​

Daily Commute ​

Research Group Meetings ​

Interdisciplinary Exploration ​

Literature Review Preparation ​

Automation Details ​

CLI Automation Process ​

Authentication Caching ​

Error Handling ​

Cost Considerations ​

NotebookLM Costs ​

Aparture Costs ​

Comparison to Alternatives ​

Manual Reading ​

Other Audio Services ​

Next Steps ​

NotebookLM Integration

What is NotebookLM?

The Workflow

Manual Workflow (Web Interface)

Automated Workflow (CLI)

NotebookLM Document Structure

Overview Section

Major Themes

Deep Dives

Connections and Insights

Podcast Customization

Duration Settings

Custom Prompts

Customizing Prompts

Audio Quality

NotebookLM Audio Features

Common Issues

Use Cases

Daily Commute

Research Group Meetings

Interdisciplinary Exploration

Literature Review Preparation

Automation Details

CLI Automation Process

Authentication Caching

Error Handling

Cost Considerations

NotebookLM Costs

Aparture Costs

Comparison to Alternatives

Manual Reading

Other Audio Services

Next Steps