Notebooklm Podcast - Generate Human-Like Podcasts

Video Tutorial
🎉 NEW: Gemini TTS Model (Freelancer+) - Experience revolutionary AI voices with remarkably natural human-like effects and incredibly smooth dialogue that rivals professional podcasters! Try our latest Gemini model for the most realistic and authentic podcast conversations yet. Need help with script optimization or brainstorming? Use our chat assistant (logged-in users). See "How to Use" section below for model details.

0 / 3000 Character count
Make podcast public (shown on featured list)

🎙️Try Our AI Podcast Host! (31 languages and 120+ voices)

Have real-time conversations with an AI host and create interactive podcasts on the fly.
(Counts as 1 podcast generation)

Start Recording

How to Use

Create professional AI-powered podcasts with our podcast generator by following these comprehensive steps:

1. Content Input Methods

  • Direct Text Input:
    • Type or paste your content into the text area, and our AI will generate a podcast with a well-structured script for you.
    • If you already have a script, select "read directly from a script (Every language)" template and use format (each line starts with speaker-1 or speaker-2):
      speaker-1: [Your content here] 
      speaker-2: [Your content here]
  • File Upload (Freelancer, Professional & Enterprise):
    • Supported formats: PDF, TXT
    • Content is automatically extracted and editable

2. Script Editing & Regeneration

Step-by-Step Process:

  1. Initial Generation:
    • Generate your first podcast version
    • Review the generated transcript below the audio player
  2. Making Edits:
    • The transcript is automatically copied to the input field
    • Edit text while maintaining speaker markers
    • Example format (each line starts with speaker-1 or speaker-2):
      speaker-1: Welcome to our show. 
      speaker-2: Thanks for having me.
  3. Regeneration Settings:
    • Select "read directly from a script (Every language)" template
    • Choose desired voices for each speaker
    • Adjust audio model if needed
  4. Final Steps:
    • Click "Generate Podcast" to create new version
    • Preview and repeat if necessary

3. Voice & Audio Configuration

  • Audio Models:
    • Standard: Standard quality (All tiers)
    • High Resolution: Enhanced quality (Freelancer+)
    • Premium voices (Freelancer+)
    • International voices (Professional+)
  • Voice Selection:
    • Choose different voices for each speaker
    • Premium voices available for higher tiers
    • Language-specific voices for international content
  • Podcast Length:
    • Trial: Up to 1 minute
    • Short: 1-3 minutes
    • Medium: 4-6 minutes
    • Long: 7-10 minutes
    • Very Long (Enterprise only): 10+ minutes for extended content like audiobooks or long-form podcasts

4. Output & Sharing

  • Download Options:
    • Audio file (MP3/WAV format)
    • Transcript with speaker markers
  • Sharing Features:
    • Direct link to podcast page
    • Social media sharing buttons
    • Mobile-friendly access

5. Advanced Features & Language Options

  • Language-Specific Templates
    • English podcast: Natural conversation style with cultural context
    • European languages (French/German/Spanish/etc.):
      • Culturally adapted content and expressions
      • Native-speaking voices available (Professional+)
      • Region-specific language patterns
    • Asian languages (Chinese/Japanese/Korean):
      • Proper honorifics and cultural nuances
      • Native voice options (Professional+)
      • Cultural context preservation
    • Scientific/Technical content:
      • Structured for academic presentation
      • Technical terminology handling
      • Clear explanation patterns

6. Script Formatting Guide

  • Conversation Formats
    • Natural Dialogue:
      speaker-1: Today we're exploring AI technology. 
      speaker-2: That's a fascinating topic! What aspects should we focus on?
      speaker-1: Let's start with practical applications.
    • Interview Style:
      speaker-1: Could you share your experience in this field? 
      speaker-2: I've been working on AI projects for over a decade...
      speaker-1: That's interesting. How has the technology evolved?
    • Educational Content:
      speaker-1: Let's break down this concept step by step. 
      speaker-2: First, we need to understand the basics...
      speaker-1: Could you provide an example?

7. Voice Selection Strategy

  • Content-Based Selection
    • News & Professional Content:
      • Use broadcaster-style voices
      • Select clear, authoritative tones
      • Maintain professional pacing
    • Casual & Entertainment:
      • Choose conversational voices
      • Mix different personality types
      • Consider age-appropriate voices
    • Educational Content:
      • Select engaging, clear voices
      • Use patient, explanatory tones
      • Balance authority with accessibility

8. Common Issues & Solutions

  • Audio Quality Issues
    • Pronunciation problems:
      • Try WorldSpeak (Pro) model for more voices
      • Use phonetic spelling for complex words
      • Select native language voices when available
    • Pacing issues:
      • Add punctuation for natural pauses
      • Break long sentences into shorter ones
      • Use commas for better rhythm

8. Understanding Text and Audio Models

  • Text Generation Models
    • Base Models:
      • gpt-4o-mini: Fast, efficient for short content generation
      • gpt-4o: Balanced performance for general podcast scripts
      • gpt-4-turbo: Enhanced capabilities for complex topics
    • Advanced Models (Professional & Enterprise):
      • o1-preview: Latest model with enhanced understanding and creativity
      • o1-mini: Optimized for quick, precise responses
      • chatgpt-4o-latest: Up-to-date knowledge and improved context handling
  • Audio Models and Voice Options
    • Standard Model:
      • 6 basic voices (alloy, echo, fable, onyx, nova, shimmer)
      • Suitable for general podcast content
      • Available in all subscription tiers
    • High Resolution Model:
      • Same 6 basic voices with enhanced audio quality
      • Clearer pronunciation and natural intonation
      • Available in Freelancer tier and above
    • WorldSpeak Model (Freelancer+):
      • 100+ diverse voices for different styles and languages
      • Regional accents and speaking styles
      • Specialized voices for news, storytelling, and education
    • WorldSpeak Pro Model (Professional+):
      • Premium version of WorldSpeak with additional features
      • Enhanced multilingual capabilities
      • Professional-grade voice quality
      • Advanced emotional expression and tone control
    • Gemini TTS Model (Freelancer+):
      • Revolutionary 30 unique voices with exceptionally natural sound
      • Human-like intonation and emotional expression
      • Remarkably smooth dialogue flow that rivals professional podcasters
      • Advanced pronunciation and cultural nuance handling
  • Model Selection Tips
    • Text Model Selection:
      • Use gpt-4o-mini for quick, straightforward content
      • Choose o1-preview for complex or technical topics
      • Select chatgpt-4o-latest for current events discussion
    • Audio Model Selection:
      • Standard/High Resolution: Best for English content with basic voices
      • WorldSpeak: Ideal for multilingual content and diverse speaking styles
      • WorldSpeak Pro: Perfect for professional productions and international audiences
      • Gemini TTS: Choose for the most natural-sounding, human-like podcast conversations

Pro Tips for Better Results

  • Test voice combinations with short samples first
  • Use the direct script template for precise control
  • Consider your target audience when selecting voices
  • Review and edit transcripts before final generation
  • Save successful combinations for future reference

Frequently Asked Questions (FAQ)

What subscription tiers does our podcast generator offer?

Our podcast generator offers four comprehensive tiers: • Hobby: Up to 40 AI podcasts/month, standard audio quality, basic voices • Freelancer: Up to 70 AI podcasts/month, high audio quality, 35+ voices • Professional: Up to 100 AI podcasts/month, all models, 48+ international voices • Enterprise: Unlimited podcasts, personalized voices, voice cloning (coming soon)

What voice options are available?

Our podcast generator provides extensive voice options: • Basic tier: 6 standard voices (alloy, echo, fable, onyx, nova, shimmer) • Freelancer: 35+ additional versatile voices • Professional: 48+ international voices including various English accents • Enterprise: Custom voice creation and voice cloning capabilities

What are the text and audio model options?

Available models vary by tier: • Text models: From gpt-4o-mini to o1-preview (Professional+) • Audio models: Standard (all tiers), High Resolution (Freelancer+), WorldSpeak (Freelancer+), Gemini TTS (Freelancer+) • Gemini TTS offers the most natural-sounding, human-like voices with exceptional realism • Professional tier includes exclusive access to o1-preview • Enterprise tier offers unlimited length generation

How does podcast length vary across different tiers?

Input text character limits by tier: • Hobby: Up to 3000 characters • Freelancer: Up to 8000 characters • Professional: Up to 15000 characters • Enterprise: Up to 30000 characters These limits apply to your input text, not the generated podcast length.

What file upload capabilities does our podcast generator support?

File upload features: • Hobby: Direct text input only • Freelancer and above: PDF and TXT file upload support • Automatic text extraction from uploaded files • Edit extracted content before generation • Professional tier includes enhanced file processing

How does script editing work?

Script editing features: • Direct script reading with speaker markers • Easy editing of generated transcripts • Support for multiple languages • Automatic speaker attribution • Real-time preview before regeneration

What output formats are available?

Each generation includes: • High-quality audio file (MP3/WAV) • Detailed transcript with speaker identification • Downloadable subtitles • Social sharing capabilities • Mobile-optimized playback

How does our podcast generator handle multilingual content?

Comprehensive language support: • Multiple language-specific templates • International voice options (Professional tier) • Support for major world languages • Language-optimized text generation • WorldSpeak (Pro) model for more voices

What are the storage and saving options?

Storage capabilities by tier: • Hobby: Save up to 5 recent podcasts • Freelancer: Save up to 20 recent podcasts • Professional: Save all generated podcasts • Enterprise: Unlimited storage and archiving • All tiers: Manual download option

What unique features are available in our podcast generator's Enterprise tier?

Enterprise exclusive features: • Design 3 personalized voices from text prompts • Voice cloning capability (coming soon) • Unlimited podcast generation • Priority support • Custom voice creation tools

Return to Home