YouTube Transcript Scraper

YouTube Transcript Scraper

by ultimate

Instantly extract full transcripts from any YouTube video. Choose between timestamped or clean text formats for research, content creation, and SEO.

292 runs
16 users
Try This Actor

Opens on Apify.com

About YouTube Transcript Scraper

Need the exact words from a YouTube video? This scraper pulls complete transcripts in seconds, just like you'd copy them yourself. I use it all the time for research, content repurposing, and checking my own videos for SEO keywords. You simply paste in the video or channel URL, set how much data you want to pull, and go. It gives you the text in two useful formats: one that's broken down with timestamps, perfect for creating clips or quotes, and one that's a clean, continuous block of text ready for analysis or summarization. It's straightforward—no fluff, just the transcript. For me, it's replaced hours of manual note-taking and guesswork. Whether you're a marketer analyzing competitor content, a student researching a topic, or a creator looking to subtitle your own work, this gets you the raw text data you need to move faster. The output is reliable and complete, so you can trust you're not missing any part of the dialogue.

What does this actor do?

YouTube Transcript Scraper is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.

Key Features

  • Cloud-based execution - no local setup required
  • Scalable infrastructure for large-scale operations
  • API access for integration with your applications
  • Built-in proxy rotation and anti-blocking measures
  • Scheduled runs and webhooks for automation

How to Use

  1. Click "Try This Actor" to open it on Apify
  2. Create a free Apify account if you don't have one
  3. Configure the input parameters as needed
  4. Run the actor and download your results

Documentation

YouTube Transcript Scraper

Retrieve complete transcripts for any YouTube video. Provide video or channel URLs, set data limits, and choose between timeline-based or continuous text output. You only pay for the data you scrape.

Overview

This actor scrapes the full transcript (captions) from YouTube videos. Input can be individual video URLs or channel URLs for bulk scraping. Output includes the transcript text, optional timestamps, and video metadata.

Key Features

  • Target Flexibility: Scrape transcripts from specific video URLs or from multiple videos on a channel.
  • Complete Transcripts: Retrieves the entire available caption set for each video.
  • Output Customization: Choose your preferred transcript language and output format (timeline with segments or continuous text).
  • Granular Billing: Pay per data element (video transcript) scraped.
  • Simple Setup: Configure with a few input settings in the Apify Console.

How to Use

Configure the actor's input in the Apify Console.

1. Enter Target URLs

Provide a list of YouTube URLs. You can mix video and channel URLs in a single run.

URL Type Example Behavior
Video URL https://www.youtube.com/watch?v=VIDEO_ID Scrapes the transcript from one specific video.
Channel URL https://www.youtube.com/@CHANNEL_HANDLE Scrapes transcripts from multiple videos on that channel. The number is controlled by maxChannelVideos.

2. Set a Video Crawl Limit (for channels)

If using channel URLs, set maxChannelVideos to limit how many recent videos to scrape (e.g., 5). This setting is ignored for direct video URLs. A value of 0 for a channel will result in no videos being scraped.

3. Configure Transcript Preferences

  • transcriptFormat: Choose timeline (default) for segmented text with timestamps, or continuous for plain text.
  • transcriptLanguage: Specify the language code (e.g., en) for the transcript if multiple are available.

Input

The actor accepts the following input configuration (typically as a JSON object in the Apify Console):

{
  "startUrls": [
    { "url": "https://www.youtube.com/watch?v=abc123" },
    { "url": "https://www.youtube.com/@ExampleChannel" }
  ],
  "maxChannelVideos": 10,
  "transcriptFormat": "timeline",
  "transcriptLanguage": "en"
}
  • startUrls: (Required) Array of URLs to scrape.
  • maxChannelVideos: Maximum number of videos to scrape from each channel URL. Required for channels.
  • transcriptFormat: Output format. Options: "timeline", "continuous".
  • transcriptLanguage: Preferred transcript language code (e.g., "en", "es").

Output

The actor outputs a dataset where each item represents one scraped video transcript. Each item includes:

  • Video metadata (URL, title, ID, channel).
  • The full transcript in the requested format.
  • For timeline format: an array of segments with text, start (seconds), and duration (seconds).
  • For continuous format: a single string containing the full transcript text.

Sample output item (timeline format):

{
  "url": "https://www.youtube.com/watch?v=abc123",
  "title": "Example Video Title",
  "transcript": [
    { "text": "First segment of speech.", "start": 0.0, "duration": 5.4 },
    { "text": "Next segment of speech.", "start": 5.4, "duration": 4.1 }
  ]
}

Common Use Cases

Market Research

Gather competitive intelligence and market data

Lead Generation

Extract contact information for sales outreach

Price Monitoring

Track competitor pricing and product changes

Content Aggregation

Collect and organize content from multiple sources

Ready to Get Started?

Try YouTube Transcript Scraper now on Apify. Free tier available with no credit card required.

Start Free Trial

Actor Information

Developer
ultimate
Pricing
Paid
Total Runs
292
Active Users
16
Apify Platform

Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.

Learn more about Apify

Need Professional Help?

Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.

Find a Specialist

Trusted by millions | Money-back guarantee | 24/7 Support