YouTube Transcript Extractor & Video Detail Scraper

by memo23

Extract full YouTube video metadata, channel details, and clean transcripts in any language. Perfect for content research, lead gen, and data analysis without API hassle.

51,277 runs
142 users

About YouTube Transcript Extractor & Video Detail Scraper

Need to pull data from YouTube without getting tangled in their API? I built this actor because I got tired of jumping through hoops for simple video info. It grabs everything you'd expect: titles, descriptions, channel details, publish dates, view and like counts, tags, categories, and thumbnails. But where it really saves time is with transcripts. It fetches the full transcript for any video, whether in the original language or an auto-translated one, and gives you clean, plain text without the timestamps or clutter.

I use it for content research to analyze which topics are performing well in a niche, for building datasets to train models, or just to get a readable summary of a long tutorial without watching the whole thing. It runs on Apify, so you can schedule it, pipe the data wherever you need it, and avoid the rate-limiting issues you might hit doing this manually. If you're working with video data at any scale, this handles the tedious extraction part so you can focus on the analysis.

What does this actor do?

YouTube Transcript Extractor & Video Detail Scraper is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.

Key Features

  • Cloud-based execution - no local setup required
  • Scalable infrastructure for large-scale operations
  • API access for integration with your applications
  • Built-in proxy rotation and anti-blocking measures
  • Scheduled runs and webhooks for automation

How to Use

  1. Click "Try This Actor" to open it on Apify
  2. Create a free Apify account if you don't have one
  3. Configure the input parameters as needed
  4. Run the actor and download your results

Documentation

Overview

This Apify actor scrapes comprehensive metadata and transcripts from YouTube videos. It extracts over 35 data points, including core metrics, engagement analytics, and multilingual transcripts, outputting structured JSON data. It's designed for developers, researchers, and marketers who need programmatic access to YouTube video details and spoken content.

Key Features

  • Video Metadata: Extracts title, description, view count, likes, keywords, categories, and thumbnail URLs.
  • Channel Details: Retrieves channel name, ID, and subscriber information.
  • Engagement Analytics: Captures view counts, comment data, and performance metrics like CTR and retention (when available).
  • Multilingual Transcripts: Fetches full video transcripts. Supports original language or auto-translation into over 100 languages. Outputs both timestamped and clean text versions.
  • Technical Robustness: Handles all YouTube URL formats (including Shorts and embeds), includes proxy support for reliability, and offers configurable concurrency and retry logic.
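
Because the actor accepts several URL formats, it can help to normalize and de-duplicate links on your side before building the input. The sketch below is not the actor's own parser; it is a minimal illustration using only the Python standard library, and the helper names (extract_video_id, to_start_urls) and the list of recognized hosts are assumptions made for this example.

```python
from typing import Dict, List, Optional
from urllib.parse import parse_qs, urlparse

# Hosts this sketch recognizes (an assumption, not an exhaustive list).
YOUTUBE_HOSTS = {"youtube.com", "m.youtube.com", "music.youtube.com",
                 "youtube-nocookie.com"}


def extract_video_id(url: str) -> Optional[str]:
    """Return the video ID from common YouTube URL shapes, or None."""
    parsed = urlparse(url)
    host = (parsed.hostname or "").lower()
    if host.startswith("www."):
        host = host[4:]
    parts = [p for p in parsed.path.split("/") if p]

    # Short links: https://youtu.be/<id>
    if host == "youtu.be":
        return parts[0] if parts else None

    if host in YOUTUBE_HOSTS:
        # Standard watch links: /watch?v=<id>
        if parts == ["watch"]:
            ids = parse_qs(parsed.query).get("v")
            return ids[0] if ids else None
        # Shorts, embeds, and live links: /shorts/<id>, /embed/<id>, /live/<id>
        if len(parts) >= 2 and parts[0] in ("shorts", "embed", "live"):
            return parts[1]
    return None


def to_start_urls(urls: List[str]) -> List[Dict[str, str]]:
    """Build a de-duplicated startUrls list in canonical watch-URL form."""
    seen = set()
    result = []
    for url in urls:
        video_id = extract_video_id(url)
        if video_id and video_id not in seen:
            seen.add(video_id)
            result.append({"url": f"https://www.youtube.com/watch?v={video_id}"})
    return result
```

Passing the result of to_start_urls as the startUrls field avoids scraping the same video twice when it appears under different link formats.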

How to Use

  1. Ensure you have an active Apify account.
  2. Configure the actor's input with your target YouTube video URLs and desired settings (see Input Configuration).
  3. Run the actor on the Apify platform.
  4. Retrieve the structured output dataset, which includes all scraped details and transcripts.

Input Configuration

Configure the actor using a JSON input. Here is the primary structure:

{
  "startUrls": [
    { "url": "https://www.youtube.com/watch?v=VIDEO_ID" }
  ],
  "maxConcurrency": 10,
  "maxRequestRetries": 10,
  "proxy": { "useApifyProxy": true },
  "includeTranscript": true,
  "language": "en"
}

Key Input Fields:
  • startUrls: (Required) An array of objects, each with a url field holding a YouTube video URL to scrape.
  • maxConcurrency: Maximum number of pages processed concurrently.
  • maxRequestRetries: Number of retry attempts for a failed request.
  • proxy: Apify Proxy configuration, used to avoid blocks.
  • includeTranscript: Set to true to fetch the video transcript.
  • language: The language code (e.g., 'en', 'es') for the transcript. Falls back to the default language if the requested one is unavailable.
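
The input above can also be assembled and submitted from code. The following is a minimal sketch, not an official client for this actor: build_input and run_actor are hypothetical helper names, the defaults simply mirror the example input, and the actor ID must be copied from the actor's Apify page (it is not given here). The run_actor helper assumes the third-party apify-client package is installed.

```python
from typing import Any, Dict, List


def build_input(urls: List[str], language: str = "en",
                include_transcript: bool = True,
                max_concurrency: int = 10,
                max_request_retries: int = 10) -> Dict[str, Any]:
    """Assemble the actor input using the fields documented above."""
    if not urls:
        raise ValueError("startUrls is required: pass at least one video URL")
    return {
        "startUrls": [{"url": u} for u in urls],
        "maxConcurrency": max_concurrency,
        "maxRequestRetries": max_request_retries,
        "proxy": {"useApifyProxy": True},
        "includeTranscript": include_transcript,
        "language": language,
    }


def run_actor(token: str, actor_id: str,
              run_input: Dict[str, Any]) -> List[Dict[str, Any]]:
    """Run the actor and return its dataset items.

    Requires `pip install apify-client`. The import is kept local so that
    build_input stays usable without the package.
    """
    from apify_client import ApifyClient

    client = ApifyClient(token)
    run = client.actor(actor_id).call(run_input=run_input)
    if run is None:
        raise RuntimeError("The actor run did not return a result")
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())
```

A call such as run_actor(token, actor_id, build_input(["https://www.youtube.com/watch?v=VIDEO_ID"], language="es")) would then return one item per video, with token and actor_id supplied from your own Apify account and the actor's page.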

Output

The actor outputs an array of items, one per video. Each item is a JSON object containing the scraped data. A transcript example is shown below; the full output includes all mentioned metadata fields.

{
  "transcript": [
    {
      "text": "so if you want immediate access to",
      "startMs": "160",
      "endMs": "3760",
      "startTimeText": "0:00"
    }
  ],
  "title": "Video Title Here",
  "viewCount": "1234567",
  "channelId": "UCxxxxx",
  // ... (all other video details)
}
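
Note that the sample item stores numeric values such as startMs, endMs, and viewCount as strings, so they need converting before any arithmetic. The sketch below shows one way to turn an item shaped like the example into plain text and numeric timings; the helper names are hypothetical, and it assumes every item follows the structure shown above.

```python
from typing import Any, Dict, List, Tuple


def transcript_to_text(item: Dict[str, Any]) -> str:
    """Join transcript segments into a single plain-text string."""
    segments = item.get("transcript") or []
    return " ".join(seg["text"].strip() for seg in segments if seg.get("text"))


def segment_seconds(segment: Dict[str, str]) -> Tuple[float, float]:
    """Convert a segment's string millisecond offsets to (start, end) seconds."""
    return int(segment["startMs"]) / 1000, int(segment["endMs"]) / 1000


def view_count(item: Dict[str, Any]) -> int:
    """Parse the string viewCount field into an integer."""
    return int(item["viewCount"])


# Example item, copied from the sample output above.
sample = {
    "transcript": [
        {
            "text": "so if you want immediate access to",
            "startMs": "160",
            "endMs": "3760",
            "startTimeText": "0:00",
        }
    ],
    "title": "Video Title Here",
    "viewCount": "1234567",
    "channelId": "UCxxxxx",
}

plain_text = transcript_to_text(sample)
timings: List[Tuple[float, float]] = [segment_seconds(s) for s in sample["transcript"]]
```

Items without a transcript (for example, when includeTranscript is false) yield an empty string from transcript_to_text rather than an error.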

Common Use Cases

Market Research

Gather competitive intelligence and market data

Lead Generation

Extract contact information for sales outreach

Price Monitoring

Track competitor pricing and product changes

Content Aggregation

Collect and organize content from multiple sources

Ready to Get Started?

Try YouTube Transcript Extractor & Video Detail Scraper now on Apify. Free tier available with no credit card required.

Actor Information

Developer: memo23
Pricing: Paid
Total Runs: 51,277
Active Users: 142

Apify Platform

Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.
