🧾 YouTube Extractor (Transcripts + Metadata)

by dz_omar

Instantly extract clean transcripts with timestamps and full video metadata from any YouTube URL. Perfect for research, content analysis, and automation.

39,461 runs
508 users

About 🧾 YouTube Extractor (Transcripts + Metadata)

Need the full text from a YouTube video, not just a messy copy-paste? I built this actor because I was tired of cobbling together scripts to get clean, structured data. It pulls the complete transcript with timestamps and pairs it with the essential metadata (title, description, view count, upload date, and more) straight from the video's URL.

It's the workhorse I use for my own projects when I need to analyze video content, create searchable archives, or generate subtitles without the manual hassle. The output is neatly formatted JSON, ready to feed into your data pipelines, research tools, or custom apps.

Whether you're a researcher cataloging interviews, a marketer analyzing competitor content, or a developer building a media database, this handles the data extraction so you can focus on the actual work. It runs fast, avoids the typical rate-limiting headaches, and has worked reliably whenever I've needed it.

What does this actor do?

🧾 YouTube Extractor (Transcripts + Metadata) is a data extraction tool available on the Apify platform. It runs in the cloud and pulls transcripts and metadata from YouTube video URLs.

Key Features

  • Cloud-based execution - no local setup required
  • Scalable infrastructure for large-scale operations
  • API access for integration with your applications
  • Built-in proxy rotation and anti-blocking measures
  • Scheduled runs and webhooks for automation

How to Use

  1. Click "Try This Actor" to open it on Apify
  2. Create a free Apify account if you don't have one
  3. Configure the input parameters as needed
  4. Run the actor and download your results

Documentation

YouTube Extractor (Transcripts + Metadata)

Extract structured transcripts and metadata from YouTube videos. This Apify actor is built for speed and reliability, outputting clean JSON for data pipelines and analysis.

Overview

This actor takes YouTube video URLs and returns complete transcripts with precise timestamps alongside comprehensive video and channel metadata. It supports batch processing and offers an optional integration for extracting comments via a separate scraper in standby mode.

Key Features

  • Complete Data Extraction: Full transcripts with second-by-second timestamps, video metadata (title, views, likes, date), and channel info (name, ID, subscribers).
  • Performance & Reliability: Optimized for speed with smart caching, batch processing, and automatic fallback between free and paid processing modes.
  • Developer-Friendly: Outputs clean JSON, CSV, or Excel. Offers flexible configuration for timestamp inclusion, transcript cleaning level, and caching.
  • Optional Comment Extraction: Integrates with the YouTube Comments Scraper in standby mode for concurrent, real-time comment scraping when enabled.

How to Use

  1. Provide Input: Supply one or more YouTube video URLs.
  2. Configure: Set options like cleaningLevel, includeTimestamps, and caching. Optionally enable comment extraction.
  3. Run: Execute the actor. It processes videos and extracts data.
  4. Get Results: Download the structured output in your chosen format.
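Besides the web console, an actor can also be started over the Apify REST API. The sketch below prepares such a request using only the Python standard library. It assumes the platform's synchronous `run-sync-get-dataset-items` endpoint and Bearer-token authentication; `USERNAME~ACTOR_NAME` and `YOUR_APIFY_TOKEN` are placeholders, since this page does not state the actor's API identifier. The request is built but not sent.

```python
import json
import urllib.request

API_BASE = "https://api.apify.com/v2"


def build_run_request(actor_id: str, token: str, run_input: dict) -> urllib.request.Request:
    """Prepare (but do not send) a POST that runs an actor and returns its dataset items."""
    url = f"{API_BASE}/acts/{actor_id}/run-sync-get-dataset-items"
    body = json.dumps(run_input).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
    )


# Placeholders: replace with the real actor identifier and your own API token.
request = build_run_request(
    "USERNAME~ACTOR_NAME",
    "YOUR_APIFY_TOKEN",
    {"youtubeUrl": [{"url": "https://www.youtube.com/watch?v=EXAMPLE_ID_1"}]},
)
print(request.get_method(), request.full_url)
```

To actually execute the run, pass the prepared request to `urllib.request.urlopen` and parse the response body with `json.load`; each item in the returned list should correspond to one processed video.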

Basic Input Example

{
  "youtubeUrl": [
    { "url": "https://www.youtube.com/watch?v=EXAMPLE_ID_1" },
    { "url": "https://www.youtube.com/watch?v=EXAMPLE_ID_2" }
  ],
  "cleaningLevel": "mild",
  "includeTimestamps": true
}
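When the URLs come from a spreadsheet or a script, it can be convenient to generate this input programmatically. The following is a minimal sketch: `build_input` is a hypothetical helper, and its URL check (standard watch links and `youtu.be` short links) is an assumption about what the actor accepts, not documented behavior.

```python
import json
from urllib.parse import parse_qs, urlparse

YOUTUBE_HOSTS = {"youtube.com", "www.youtube.com", "m.youtube.com"}


def build_input(urls, cleaning_level="mild", include_timestamps=True):
    """Wrap plain URL strings in the {"url": ...} objects the input example uses."""
    entries = []
    for url in urls:
        parsed = urlparse(url)
        host = parsed.netloc.lower()
        # Assumed formats: https://www.youtube.com/watch?v=ID and https://youtu.be/ID
        is_watch = host in YOUTUBE_HOSTS and "v" in parse_qs(parsed.query)
        is_short = host == "youtu.be" and len(parsed.path) > 1
        if not (is_watch or is_short):
            raise ValueError(f"not a recognizable YouTube video URL: {url}")
        entries.append({"url": url})
    return {
        "youtubeUrl": entries,
        "cleaningLevel": cleaning_level,
        "includeTimestamps": include_timestamps,
    }


run_input = build_input([
    "https://www.youtube.com/watch?v=EXAMPLE_ID_1",
    "https://www.youtube.com/watch?v=EXAMPLE_ID_2",
])
print(json.dumps(run_input, indent=2))
```

Rejecting malformed URLs before the run starts avoids spending a paid run on input that cannot produce a transcript.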

Enabling Comment Extraction

To extract comments, set extractcomments: true and configure the related parameters. This leverages a dedicated standby scraper for faster, resumable comment collection.

{
  "youtubeUrl": [ { "url": "https://www.youtube.com/watch?v=kOO31qFmi9A" } ],
  "cleaningLevel": "mild",
  "includeTimestamps": true,
  "extractcomments": true,
  "sortBy": "top",
  "maxComments": 50,
  "maxRepliesPerComment": 5
}

Comment Parameters:
  • extractcomments (boolean, default false): Enable comment extraction.
  • sortBy (string, default "top"): Sort method, either "top" or "newest".
  • maxComments (integer, default 10): Maximum top-level comments per video (allowed range: 10 to 100,000).
  • maxRepliesPerComment (integer, default 0): Maximum replies per comment. Set to 0 to disable replies for faster scraping.
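The documented defaults and ranges can be checked locally before a run. This is an illustrative sketch only: `comment_options` is a hypothetical helper, and it simply mirrors the limits listed above rather than reproducing the actor's own validation.

```python
def comment_options(sort_by="top", max_comments=10, max_replies_per_comment=0):
    """Return the comment-related input fields after checking the documented limits."""
    if sort_by not in ("top", "newest"):
        raise ValueError('sortBy must be "top" or "newest"')
    if not 10 <= max_comments <= 100_000:
        raise ValueError("maxComments must be between 10 and 100,000")
    if max_replies_per_comment < 0:
        raise ValueError("maxRepliesPerComment must be 0 or greater")
    return {
        "extractcomments": True,
        "sortBy": sort_by,
        "maxComments": max_comments,
        "maxRepliesPerComment": max_replies_per_comment,
    }


# Merge the comment settings into a basic transcript input.
base_input = {
    "youtubeUrl": [{"url": "https://www.youtube.com/watch?v=kOO31qFmi9A"}],
    "cleaningLevel": "mild",
    "includeTimestamps": True,
}
full_input = {**base_input, **comment_options(max_comments=50, max_replies_per_comment=5)}
print(full_input)
```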

Input/Output

Input: Accepts an array of YouTube video URLs via the youtubeUrl field, plus configuration options.

Output: Returns a structured JSON object per video. The core output includes:
  • videoId, VideoURL, Video_title
  • transcript: An array of objects with text, start (seconds), and duration.
  • metadata: Contains views, likes, channelName, channelId, channelSubscribers, publishedAt, etc.
  • analytics: Includes wordCount and estimatedDuration.

When extractcomments is enabled, the output also includes a comments array containing the scraped comment data.
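Because each transcript entry carries text, start, and duration, the array maps naturally onto subtitle formats. Here is a rough sketch that converts a transcript array into SubRip (SRT) text. It assumes that start and duration are numeric values in seconds; the sample segments are made up for illustration and are not real actor output.

```python
def srt_time(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    ms = round(seconds * 1000)
    hours, ms = divmod(ms, 3_600_000)
    minutes, ms = divmod(ms, 60_000)
    secs, ms = divmod(ms, 1000)
    return f"{hours:02}:{minutes:02}:{secs:02},{ms:03}"


def to_srt(transcript) -> str:
    """Turn a list of {text, start, duration} segments into SRT subtitle text."""
    blocks = []
    for index, segment in enumerate(transcript, start=1):
        start = float(segment["start"])
        end = start + float(segment["duration"])  # assumes duration is in seconds
        blocks.append(f"{index}\n{srt_time(start)} --> {srt_time(end)}\n{segment['text']}")
    return "\n\n".join(blocks) + "\n"


# Made-up sample segments in the documented shape.
sample = [
    {"text": "Hello", "start": 0.0, "duration": 1.5},
    {"text": "world", "start": 1.5, "duration": 2.0},
]
print(to_srt(sample))
```

The same loop can be adapted to produce plain timestamped lines or to recompute a word count for comparison with the analytics field.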

Common Use Cases

Market Research

Gather competitive intelligence and market data

Lead Generation

Extract contact information for sales outreach

Price Monitoring

Track competitor pricing and product changes

Content Aggregation

Collect and organize content from multiple sources

Ready to Get Started?

Try 🧾 YouTube Extractor (Transcripts + Metadata) now on Apify. Free tier available with no credit card required.


Actor Information

Developer: dz_omar
Pricing: Paid
Total Runs: 39,461
Active Users: 508

Apify Platform

Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.

