YouTube Transcript Extractor & Video Detail Scraper
by memo23
Extract full YouTube video metadata, channel details, and clean transcripts in any language. Perfect for content research, lead gen, and data analysis without API hassle.
About YouTube Transcript Extractor & Video Detail Scraper
Need to pull data from YouTube without getting tangled in their API? I built this actor because I got tired of jumping through hoops for simple video info. It grabs everything you'd expect: titles, descriptions, channel details, publish dates, view and like counts, tags, categories, and thumbnails. But where it really saves time is with transcripts. It fetches the full transcript for any video, whether it's in the original language or an auto-translated one, and can give you clean, plain text without the timestamps or clutter. I use it for content research to analyze which topics are performing well in a niche, for building datasets to train models, or just to get a readable version of a long tutorial without watching the whole thing. It runs on Apify, so you can schedule it, pipe the data wherever you need it, and avoid the rate-limiting issues you might hit doing this manually. If you're working with video data at any scale, this handles the tedious extraction part so you can focus on the analysis.
What does this actor do?
YouTube Transcript Extractor & Video Detail Scraper is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.
Key Features
- Cloud-based execution - no local setup required
- Scalable infrastructure for large-scale operations
- API access for integration with your applications
- Built-in proxy rotation and anti-blocking measures
- Scheduled runs and webhooks for automation
How to Use
- Click "Try This Actor" to open it on Apify
- Create a free Apify account if you don't have one
- Configure the input parameters as needed
- Run the actor and download your results
Documentation
YouTube Transcript Extractor & Video Detail Scraper
Overview
This Apify actor scrapes comprehensive metadata and transcripts from YouTube videos. It extracts over 35 data points, including core metrics, engagement analytics, and multilingual transcripts, outputting structured JSON data. It's designed for developers, researchers, and marketers who need programmatic access to YouTube video details and spoken content.
Key Features
- Video Metadata: Extracts title, description, view count, likes, keywords, categories, and thumbnail URLs.
- Channel Details: Retrieves channel name, ID, and subscriber information.
- Engagement Analytics: Captures view counts, comment data, and performance metrics like CTR and retention (when available).
- Multilingual Transcripts: Fetches full video transcripts. Supports original language or auto-translation into over 100 languages. Outputs both timestamped and clean text versions.
- Technical Robustness: Handles all YouTube URL formats (including Shorts and embeds), includes proxy support for reliability, and offers configurable concurrency and retry logic.
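To illustrate what handling several URL formats involves, here is a minimal Python sketch that reduces common YouTube URL shapes to a video ID. It is not the actor's own code: the function name `extract_video_id` and the set of hosts and path prefixes it recognizes are assumptions made for this example.

```python
from typing import Optional
from urllib.parse import parse_qs, urlparse

# Path prefixes that carry the video ID as the next path segment (assumed list).
ID_PATH_PREFIXES = ("shorts", "embed", "live", "v")
# Hosts treated as YouTube after stripping "www." or "m." (assumed list).
YOUTUBE_HOSTS = ("youtube.com", "music.youtube.com", "youtube-nocookie.com")


def extract_video_id(url: str) -> Optional[str]:
    """Return the video ID from a YouTube URL, or None if it is not recognized."""
    parsed = urlparse(url.strip())
    host = (parsed.hostname or "").lower()
    if host.startswith("www."):
        host = host[4:]
    elif host.startswith("m."):
        host = host[2:]
    segments = [s for s in parsed.path.split("/") if s]

    # Short links: https://youtu.be/<id>
    if host == "youtu.be":
        return segments[0] if segments else None

    if host in YOUTUBE_HOSTS:
        # Standard watch pages: /watch?v=<id>
        if segments and segments[0] == "watch":
            values = parse_qs(parsed.query).get("v")
            return values[0] if values else None
        # Shorts, embeds, and similar: /<prefix>/<id>
        if len(segments) >= 2 and segments[0] in ID_PATH_PREFIXES:
            return segments[1]
    return None
```

Normalizing to an ID before building the input also makes it easy to drop duplicate URLs that point at the same video.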
How to Use
- Ensure you have an active Apify account.
- Configure the actor's input with your target YouTube video URLs and desired settings (see Input Configuration).
- Run the actor on the Apify platform.
- Retrieve the structured output dataset, which includes all scraped details and transcripts.
Input Configuration
Configure the actor using a JSON input. Here is the primary structure:
{
  "startUrls": [
    { "url": "https://www.youtube.com/watch?v=VIDEO_ID" }
  ],
  "maxConcurrency": 10,
  "maxRequestRetries": 10,
  "proxy": { "useApifyProxy": true },
  "includeTranscript": true,
  "language": "en"
}
Key Input Fields:
- startUrls: (Required) An array of objects, each with a url field holding a YouTube video URL to scrape.
- maxConcurrency: Maximum number of pages processed concurrently.
- maxRequestRetries: Number of retry attempts for each failed request.
- proxy: Apify Proxy configuration, used to avoid blocks.
- includeTranscript: Set to true to fetch the video transcript.
- language: The language code (e.g., 'en', 'es') for the transcript. Falls back to the default language if the requested one is unavailable.
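The input above can also be assembled and submitted from code. The following Python sketch builds the same structure and, optionally, starts a run through the `apify-client` package. The helper names `build_input` and `run_actor` are made up for this example, and the actor ID is a placeholder you must supply yourself, since the listing does not state it.

```python
import json
from typing import Any, Dict, List


def build_input(video_urls: List[str], language: str = "en",
                include_transcript: bool = True) -> Dict[str, Any]:
    """Build an input object using the fields documented above."""
    if not video_urls:
        raise ValueError("startUrls is required: pass at least one video URL")
    return {
        "startUrls": [{"url": url} for url in video_urls],
        "maxConcurrency": 10,
        "maxRequestRetries": 10,
        "proxy": {"useApifyProxy": True},
        "includeTranscript": include_transcript,
        "language": language,
    }


def run_actor(token: str, actor_id: str, run_input: Dict[str, Any]) -> List[dict]:
    """Start a run and return its dataset items.

    Assumes `pip install apify-client`; `actor_id` is a placeholder for the
    actor's real identifier, which is not given in this listing.
    """
    from apify_client import ApifyClient  # imported lazily so build_input works without it

    client = ApifyClient(token)
    run = client.actor(actor_id).call(run_input=run_input)
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


if __name__ == "__main__":
    # Print the JSON you could paste into the actor's input editor.
    print(json.dumps(build_input(["https://www.youtube.com/watch?v=VIDEO_ID"]), indent=2))
```

Keeping input construction separate from the network call lets you validate or log the input before spending any platform credits on a run.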
Output
The actor outputs an array of items, one per video. Each item is a JSON object containing the scraped data. A transcript example is shown below; the full output includes all mentioned metadata fields.
{
  "transcript": [
    {
      "text": "so if you want immediate access to",
      "startMs": "160",
      "endMs": "3760",
      "startTimeText": "0:00"
    }
  ],
  "title": "Video Title Here",
  "viewCount": "1234567",
  "channelId": "UCxxxxx",
  // ... (all other video details)
}
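Because the example shows numeric values such as startMs and viewCount as strings, some light post-processing is usually needed. The Python sketch below joins transcript segments into plain text and converts the string fields to integers. The function names are made up for this example, and it assumes every item follows the shape shown above.

```python
from typing import Any, Dict


def transcript_to_text(item: Dict[str, Any]) -> str:
    """Join the text of all transcript segments into one plain-text string."""
    segments = item.get("transcript") or []
    parts = [segment.get("text", "").strip() for segment in segments]
    return " ".join(part for part in parts if part)


def segment_duration_ms(segment: Dict[str, Any]) -> int:
    """Length of one segment in milliseconds; startMs and endMs arrive as strings."""
    return int(segment["endMs"]) - int(segment["startMs"])


def view_count(item: Dict[str, Any]) -> int:
    """Convert the string viewCount field to an integer."""
    return int(item["viewCount"])


if __name__ == "__main__":
    sample = {
        "transcript": [
            {
                "text": "so if you want immediate access to",
                "startMs": "160",
                "endMs": "3760",
                "startTimeText": "0:00",
            }
        ],
        "title": "Video Title Here",
        "viewCount": "1234567",
        "channelId": "UCxxxxx",
    }
    print(transcript_to_text(sample))
    print(segment_duration_ms(sample["transcript"][0]))
```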
Common Use Cases
- Market Research: Gather competitive intelligence and market data
- Lead Generation: Extract contact information for sales outreach
- Price Monitoring: Track competitor pricing and product changes
- Content Aggregation: Collect and organize content from multiple sources
Ready to Get Started?
Try YouTube Transcript Extractor & Video Detail Scraper now on Apify. Free tier available with no credit card required.
Actor Information
- Developer: memo23
- Pricing: Paid
- Total Runs: 51,277
- Active Users: 142
Related Actors
🏯 Tweet Scraper V2 - X / Twitter Scraper
by apidojo
Google Search Results Scraper
by apify
Instagram Profile Scraper
by apify
Tweet Scraper|$0.25/1K Tweets | Pay-Per Result | No Rate Limits
by kaitoeasyapi
Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.