Youtube Video Transcript Scraper
by alpha-scraper
Instantly extract full transcripts from any public YouTube video. No login or API key needed—just paste the URL to get clean, structured text for research, accessibility, or analysis.
Opens on Apify.com
About Youtube Video Transcript Scraper
Need the exact words from a YouTube video? This scraper pulls full, clean transcripts from any public video. Just paste the URL and you'll get structured text back in seconds. I use it all the time when I'm researching a topic and want to quote someone accurately, or when I need to quickly summarize a long tutorial without watching the whole thing again. It's perfect for creating accessible content, generating study notes, or feeding video data into your own analysis projects. You don't need a YouTube account or an API key—it just works. The output is plain text, so you can easily copy it into a document, a spreadsheet, or another app. It saves me hours of manual transcription and lets me focus on the actual work, whether that's writing a blog post, compiling research, or building a dataset.
What does this actor do?
Youtube Video Transcript Scraper is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.
Key Features
- Cloud-based execution - no local setup required
- Scalable infrastructure for large-scale operations
- API access for integration with your applications
- Built-in proxy rotation and anti-blocking measures
- Scheduled runs and webhooks for automation
How to Use
- Click "Try This Actor" to open it on Apify
- Create a free Apify account if you don't have one
- Configure the input parameters as needed
- Run the actor and download your results
Documentation
YouTube Video Transcript Scraper
Extracts transcripts and captions from YouTube videos and Shorts. This Apify actor is built with Python, processes input URLs, and outputs structured transcript data to a dataset.
Key Features
- Fetches available transcripts and captions from standard YouTube videos and YouTube Shorts.
- Processes single or multiple video URLs in one run.
- Implements retry logic with exponential backoff for handling failed requests.
- Supports proxy rotation using Apify Proxy or custom proxy configurations.
- Outputs both the raw API response and a clean, normalized array of text segments with timestamps.
How to Use
Configure the actor run by providing a JSON input object. The main required field is startUrls, containing the YouTube video links to process.
Input
The actor accepts the following input configuration:
| Field | Type | Required | Description | Example |
|---|---|---|---|---|
startUrls |
Array | Yes | A list of YouTube video URLs to scrape. | [{"url": "https://www.youtube.com/watch?v=dQw4w9WgXcQ"}] |
proxyConfiguration |
Object | No | Apify Proxy settings for the request. | { "useApifyProxy": true, "apifyProxyGroups": ["RESIDENTIAL"] } |
Example Input:
{
"startUrls": [
{ "url": "https://www.youtube.com/watch?v=BthfXVCRWEQ" },
{ "url": "https://www.youtube.com/watch?v=FWAdfuPpLOc" }
],
"proxyConfiguration": {
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"]
}
}
Output
For each processed video, the actor outputs a JSON object to the dataset containing the transcript data.
Output Fields:
* video_id: The YouTube video ID.
* title: The video title.
* url: The source video URL.
* lang: The language code of the transcript.
* format: The transcript format (e.g., vtt).
* segments: An array of transcript segments, each with start time, end time, and text.
* inputUrl: The original URL processed.
* fetchedAt: ISO timestamp of when the transcript was fetched.
* success: Boolean indicating if the fetch was successful.
* raw_response: The raw transcript API response object.
Example Output:
{
"video_id": "BthfXVCRWEQ",
"title": "I Hatched the World's Biggest vs Smallest Eggs",
"url": "https://www.youtube.com/watch?v=BthfXVCRWEQ",
"lang": "en",
"format": "vtt",
"segments": [
{
"start": 0.12,
"end": 5.2,
"text": "Here is the world's biggest egg...\nand here is one of the smallest."
}
],
"inputUrl": "https://www.youtube.com/watch?v=BthfXVCRWEQ",
"fetchedAt": "2023-10-26T14:30:00.000Z",
"success": true,
"raw_response": { ... }
}
Categories
Common Use Cases
Market Research
Gather competitive intelligence and market data
Lead Generation
Extract contact information for sales outreach
Price Monitoring
Track competitor pricing and product changes
Content Aggregation
Collect and organize content from multiple sources
Ready to Get Started?
Try Youtube Video Transcript Scraper now on Apify. Free tier available with no credit card required.
Start Free TrialActor Information
- Developer
- alpha-scraper
- Pricing
- Paid
- Total Runs
- 320
- Active Users
- 6
Related Actors
🏯 Tweet Scraper V2 - X / Twitter Scraper
by apidojo
Instagram Scraper
by apify
TikTok Scraper
by clockworks
Instagram Profile Scraper
by apify
Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.
Learn more about ApifyNeed Professional Help?
Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.
Trusted by millions | Money-back guarantee | 24/7 Support