YouTube Transcript Scraper
by igview-owner
Extract complete transcripts from any YouTube video with precise timestamps. Auto-selects English captions. Perfect for AI training, content analysis,...
Opens on Apify.com
About YouTube Transcript Scraper
Extract complete transcripts from any YouTube video with precise timestamps. Auto-selects English captions. Perfect for AI training, content analysis, SEO & research. Export to JSON/CSV/Excel.
What does this actor do?
YouTube Transcript Scraper is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.
Key Features
- Cloud-based execution - no local setup required
- Scalable infrastructure for large-scale operations
- API access for integration with your applications
- Built-in proxy rotation and anti-blocking measures
- Scheduled runs and webhooks for automation
How to Use
- Click "Try This Actor" to open it on Apify
- Create a free Apify account if you don't have one
- Configure the input parameters as needed
- Run the actor and download your results
Documentation
YouTube Transcript Scraper 🎬📝 Extract complete transcripts from YouTube videos with precise timestamps and metadata. This actor processes a single YouTube video and returns the full transcript with timing information, perfect for content analysis, research, accessibility, and video archiving. --- ### 📚 Table of Contents 1. Overview 2. Key Features 3. When to Use This Actor 4. Input Parameters 5. Output Data 6. How to Use on Apify 7. Examples 8. Best Practices 9. FAQ 10. Tags --- ### 🔎 Overview The YouTube Transcript Scraper actor makes it easy to: - Extract complete transcripts from any YouTube video - Get precise timestamps (start/end times in milliseconds) for each text segment - Auto-select English transcripts when default transcript is not available - Receive clean, structured JSON output ready for processing You can export the dataset to JSON, CSV or Excel and plug it into your own tools, dashboards, NLP/AI pipelines or automations. --- ### ✨ Key Features - Flexible input – accepts either a full YouTube URL or just the video ID - Auto-language selection – automatically selects English transcript if default is unavailable - Precise timestamps – includes startMs, endMs, and startTime for each entry - Section headers – captures chapter/section information when available - Duration metadata – includes duration in milliseconds for each transcript entry - Clean, structured output – organized transcript array with consistent fields - Clear error messages – failures are stored as simple, readable error items --- ### 🎯 When to Use This Actor Use this actor when you want to: - Build a transcript dataset for NLP, AI training, or content analysis - Create searchable text from video content - Generate video summaries or content indexes - Enable accessibility features for video content - Archive video transcripts for research or compliance --- ### ⚙️ Input Parameters Your input lives in the actor's INPUT store and is controlled by the Apify UI form. | Field | Type | Required | Description | |------|------|----------|-------------| | videoId | string | Yes | YouTube video ID or full video URL. The actor automatically extracts the video ID. | Notes - videoId accepts both: - Full URL: https://www.youtube.com/watch?v=5MgBikgcWnY - Raw ID: 5MgBikgcWnY - Shorts URL: https://www.youtube.com/shorts/abc123 --- ### 📤 Output Data Each successful dataset item contains the complete transcript for the requested video. | Field | Type | Description | |------|------|-------------| | transcript | array | Array of transcript entries (text segments and section headers). | | language | string | Language of the transcript (only present when auto-selected). | Transcript Entry Fields (type: text) | Field | Type | Description | |------|------|-------------| | type | string | Entry type: text for transcript text. | | text | string | The transcript text content. | | startMs | string | Start time in milliseconds. | | endMs | string | End time in milliseconds. | | startTime | string | Formatted start time (e.g., 0:05). | | duration | number | Duration in milliseconds. | Section Header Fields (type: section) | Field | Type | Description | |------|------|-------------| | type | string | Entry type: section for chapter headers. | | headline | string | Section/chapter title. | | startMs | string | Start time in milliseconds. | | endMs | string | End time in milliseconds. | If the actor cannot fetch a transcript for the video, you will see an error item: | Field | Type | Description | |------|------|-------------| | videoId | string | YouTube video ID. | | videoUrl | string | Full video URL. | | status | string | no_transcript or failed. | | message | string | Human-readable status message. | | availableTranscripts | array | List of available transcript options (if any). | | error | string | Error message (for failed status). | | httpStatus | number | HTTP status code (for failed status). | --- ### 🚀 How to Use on Apify 1. Open the actor on Apify. 2. Click Run. 3. In the input form: - Paste a YouTube video URL or ID into the videoId field. 4. Start the run. 5. When the run finishes, open the Dataset tab to see the transcript. 6. Export the dataset as JSON, CSV or Excel for further use. --- ### 💡 Examples #### Example 1 – Using full YouTube URL json { "videoId": "https://www.youtube.com/watch?v=5MgBikgcWnY" } #### Example 2 – Using video ID only json { "videoId": "5MgBikgcWnY" } #### Sample Output json { "transcript": [ { "type": "text", "text": "Hello and welcome to this video", "startMs": "0", "endMs": "3500", "startTime": "0:00", "duration": 3500 }, { "type": "section", "headline": "Introduction", "startMs": "5000", "endMs": "60000" } ] } --- ### ✅ Best Practices - Use public YouTube videos that are available worldwide. - Prefer full video URLs if you are unsure about the video ID format. - The actor auto-selects English transcripts when the default is unavailable. - After the run finishes, export the dataset for processing in external tools. --- ### ❓ FAQ Q: Does this actor download the actual video? A: No, the actor only extracts the transcript text and metadata. No video files are downloaded. Q: Can I scrape multiple videos in one run? A: This actor is designed for one video per run. For multiple videos, create a workflow that runs the actor for each video ID. Q: What languages are supported? A: The actor fetches the default transcript. If unavailable, it auto-selects English if available. Other languages are listed in availableTranscripts. Q: What happens if no transcript is available? A: The actor returns a no_transcript status with a list of available transcript options (if any). Q: Why do I see error items in the dataset? A: Error items appear when the transcript service cannot return data (invalid video ID, unavailable video, or technical issues). --- ### 🏷️ Tags youtube transcript scraper, youtube transcript api, youtube captions extractor, youtube transcript dataset, download youtube transcript, youtube video transcript, youtube text extraction, video transcript scraper, youtube subtitle text, youtube transcript downloader --- ## 🚀 Get Started Now Ready to extract YouTube video transcripts? 1. 🎥 Enter a YouTube video URL or ID 2. ▶️ Click Start to run the actor 3. 📥 Download your transcript in JSON, CSV, or Excel format 4. Click "Try for free" to test the actor ⭐ Star this actor if it helps you extract YouTube transcripts efficiently! --- ## Related Actors 🤝 Looking for more YouTube data? Check out these complementary actors: - YouTube Video Downloader - YouTube Comment Scraper - YouTube Search Scraper - YouTube Video Scraper - YouTube Subtitles Scraper --- Built with ❤️ by Sachin Kumar Yadav using Apify Platform
Categories
Common Use Cases
Market Research
Gather competitive intelligence and market data
Lead Generation
Extract contact information for sales outreach
Price Monitoring
Track competitor pricing and product changes
Content Aggregation
Collect and organize content from multiple sources
Ready to Get Started?
Try YouTube Transcript Scraper now on Apify. Free tier available with no credit card required.
Start Free TrialActor Information
- Developer
- igview-owner
- Pricing
- Paid
- Total Runs
- 37
- Active Users
- 5
Related Actors
Google Search Results Scraper
by apify
Website Content Crawler
by apify
🔥 Leads Generator - $3/1k 50k leads like Apollo
by microworlds
Video Transcript Scraper: Youtube, X, Facebook, Tiktok, etc.
by invideoiq
Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.
Learn more about ApifyNeed Professional Help?
Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.
Trusted by millions | Money-back guarantee | 24/7 Support