Video Transcript Scraper: Youtube, X, Facebook, Tiktok, etc.
by invideoiq
Need to pull clean, structured transcripts from videos across YouTube, X (Twitter), Facebook, TikTok, and other platforms? This scraper handles it. I ...
Opens on Apify.com
About Video Transcript Scraper: Youtube, X, Facebook, Tiktok, etc.
Need to pull clean, structured transcripts from videos across YouTube, X (Twitter), Facebook, TikTok, and other platforms? This scraper handles it. I use it to grab transcripts in any available language, saving hours of manual work or dealing with unreliable APIs. It outputs data in both JSON and a plain text format that’s ready to feed directly into large language models (LLMs) and other AI systems. That makes it incredibly useful for a few key things: training conversational agents, performing content analysis at scale, and gathering data for academic or market research. Instead of getting messy, inconsistent text, you get a structured dataset you can actually use. The setup is straightforward. You give it video URLs, and it returns the transcripts, the detected language, and other metadata. Having the data in multiple formats means I can plug the JSON into my analytics pipelines or dump the clean text straight into an AI model for summarization or sentiment analysis. It’s become a core part of my toolkit for any project that needs to turn video content into actionable text data.
What does this actor do?
Video Transcript Scraper: Youtube, X, Facebook, Tiktok, etc. is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.
Key Features
- Cloud-based execution - no local setup required
- Scalable infrastructure for large-scale operations
- API access for integration with your applications
- Built-in proxy rotation and anti-blocking measures
- Scheduled runs and webhooks for automation
How to Use
- Click "Try This Actor" to open it on Apify
- Create a free Apify account if you don't have one
- Configure the input parameters as needed
- Run the actor and download your results
Documentation
🎬 Video Transcript Scraper: Turn Video Content into Actionable Insights! Unlock the potential of videos across multiple platforms, YouTube, X (Twitter), TikTok, Facebook, Dailymotion, Vimeo, Loom and more! With our powerful, Apify actor, you can extract clean, AI-ready transcripts / captions and key video details with a single click. This tool is your key to transforming raw video content into valuable insights, perfect for AI applications, data analytics, and beyond 💥 --- ## ✨ What is Video Transcript Scraper ? Video Transcript Scraper is the ultimate video to text solution for extracting unlimited subtitles / captions from online videos, giving you all the essential information at your fingertips! Need full transcripts, timestamps, or video metadata? We've got it all. This actor’s precise, reliable data is presented in both JSON and LLM-ready formats, ready for instant integration into AI-powered applications Note: Some videos do not provide subtitles/captions. For those cases, use our speech‑to‑text companion actor: Video Transcriber. --- ## 📈 Why This Scraper / API ? Here’s What It Extracts… A comprehensive data set that’s perfect for various use cases: - Video Metadata: Instantly access video titles, descriptions, and thumbnails to help you contextualize the content. - Timestamped Transcripts: Get every word with precise timestamps, ideal for indexing and AI models. - LLM-Ready Data: Tailor-made for Large Language Models (LLMs) and AI applications, this data is pre-cleaned and ready to use. Ideal for NLP, sentiment analysis, or any ML workflow. #### 🚀 Key Advantages - 🌐 Compatible with multiple platforms: YouTube, X, Tiktok, Dailymotion, Vimeo, you name it, we scrap it - 🤖 AI-Ready Output: Designed for quick integration into ML and AI applications. - 📊 Rich Metadata: Get all the essential video details in one place. --- ## 🔗 Supported Platforms This video transcript scraper has been extensively tested on the following platforms: - YouTube - X (ex Twitter) (just copy the tweet video link) - TikTok - Facebook (public videos) - Dailymotion - Vimeo - Loom Important: Instagram is not supported in this actor. To transcribe Instagram videos, use Video Transcriber. The tool may also work on other platforms not listed here, like media websites, though they have not been as thoroughly tested. --- ## 🚀 Simple Inputs, Powerful Results Using this actor couldn’t be easier! Just enter: - video_urls: Array of video links; processes multiple videos efficiently. - ~~video_url~~: (Deprecated) Single video URL. Kept to avoid introducing breaking changes for user experience; use video_urls when possible. - Language (optional): Specify if you want a particular transcript language, e.g. "en", "fr". It will be matched with a track that is in that language. Leave blank to get the default language or verify that the chosen language is available for the video. We recommand not using the language input if you are unsure. For youtube videos you can check out the available_languages field in the JSON output. Advanced for Youtube videos (workarounds; you rarely need these): - use_residential_proxy_for_yt (optional): This is a workaround for some non‑English YouTube videos with only auto‑generated captions. If you enable this, if unable to retrieve the transcript, the actor will fallback to a different method that uses an Apify residential proxy to retrieve the transcript, which will incur extra charges, but only for the videos which the actor was unable to retrieve transcript from. - proxy_country (optional): Two‑letter country code ([ISO 3166‑1 alpha‑2]) used when the residential proxy is enabled for YouTube. Default is US. If a proxy isn’t available for the selected country, the actor automatically falls back to US. Example values: US, DE, FR, GB, JP, IN. - get_yt_original_metadata (optional): Retrieves the original YouTube metadata (e.g., title/description) in the creator’s original language rather than a localized variant. Useful for metadata fidelity; typically not required. - youtube_api_key (optional, secret): Your YouTube Data API v3 key, used only when get_yt_original_metadata is enabled. If not provided, the actor uses a bundled default key (subject to shared daily quota). We recommend supplying your own key. See Google’s guide: Registering an application. ### Sample Input json { "video_urls": ["https://www.youtube.com/watch?v=example1"], "language": "en" } ### Output json [ { "title": "Video Title 1", "description": "Video Description 1", "duration": "180", "thumbnail": "https://example.com/thumbnail.jpg", "like_count": 200, "view_count": 1000, "published_date": "2025-08-10", "channel": "Example", "channel_url": "https://example.com/video-id=example", "channel_id": "A123456789", "available_languages": ["English", "French"], "selected_language": "English", "text": "Hello, world! Welcome to our channel.", "transcript": [ { "text": "Hello, world!", "start": 0.0, "end": 1.5 }, { "text": "Welcome to our channel.", "start": 1.5, "end": 3.0 } ] } ] ### 🔍 Important consideration The following fields are only available for youtube videos: ["like_count", "channel", "channel_url", "available_languages", "selected_language"] --- ## ⚙️ Ready-to-Use Request Examples Quickly integrate and get started with just a few lines: ### Curl bash curl -X POST "https://api.apify.com/v2/acts/invideoiq~video-transcript-scraper/run-sync?token=YOUR_TOKEN" \ -H "Content-Type: application/json" \ -d '{"video_urls": ["https://www.youtube.com/watch?v=vl6jn-DdafM", "https://www.youtube.com/watch?v=4czjS9h4Fpg"]}' ### Python python import requests import json url = "https://api.apify.com/v2/acts/invideoiq~video-transcript-scraper/run-sync?token=YOUR_TOKEN" data = {"video_urls": ["https://www.youtube.com/watch?v=vl6jn-DdafM", "https://www.youtube.com/watch?v=4czjS9h4Fpg"]} response = requests.post(url, json=data, headers={"Content-Type": "application/json"}) print(json.dumps(response.json(), indent=4, ensure_ascii=False)) ### Javascript javascript const data = { "video_urls": ["https://www.youtube.com/watch?v=vl6jn-DdafM", "https://www.youtube.com/watch?v=4czjS9h4Fpg"] }; console.log(JSON.stringify(data)) fetch('https://api.apify.com/v2/acts/invideoiq~video-transcript-scraper/run-sync?token=YOUR_TOKEN', { method: 'POST', headers: { 'Content-Type': 'application/json' }, body: JSON.stringify(data) }) .then(response => response.json()) .then(json => console.log(json)) .catch(error => console.error('Error:', error)); --- ## 🚑 Troubleshooting Encountering issues? Here are some common solutions: - Invalid URL: Ensure the video URL is complete and valid, as the actor supports only direct links from YouTube, X (a tweet), Vimeo, etc ... - Transcript Not Found: Please ensure that the video has subtitles available (either manually added or auto-generated). If no subtitles are present, the transcript cannot be extracted. If the video has no subtitles/captions, use our speech‑to‑text companion actor: Video Transcriber. - Language Not Supported: Sometimes the selected subtitles language is not available. In which case, you will get a clear message stating that. Verify that the chosen language is available for the video. We recommand not using the language input if you are unsure. For youtube videos you can check out the available_languages field in the JSON output. If issues persist, please reach out to us, we will try and fix ASAP. --- ## 🔥 Why Wait? Start Extracting Valuable Video Data Now! Stop missing out on the insights hidden within video content. Turn hours of video into actionable data with Video Transcript Scraper today! Whether for AI development, market research, or content analytics, this tool gets you the data you need fast.
Categories
Common Use Cases
Market Research
Gather competitive intelligence and market data
Lead Generation
Extract contact information for sales outreach
Price Monitoring
Track competitor pricing and product changes
Content Aggregation
Collect and organize content from multiple sources
Ready to Get Started?
Try Video Transcript Scraper: Youtube, X, Facebook, Tiktok, etc. now on Apify. Free tier available with no credit card required.
Start Free TrialActor Information
- Developer
- invideoiq
- Pricing
- Paid
- Total Runs
- 13,561,814
- Active Users
- 1,362
Related Actors
Google Search Results Scraper
by apify
Website Content Crawler
by apify
🔥 Leads Generator - $3/1k 50k leads like Apollo
by microworlds
Smart Article Extractor
by lukaskrivka
Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.
Learn more about ApifyNeed Professional Help?
Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.
Trusted by millions | Money-back guarantee | 24/7 Support