Video Link Crawler

Video Link Crawler

by infoweaver

Effortlessly discover and extract video links from any website with our powerful Video Link Crawler within few seconds. Starting from a specified URL,...

6,106 runs
272 users
Try This Actor

Opens on Apify.com

About Video Link Crawler

Effortlessly discover and extract video links from any website with our powerful Video Link Crawler within few seconds. Starting from a specified URL, it navigates through web pages, identifies video content, and compiles structured datasets.! Try it Now!

What does this actor do?

Video Link Crawler is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.

Key Features

  • Cloud-based execution - no local setup required
  • Scalable infrastructure for large-scale operations
  • API access for integration with your applications
  • Built-in proxy rotation and anti-blocking measures
  • Scheduled runs and webhooks for automation

How to Use

  1. Click "Try This Actor" to open it on Apify
  2. Create a free Apify account if you don't have one
  3. Configure the input parameters as needed
  4. Run the actor and download your results

Documentation

🎥 Video Link Crawler The Video Link Crawler is a powerful web crawler designed to extract 🎞️ video links from websites, starting from a specified URL, it navigates through web pages, discovers video content, and compiles the extracted links into a structured dataset. It supports customizable crawling settings and efficient video extraction. Whether you're building a video catalog or analyzing video trends, this tool is your go-to solution for efficient video discovery. --- ## 📝 Description The Video Link Crawler specializes in extracting video URLs from various HTML elements, including: - <video> and <source> tags - <embed> and <object> tags - <iframe> tags (e.g., YouTube, Vimeo) - <a> tags linking to video files (e.g., .mp4, .webm) With advanced features like proxy support, crawl depth control, and retry mechanisms, it adapts seamlessly to your web scraping needs. Extracted data is stored in an Apify Dataset, ready for analysis or integration into your workflows. --- ## ✨ Key Features - Customizable Crawling ⬩ Define starting URLs ⬩ Restrict crawling using glob patterns ⬩ Set maximum crawl depth and request limits - Video URL Extraction ⬩ Extract video links from multiple HTML tags ⬩ Convert relative URLs to absolute links for seamless results - Proxy Support ⬩ Ensure anonymity with Apify Proxy ⬩ Optionally specify proxy groups - Error Handling ⬩ Detailed logs for failed requests ⬩ Handles redirect loops and HTTP 403 errors gracefully - Data Output ⬩ Structured and easy-to-use dataset with video links --- ## ⚙️ How It Works - Initialization Reads input parameters such as startUrl, glob patterns, and crawl depth to configure the crawler. - Crawling Begins at the startUrl, follows links matching the glob patterns, and stops upon reaching the crawl limits. - Video Extraction Detects video links from supported tags and resolves relative URLs to absolute ones. - Data Storage Extracted URLs and video links are saved to an Apify Dataset. - Error Logging Logs all failed or skipped requests for review and troubleshooting. --- ## 🔧 Input Configuration The Video Link Crawler accepts the following input parameters in JSON format: | Field | Type | Required | Description | |----------------------|------------|----------|----------------------------------------------------------------------------------------------| | startUrl | string | Yes | The URL where the crawler begins. | | globs | string[] | Yes | Array of glob patterns to restrict crawling to specific domains or paths. | | maxRequestsPerCrawl| number | Optional | Maximum number of requests the crawler processes. | | maxCrawlDepth | number | Optional | Maximum depth of pages to crawl from the starting URL. | | proxyConfig | object | Optional | Proxy configuration for anonymized crawling (e.g., Apify Proxy). | ### 🛡️ Proxy Configuration - useApifyProxy (boolean): Enable Apify Proxy for anonymized crawling. - apifyProxyGroups (string[]): Specify proxy groups for more granular control (optional). --- ## 📝 Example Input json { "startUrl": "https://example.com", "globs": ["https://example.com/**"], "maxRequestsPerCrawl": 50, "maxCrawlDepth": 3, "proxyConfig": { "useApifyProxy": true, "apifyProxyGroups": ["SHARED"] } } --- ## 📄 Example Output The output of the Actor is saved to an Apify Dataset in the following structured format: json [ { "url": "https://example.com/page1", "videoLinks": [ "https://example.com/videos/video1.mp4", "https://youtube.com/watch?v=abc123" ] }, { "url": "https://example.com/page2", "videoLinks": [] } ] ## ℹ️ Need Help? If you have any questions, suggestions, or need assistance, feel free to: 💬 Open an issue in the Issues section. 📧 Contact us: infoweaver7@gmail.com

Common Use Cases

Market Research

Gather competitive intelligence and market data

Lead Generation

Extract contact information for sales outreach

Price Monitoring

Track competitor pricing and product changes

Content Aggregation

Collect and organize content from multiple sources

Ready to Get Started?

Try Video Link Crawler now on Apify. Free tier available with no credit card required.

Start Free Trial

Actor Information

Developer
infoweaver
Pricing
Paid
Total Runs
6,106
Active Users
272
Apify Platform

Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.

Learn more about Apify

Need Professional Help?

Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.

Find a Specialist

Trusted by millions | Money-back guarantee | 24/7 Support