Site Health Scanner

Site Health Scanner

by constant_quadruped

Crawl a website to detect broken and problematic links, identify redirects and blocked URLs, capture screenshots, and return structured site health da...

26 runs
2 users
Try This Actor

Opens on Apify.com

About Site Health Scanner

Crawl a website to detect broken and problematic links, identify redirects and blocked URLs, capture screenshots, and return structured site health data for audits, automation, and monitoring.

What does this actor do?

Site Health Scanner is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.

Key Features

  • Cloud-based execution - no local setup required
  • Scalable infrastructure for large-scale operations
  • API access for integration with your applications
  • Built-in proxy rotation and anti-blocking measures
  • Scheduled runs and webhooks for automation

How to Use

  1. Click "Try This Actor" to open it on Apify
  2. Create a free Apify account if you don't have one
  3. Configure the input parameters as needed
  4. Run the actor and download your results

Documentation

Site Health Scanner πŸ” Find broken links before Google does. Automatically crawl your site, detect real issues, and capture screenshots as proof. --- ## TL;DR - Crawls your site and checks every link (internal + optional external) - Clearly separates broken links from bot-blocked external sites - Captures screenshots of broken pages for verification - Outputs clean, structured data for automation, reporting, or client delivery --- ## What does it do? Site Health Scanner crawls your website and checks every discovered link β€” internal pages, external URLs, images, scripts, and stylesheets. When issues are found, it doesn’t just report status codes. It: - classifies why a link failed - flags confidence level - captures screenshots for broken pages - avoids false positives from bot-protected external sites This makes the output usable for automation, SEO audits, and client reporting. --- ## Common use cases - SEO professionals auditing client sites - Web agencies delivering site health reports - Site owners monitoring link integrity - QA teams validating sites before launch --- ## Features | Feature | Description | |------|-------------| | πŸ”— Broken link detection | Finds real 4xx and 5xx errors across the site | | 🚫 Bot-block detection | Distinguishes broken links from external bot protection | | πŸ“Έ Screenshot capture | Takes screenshots of broken pages automatically | | β†ͺ️ Redirect chain tracking | Detects redirect chains and loops | | ⚠️ Health warnings | Flags mixed content and slow responses | | ⏱️ Response time monitoring | Records response times per resource | | 🌐 External link checking | Optionally checks outbound links | --- ## Cost estimate | Site type | Pages | Est. time | Est. cost | |---------|-------|-----------|-----------| | Small blog | ~50 | 2–3 min | $0.02–0.05 | | Business site | ~200 | 8–12 min | $0.10–0.20 | | E-commerce | ~1,000 | 30–45 min | $0.50–1.00 | | Large portal | ~5,000 | 2–3 hrs | $2.00–4.00 | Based on Apify platform pricing. Actual costs vary by site complexity and settings. --- ## Input | Field | Type | Description | Default | |------|------|-------------|---------| | startUrls | array | URLs to start crawling | Required | | maxDepth | integer | Crawl depth (0–10) | 3 | | maxPages | integer | Max pages to crawl | 100 | | checkExternalLinks | boolean | Validate external links | true | | screenshotBrokenPages | boolean | Capture screenshots | true | | followRedirects | boolean | Track redirect chains | true | | timeout | integer | Request timeout (seconds) | 30 | | includeWarnings | boolean | Include health warnings | true | | userAgent | string | Custom user agent | "" | ### Example input ```json { "startUrls": [{ "url": "https://example.com" }], "maxDepth": 3, "maxPages": 500, "checkExternalLinks": true, "screenshotBrokenPages": true }

Common Use Cases

Market Research

Gather competitive intelligence and market data

Lead Generation

Extract contact information for sales outreach

Price Monitoring

Track competitor pricing and product changes

Content Aggregation

Collect and organize content from multiple sources

Ready to Get Started?

Try Site Health Scanner now on Apify. Free tier available with no credit card required.

Start Free Trial

Actor Information

Developer
constant_quadruped
Pricing
Paid
Total Runs
26
Active Users
2
Apify Platform

Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.

Learn more about Apify

Need Professional Help?

Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.

Find a Specialist

Trusted by millions | Money-back guarantee | 24/7 Support