Site Health Scanner
by constant_quadruped
Crawl a website to detect broken and problematic links, identify redirects and blocked URLs, capture screenshots, and return structured site health da...
Opens on Apify.com
About Site Health Scanner
Crawl a website to detect broken and problematic links, identify redirects and blocked URLs, capture screenshots, and return structured site health data for audits, automation, and monitoring.
What does this actor do?
Site Health Scanner is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.
Key Features
- Cloud-based execution - no local setup required
- Scalable infrastructure for large-scale operations
- API access for integration with your applications
- Built-in proxy rotation and anti-blocking measures
- Scheduled runs and webhooks for automation
How to Use
- Click "Try This Actor" to open it on Apify
- Create a free Apify account if you don't have one
- Configure the input parameters as needed
- Run the actor and download your results
Documentation
Site Health Scanner π Find broken links before Google does. Automatically crawl your site, detect real issues, and capture screenshots as proof. --- ## TL;DR - Crawls your site and checks every link (internal + optional external) - Clearly separates broken links from bot-blocked external sites - Captures screenshots of broken pages for verification - Outputs clean, structured data for automation, reporting, or client delivery --- ## What does it do? Site Health Scanner crawls your website and checks every discovered link β internal pages, external URLs, images, scripts, and stylesheets. When issues are found, it doesnβt just report status codes. It: - classifies why a link failed - flags confidence level - captures screenshots for broken pages - avoids false positives from bot-protected external sites This makes the output usable for automation, SEO audits, and client reporting. --- ## Common use cases - SEO professionals auditing client sites - Web agencies delivering site health reports - Site owners monitoring link integrity - QA teams validating sites before launch --- ## Features | Feature | Description | |------|-------------| | π Broken link detection | Finds real 4xx and 5xx errors across the site | | π« Bot-block detection | Distinguishes broken links from external bot protection | | πΈ Screenshot capture | Takes screenshots of broken pages automatically | | βͺοΈ Redirect chain tracking | Detects redirect chains and loops | | β οΈ Health warnings | Flags mixed content and slow responses | | β±οΈ Response time monitoring | Records response times per resource | | π External link checking | Optionally checks outbound links | --- ## Cost estimate | Site type | Pages | Est. time | Est. cost | |---------|-------|-----------|-----------| | Small blog | ~50 | 2β3 min | $0.02β0.05 | | Business site | ~200 | 8β12 min | $0.10β0.20 | | E-commerce | ~1,000 | 30β45 min | $0.50β1.00 | | Large portal | ~5,000 | 2β3 hrs | $2.00β4.00 | Based on Apify platform pricing. Actual costs vary by site complexity and settings. --- ## Input | Field | Type | Description | Default | |------|------|-------------|---------| | startUrls | array | URLs to start crawling | Required | | maxDepth | integer | Crawl depth (0β10) | 3 | | maxPages | integer | Max pages to crawl | 100 | | checkExternalLinks | boolean | Validate external links | true | | screenshotBrokenPages | boolean | Capture screenshots | true | | followRedirects | boolean | Track redirect chains | true | | timeout | integer | Request timeout (seconds) | 30 | | includeWarnings | boolean | Include health warnings | true | | userAgent | string | Custom user agent | "" | ### Example input ```json { "startUrls": [{ "url": "https://example.com" }], "maxDepth": 3, "maxPages": 500, "checkExternalLinks": true, "screenshotBrokenPages": true }
Categories
Common Use Cases
Market Research
Gather competitive intelligence and market data
Lead Generation
Extract contact information for sales outreach
Price Monitoring
Track competitor pricing and product changes
Content Aggregation
Collect and organize content from multiple sources
Ready to Get Started?
Try Site Health Scanner now on Apify. Free tier available with no credit card required.
Start Free TrialActor Information
- Developer
- constant_quadruped
- Pricing
- Paid
- Total Runs
- 26
- Active Users
- 2
Related Actors
Video Transcript Scraper: Youtube, X, Facebook, Tiktok, etc.
by invideoiq
Linkedin Profile Details Scraper + EMAIL (No Cookies Required)
by apimaestro
Twitter (X.com) Scraper Unlimited: No Limits
by apidojo
Content Checker
by jakubbalada
Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.
Learn more about ApifyNeed Professional Help?
Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.
Trusted by millions | Money-back guarantee | 24/7 Support