Clutch Scraper Pro

Clutch Scraper Pro

by procoders

The most comprehensive and reliable Clutch scraper on Apify marketplace just by Link. Extract complete company profiles with 50+ data fields, includin...

120 runs
16 users
Try This Actor

Opens on Apify.com

About Clutch Scraper Pro

The most comprehensive and reliable Clutch scraper on Apify marketplace just by Link. Extract complete company profiles with 50+ data fields, including reviews, intelligent deduplication, KV caching, and blazing-fast performance.

What does this actor do?

Clutch Scraper Pro is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.

Key Features

  • Cloud-based execution - no local setup required
  • Scalable infrastructure for large-scale operations
  • API access for integration with your applications
  • Built-in proxy rotation and anti-blocking measures
  • Scheduled runs and webhooks for automation

How to Use

  1. Click "Try This Actor" to open it on Apify
  2. Create a free Apify account if you don't have one
  3. Configure the input parameters as needed
  4. Run the actor and download your results

Documentation

🏆 Clutch Scraper Pro - Most Complete Data Extraction Apify Actor TypeScript License The most comprehensive and reliable Clutch scraper on Apify marketplace. Extract complete company profiles with 70+ data fields including pricing information, detailed business metrics, enhanced review analysis, LinkedIn company pages, business verification details, credit reports, and social media links. Features intelligent deduplication, KV caching, and blazing-fast performance. ## 🚀 Why Choose This Scraper? ### ✨ Why Choose This Scraper? - 70+ Data Fields: The most complete profile coverage on Clutch including pricing & business details - Comprehensive Business Data: Min project size, hourly rates, employee count, founding year 🆕 - Enhanced Review Analysis: Project summaries, feedback analysis, detailed ratings breakdown 🆕 - Client Testimonials: Complete "What Clients Have Said" sections with pricing insights 🆕 - LinkedIn Company Pages: Smart extraction of real LinkedIn company URLs (not policy links) - Business Verification: Complete Business Entity details, Credit Risk Assessment, legal filings - Social Media Links: LinkedIn, Facebook, X (Twitter), Instagram from official social sections - Intelligent Deduplication: Removes duplicates across list pages and runs - Real Domain Extraction: Optional redirect following for accurate domains - KV Profile Cache: Reuse parsed detail pages across runs to save time and CUs - Apify Proxy Support: Use residential/datacenter groups and optional country pinning - Review Sorting: relevance, recent, ratingHighToLow, ratingLowToHigh - Robust Error Handling: Retries, timeouts, and safe fallbacks - Detailed Statistics: Duplicates, errors, speed, runtime summary - Multiple Export Formats: CSV, Excel, JSON, XML with clean, flat columns - Blazing Fast: 90+ companies/minute in LIST_DETAIL in real-world runs ### 📊 Data Fields Extracted #### Company Information - Company name, logo, tagline - Clutch URL and website URL - Real domain (with redirect following option) - Rating and reviews count - Verification status (VERIFIED, PREMIER VERIFIED) - Founded year - Timezone - Full description #### Business Details 🆕 - Min project size: Minimum project budget requirement ($USD) - Hourly rate range: From/to hourly rates ($USD/hr) - Employees count: Team size range (e.g., "50-249", "2-9") - Year founded: Company establishment year - Most common project size: Typical project budget based on reviews - Client testimonials: "What Clients Have Said" summary with pricing insights - Multiple locations: Complete address information with country/region #### Service Breakdown - Services with percentages - Focus areas with percentages - Industries served with percentages - Client focus (size/type) with percentages #### Social Media & Contact - LinkedIn Company Pages: Smart URL extraction (e.g., /company/acme-corp/) - Facebook Pages: Official business pages - X (Twitter): Company Twitter handles - Instagram: Business Instagram accounts - Real website domains (with redirect resolution) #### Verification & Trust - Business Entity: Legal name, status, jurisdiction, formation date, entity ID - Credit Report Results: International Credit Risk Assessment, source, last updated - Verification Status: VERIFIED, PREMIER VERIFIED, GOLD VERIFIED badges - Legal Filings: Bankruptcy, tax liens, judgments, collections count - Last Updated: When verification data was last refreshed #### Reviews (Optional) - Review title and rating - Review date and comments - Reviewer details - Project services, size, and length - Quality, schedule, cost ratings - Willingness to refer rating #### Additional Data - Multiple office addresses - Portfolio items - Custom fields via extension function ## 🎯 Scraping Modes | Mode | Description | Best For | |------|-------------|----------| | LIST | Extract company data from listing pages only | Quick overview, basic info | | LIST_DETAIL | Extract from listings + full detail pages | Complete profiles with all data | | LIST_WEBSITES | Optimized mode - details only for missing websites | Fast extraction with websites | ## 🔧 Input Configuration json { "startUrls": [ { "url": "https://clutch.co/us/agencies/digital-strategy" } ], "scrapeMode": "LIST_DETAIL", "maxResults": 300, "includeReviews": true, "maxReviewsPerCompany": 5, "includePortfolio": true, "exportFormat": "CSV", "followRedirects": true, "clearDataset": true } ### Input Parameters | Parameter | Type | Default | Description | |-----------|------|---------|-------------| | startUrls | Array | Required | Clutch.co URLs to scrape (list or profile pages) | | scrapeMode | String | LIST_DETAIL | Scraping mode: LIST, LIST_DETAIL, LIST_WEBSITES | | includeReviews | Boolean | false | Extract company reviews | | maxReviewsPerCompany | Number | 10 | Maximum reviews per company | | reviewSortBy | String | relevance | Sort reviews: relevance, recent, ratingHighToLow, ratingLowToHigh | | includePortfolio | Boolean | true | Extract portfolio items | | maxResults | Number | - | Maximum total companies to extract | | maxPagesPerSearch | Number | - | Maximum pages per search URL | | maxRequestsPerCrawl | Number | 0 | Maximum total requests (0 = unlimited) | | exportFormat | String | JSON | Export format: JSON, CSV, EXCEL, XML | | followRedirects | Boolean | false | Follow redirects to get real domains | | useApifyProxy | Boolean | false | Use Apify Proxy | | apifyProxyGroups | Array | ["RESIDENTIAL"] | Proxy groups to use | | apifyProxyCountry | String | - | Proxy country code (e.g., US) | | useHeadless | Boolean | true | Run browser in headless mode | | clearDataset | Boolean | false | Clear dataset before run | | extendOutputFunction | String | - | Custom JS function to extend output | | useProfileCache | Boolean | false | Use KV cache for detail pages | | profileCacheTtlDays | Number | 0 | Cache TTL in days (0 = never expire) | | profileCacheVersion | String | "" | Bump to invalidate old cache | ### 🚀 Performance - In a recent LIST_DETAIL benchmark on https://clutch.co/us/agencies/digital-strategy with reviews enabled (max 5 per company), the scraper achieved: - Average speed: ~90 companies/minute - Total unique companies: 9,750 - Runtime: ~108 minutes - Settings: scrapeMode=LIST_DETAIL, includeReviews=true, reviewSortBy=recent, maxReviewsPerCompany=5, exportFormat=CSV Actual speed depends on filters, network, anti-bot pressure, and selected options (reviews/portfolio/proxy). ## 📈 Performance & Statistics After each run, you'll see detailed statistics: ================================================================================ SCRAPING COMPLETED SUCCESSFULLY ================================================================================ 📊 SUMMARY STATISTICS: • Total unique companies collected: 275 • Total companies processed (with duplicates): 436 • Duplicates removed: 161 • Runtime: 3m 51s • Average speed: 71 companies/minute 🔍 SCRAPING DETAILS: • Scrape mode: LIST_DETAIL • Reviews included: Yes (max 3 per company) • Export format: CSV • Exported file: clutch-export-2024-01-15.csv ✅ NO ERRORS DETECTED 🎯 PERFORMANCE METRICS: • Average time per company: 842ms • Total runtime: 231614ms ================================================================================ ## 💾 Output Formats ### CSV Export - Perfectly formatted with all fields as columns - Special handling for arrays (services, industries, etc.) - HTML entities decoded and cleaned - Review and portfolio data in indexed columns ### Output Schema (CSV/JSON) Below is the complete list of fields the scraper can output and how they appear in CSV (flattened) vs JSON (nested): - Top-level fields (LIST and LIST_DETAIL) - companyName (string) - clutchUrl (string) - websiteUrl (string) - domain (string, without www.) - logoUrl (string) - tagline (string) - rating (number) - reviewsCount (number) - verificationStatus (string: VERIFIED | PREMIER VERIFIED | GOLD VERIFIED | empty) - minProjectSize (number, e.g., 25000) 🆕 - hourlyRateFrom (number) 🆕 - hourlyRateTo (number) 🆕 - employees (string, e.g., "50-249", "2-9") 🆕 - yearFounded (number, e.g., 2015) 🆕 - location (string) - Social Media Links 🆕 - JSON: socialLinks: { linkedIn, facebook, x, instagram } - CSV: socialLinks_linkedIn, socialLinks_facebook, socialLinks_x, socialLinks_instagram - Services (list page chips) - JSON: services: [{ name: string, percent|percentage: number }] - CSV: services_{Service_Name} = numeric percentage - Focus Areas (detail) - JSON: focusAreas: [{ name: string, percentage: number }] - CSV: focusAreas_{Area_Name} = numeric percentage - Industries (detail) - JSON: industries: [{ name: string, percentage: number }] - CSV: industries_{Industry_Name} = numeric percentage - Client Focus (detail) - JSON: clientFocus: [{ name: string, percentage: number }] - CSV: clientFocus_{Client_Type} = numeric percentage - Most Common Project Size (detail) - JSON: mostCommonProjectSize (number) - CSV: mostCommonProjectSize - Client Testimonials (detail) 🆕 - JSON: clientTestimonial (string, "What Clients Have Said" section) - CSV: clientTestimonial - Timezone (detail) - JSON: timezone (string) - CSV: timezone - Full Description (detail) - JSON: fullDescription (string) - CSV: fullDescription - Addresses (detail) - JSON: addresses: [{ title, street, locality, region, postalCode, country, phone }] - CSV: addresses_0_title, addresses_0_street, addresses_0_locality, addresses_0_region, addresses_0_postalCode, addresses_0_country, addresses_0_phone, ... (indexed per address) - Portfolio (detail, optional) - JSON: portfolio: [{ image, description }] - CSV: portfolio_0_image, portfolio_0_description, portfolio_1_image, ... - Reviews (detail, optional) 🆕 Enhanced - JSON per review item: - name (string) - rating (number) - date (string) - comments (string; main review quote) - projectSummary (string; detailed project description) 🆕 - feedbackSummary (string; results and outcomes summary) 🆕 - detailedRatings (array; Quality, Schedule, Cost ratings with comments) 🆕 - fullReview (string; complete review text) 🆕 - reviewerName (string) - reviewerPosition (string) - projectServices (string; semicolon-separated) - projectSize (string) - projectLength (string) - CSV (indexed and flattened): - review_0_name, review_0_rating, review_0_date, review_0_comments, review_0_projectSummary, review_0_feedbackSummary, review_0_detailedRatings_0_category, review_0_detailedRatings_0_rating, review_0_detailedRatings_0_comment, review_0_fullReview, review_0_reviewerName, review_0_reviewerPosition, review_0_projectServices, review_0_projectSize, review_0_projectLength, ... - Verification Details (detail) 🆕 - JSON: verificationDetails: { verificationStatus, businessEntity: { name, status, jurisdictionOfFormation, ID, source, lastUpdated, dateOfFormation }, paymentLegalFilings: { bankruptcy, taxLienFilings, judgementFilings, collectionsCount, source, lastUpdated, fullBusinessCreditReport } } - CSV (flattened): - verificationDetails_verificationStatus - verificationDetails_businessEntity_name, verificationDetails_businessEntity_status, verificationDetails_businessEntity_jurisdictionOfFormation, verificationDetails_businessEntity_ID, verificationDetails_businessEntity_source, verificationDetails_businessEntity_lastUpdated, verificationDetails_businessEntity_dateOfFormation - verificationDetails_paymentLegalFilings_bankruptcy, verificationDetails_paymentLegalFilings_taxLienFilings, verificationDetails_paymentLegalFilings_judgementFilings, verificationDetails_paymentLegalFilings_collectionsCount, verificationDetails_paymentLegalFilings_source, verificationDetails_paymentLegalFilings_lastUpdated, verificationDetails_paymentLegalFilings_fullBusinessCreditReport - Credit Report Results (detail) 🆕 - JSON: creditReportResults: { internationalCreditRiskAssessment, source, lastUpdated } - CSV: creditReportResults_internationalCreditRiskAssessment, creditReportResults_source, creditReportResults_lastUpdated - Verification Last Updated 🆕 - JSON/CSV: verificationLastUpdated (string, e.g., "August 8, 2025") Notes: - 🆕 Social media extraction from official "Connect with [Company] on Social" sections - 🆕 Business verification data including legal entity details and credit assessments - CSV экспорт автоматически декодирует HTML-сущности и чистит лишние пробелы - Списки (reviews, portfolio, addresses) индексируются: _0_, _1_, ... - Процентные поля сохраняются числом; фильтрация по ним в таблицах проста ### JSON Output Complete nested structure with all relationships preserved. ## 🛠️ Advanced Usage ### Extract Companies with Specific Filters json { "startUrls": [ { "url": "https://clutch.co/agencies/digital-marketing?employees=10%20-%2049" } ], "scrapeMode": "LIST_DETAIL", "includeReviews": true, "followRedirects": true, "maxResults": 100 } ### Custom Data Extension json { "extendOutputFunction": "($, data) => ({ customScore: data.rating * data.reviewsCount, scrapedAt: new Date().toISOString(), hasPortfolio: data.portfolio?.length > 0 })" } ## 🐛 Error Handling The scraper includes robust error handling: - Continues on individual page failures - Retries failed requests up to 3 times - Handles frame detachment errors - Manages navigation timeouts - Provides detailed error statistics ## 🔄 Deduplication Intelligent deduplication ensures you only get unique companies: - Tracks companies globally across all pages - Removes duplicates automatically - Reports exact number of duplicates removed - Maintains data quality and accuracy ## 🤝 Support - 📧 Email: support@your-email.com - 🐛 Issues: GitHub Issues - 📖 Documentation: Full Docs ## 📄 License ISC License - feel free to use in your projects! --- Note: This scraper is not affiliated with Clutch.co. Please respect their Terms of Service and use responsibly.

Common Use Cases

Market Research

Gather competitive intelligence and market data

Lead Generation

Extract contact information for sales outreach

Price Monitoring

Track competitor pricing and product changes

Content Aggregation

Collect and organize content from multiple sources

Ready to Get Started?

Try Clutch Scraper Pro now on Apify. Free tier available with no credit card required.

Start Free Trial

Actor Information

Developer
procoders
Pricing
Paid
Total Runs
120
Active Users
16
Apify Platform

Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.

Learn more about Apify

Need Professional Help?

Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.

Find a Specialist

Trusted by millions | Money-back guarantee | 24/7 Support