Website Contact & Socials Extractor

Name: Website Contact & Socials Extractor
Author: embion

by embion

Crawl company websites and extract emails, phone numbers and links to Discord, Facebook, Instagram, LinkedIn, Pinterest, Reddit, Snapchat, Telegram, T...

29,437 runs

650 users

Try This Actor

Opens on Apify.com

About Website Contact & Socials Extractor

Crawl company websites and extract emails, phone numbers and links to Discord, Facebook, Instagram, LinkedIn, Pinterest, Reddit, Snapchat, Telegram, TikTok, Twitch, Twitter/X and YouTube. 2 hour trial available.

What does this actor do?

Website Contact & Socials Extractor is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.

Key Features

Cloud-based execution - no local setup required
Scalable infrastructure for large-scale operations
API access for integration with your applications
Built-in proxy rotation and anti-blocking measures
Scheduled runs and webhooks for automation

How to Use

Click "Try This Actor" to open it on Apify
Create a free Apify account if you don't have one
Configure the input parameters as needed
Run the actor and download your results

Documentation

Website Contact & Socials Extractor ## Get emails, phone numbers, and social media links from the company websites A crawler designed for lead generation and market research. Built for our own needs while collecting datasets for our directory of corporate service providers, it incorporates years of experience in web scraping. Now, we're making it available to you. ## 🚀 How to start 1. Log into your Apify account. 2. Enter links to target websites in "Start URLs" input field. 3. Enable proxy settings if needed. 4. Click start button. 5. Wait until the job is complete. 6. Collect results from the Default dataset ("Output" tab) ## 🎯 Features Actor automatically detects each type of contact information and deduplicates it in the output, making it easy to process and analyze the collected data. Here's what makes it stand out from the crowd: - Domain-based grouping: actor automatically organizes extracted data by domain. - Emails, phones and social media platforms: actor extracts emails, phone numbers and links to Discord, Facebook, Instagram, LinkedIn, Pinterest, Reddit, Snapchat, Telegram, TikTok, Twitch, Twitter/X, YouTube. - Adjustable crawl limits: actor allows setting the limit on how many pages it will crawl per domain and how deep it will go into the website. Algorithm prioritises "team" and "about us" pages, making useful data available with less pages crawled. - Cost-efficiency: written in Rust with performance in mind. Actor skips full browser rendering, which reduces memory use and speeds up crawling, allowing more work on less hardware. Limitation: actor only validates the format of links found on the target website; it does not verify whether the contact information is active or correct. ## 💫 Leave a review or report an issue If this actor helps your workflow, you can leave a review on its Apify page. A review shows us how you use the actor, which helps us prioritize improvements that support your project. Reviews also show other Apify users what kinds of projects this actor handles well. If the actor misses data or behaves unexpectedly, open an Issue. Add one or two example URLs so we can check the exact case and respond quickly. ## 🛠 Troubleshooting ### Why hasn't the actor discovered any emails, even though I can clearly see one on the website? Some webmasters use services like Cloudflare to protect their websites. These services can hide email addresses from crawlers like ours. That's expected behavior, and we currently can't bypass it. Still, feel free to share your case with us: we're happy to take a look and help if we can. ### Why hasn't the actor discovered any pages? There are a few possible reasons: - Different subdomain: the actor treats each subdomain as a separate website. It won't follow links from `domain.com` to `second.domain.com`. If the content you need is on another subdomain, include a direct link to it in the starting URL. The only exception is `www.`: the actor treats `www.example.com` and `example.com` as the same. - JavaScript-rendered website: if the page loads content using JavaScript, the actor won't see it because it doesn't support JavaScript rendering at the moment. - Blocked or missing links: if no internal links are found on the starting URL, the actor can't proceed to other pages. ## ⭐ Premium support Not sure if our actor fits your case? The developer is around to help. You can reach out directly on Telegram: https://t.me/nikmadebeykin ## 📒 Supported Contact Types The actor detects and extracts the following types of contact information: ### Basic Contact Information - Email addresses. - Phone numbers (outputs E164 standard). ### Social Media Platforms - Discord. - Facebook. - Instagram. - LinkedIn. - Pinterest. - Reddit. - Snapchat. - Telegram. - TikTok. - Twitch. - Twitter/X. - YouTube. ### Business Listings and Reviews - Google Maps. - TripAdvisor. - Trustpilot. - Yelp. ### Messaging Platforms - WhatsApp. ## 🏗 Output data format Default output dataset will include a table with the following rows: { "domain": { "label": "Domain", "format": "text" }, "emails": { "label": "Emails", "format": "array" }, "phones": { "label": "Phone Numbers", "format": "array" }, "possible_phones": { "label": "Possible Phone Numbers", "format": "array" }, "discord_urls": { "label": "Discord", "format": "array" }, "facebook_urls": { "label": "Facebook", "format": "array" }, "google_maps_urls": { "label": "Google Maps", "format": "array" }, "instagram_urls": { "label": "Instagram", "format": "array" }, "linkedin_urls": { "label": "LinkedIn", "format": "array" }, "pinterest_urls": { "label": "Pinterest", "format": "array" }, "reddit_urls": { "label": "Reddit", "format": "array" }, "snapchat_urls": { "label": "Snapchat", "format": "array" }, "telegram_urls": { "label": "Telegram", "format": "array" }, "tiktok_urls": { "label": "TikTok", "format": "array" }, "tripadvisor_urls": { "label": "Tripadvisor", "format": "array" }, "trustpilot_urls": { "label": "Trustpilot", "format": "array" }, "twitch_urls": { "label": "Twitch", "format": "array" }, "whatsapp_urls": { "label": "WhatsApp", "format": "array" }, "twitter_urls": { "label": "Twitter", "format": "array" }, "yelp_urls": { "label": "Yelp", "format": "array" }, "youtube_urls": { "label": "YouTube", "format": "array" }, "pages_crawled": { "label": "Source pages", "format": "number" }, "max_depth_reached": { "label": "Depth reached", "format": "number" }, "start_urls": { "label": "Starting URLs", "format": "array" } } ## ⚖️ Legal Consider the legal implications of scraping contact information and verify the legality of scraping each target website. Ensure compliance with applicable laws and website terms of service before using this tool. The authors assume no liability for improper use.

Common Use Cases

Market Research

Gather competitive intelligence and market data

Lead Generation

Extract contact information for sales outreach

Price Monitoring

Track competitor pricing and product changes

Content Aggregation

Collect and organize content from multiple sources

Ready to Get Started?

Try Website Contact & Socials Extractor now on Apify. Free tier available with no credit card required.

Start Free Trial

Actor Information

Developer: embion
Pricing: Paid
Total Runs: 29,437
Active Users: 650

Related Actors

🏯 Tweet Scraper V2 - X / Twitter Scraper

by apidojo

Google Search Results Scraper

by apify

Instagram Profile Scraper

by apify

Tweet Scraper|$0.25/1K Tweets | Pay-Per Result | No Rate Limits

by kaitoeasyapi

Browse All Actors

Apify Platform

Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.

Learn more about Apify

Need Professional Help?

Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.

Find a Specialist

Trusted by millions | Money-back guarantee | 24/7 Support