Website Contact & Socials Extractor

Website Contact & Socials Extractor

by embion

Crawl company websites and extract emails, phone numbers and links to Discord, Facebook, Instagram, LinkedIn, Pinterest, Reddit, Snapchat, Telegram, T...

29,437 runs
650 users
Try This Actor

Opens on Apify.com

About Website Contact & Socials Extractor

Crawl company websites and extract emails, phone numbers and links to Discord, Facebook, Instagram, LinkedIn, Pinterest, Reddit, Snapchat, Telegram, TikTok, Twitch, Twitter/X and YouTube. 2 hour trial available.

What does this actor do?

Website Contact & Socials Extractor is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.

Key Features

  • Cloud-based execution - no local setup required
  • Scalable infrastructure for large-scale operations
  • API access for integration with your applications
  • Built-in proxy rotation and anti-blocking measures
  • Scheduled runs and webhooks for automation

How to Use

  1. Click "Try This Actor" to open it on Apify
  2. Create a free Apify account if you don't have one
  3. Configure the input parameters as needed
  4. Run the actor and download your results

Documentation

Website Contact & Socials Extractor ## Get emails, phone numbers, and social media links from the company websites A crawler designed for lead generation and market research. Built for our own needs while collecting datasets for our directory of corporate service providers, it incorporates years of experience in web scraping. Now, we're making it available to you. ## 🚀 How to start 1. Log into your Apify account. 2. Enter links to target websites in "Start URLs" input field. 3. Enable proxy settings if needed. 4. Click start button. 5. Wait until the job is complete. 6. Collect results from the Default dataset ("Output" tab) ## 🎯 Features Actor automatically detects each type of contact information and deduplicates it in the output, making it easy to process and analyze the collected data. Here's what makes it stand out from the crowd: - Domain-based grouping: actor automatically organizes extracted data by domain. - Emails, phones and social media platforms: actor extracts emails, phone numbers and links to Discord, Facebook, Instagram, LinkedIn, Pinterest, Reddit, Snapchat, Telegram, TikTok, Twitch, Twitter/X, YouTube. - Adjustable crawl limits: actor allows setting the limit on how many pages it will crawl per domain and how deep it will go into the website. Algorithm prioritises "team" and "about us" pages, making useful data available with less pages crawled. - Cost-efficiency: written in Rust with performance in mind. Actor skips full browser rendering, which reduces memory use and speeds up crawling, allowing more work on less hardware. Limitation: actor only validates the format of links found on the target website; it does not verify whether the contact information is active or correct. ## 💫 Leave a review or report an issue If this actor helps your workflow, you can leave a review on its Apify page. A review shows us how you use the actor, which helps us prioritize improvements that support your project. Reviews also show other Apify users what kinds of projects this actor handles well. If the actor misses data or behaves unexpectedly, open an Issue. Add one or two example URLs so we can check the exact case and respond quickly. ## 🛠 Troubleshooting ### Why hasn't the actor discovered any emails, even though I can clearly see one on the website? Some webmasters use services like Cloudflare to protect their websites. These services can hide email addresses from crawlers like ours. That's expected behavior, and we currently can't bypass it. Still, feel free to share your case with us: we're happy to take a look and help if we can. ### Why hasn't the actor discovered any pages? There are a few possible reasons: - Different subdomain: the actor treats each subdomain as a separate website. It won't follow links from domain.com to second.domain.com. If the content you need is on another subdomain, include a direct link to it in the starting URL. The only exception is www.: the actor treats www.example.com and example.com as the same. - JavaScript-rendered website: if the page loads content using JavaScript, the actor won't see it because it doesn't support JavaScript rendering at the moment. - Blocked or missing links: if no internal links are found on the starting URL, the actor can't proceed to other pages. ## ⭐ Premium support Not sure if our actor fits your case? The developer is around to help. You can reach out directly on Telegram: https://t.me/nikmadebeykin ## 📒 Supported Contact Types The actor detects and extracts the following types of contact information: ### Basic Contact Information - Email addresses. - Phone numbers (outputs E164 standard). ### Social Media Platforms - Discord. - Facebook. - Instagram. - LinkedIn. - Pinterest. - Reddit. - Snapchat. - Telegram. - TikTok. - Twitch. - Twitter/X. - YouTube. ### Business Listings and Reviews - Google Maps. - TripAdvisor. - Trustpilot. - Yelp. ### Messaging Platforms - WhatsApp. ## 🏗 Output data format Default output dataset will include a table with the following rows: { "domain": { "label": "Domain", "format": "text" }, "emails": { "label": "Emails", "format": "array" }, "phones": { "label": "Phone Numbers", "format": "array" }, "possible_phones": { "label": "Possible Phone Numbers", "format": "array" }, "discord_urls": { "label": "Discord", "format": "array" }, "facebook_urls": { "label": "Facebook", "format": "array" }, "google_maps_urls": { "label": "Google Maps", "format": "array" }, "instagram_urls": { "label": "Instagram", "format": "array" }, "linkedin_urls": { "label": "LinkedIn", "format": "array" }, "pinterest_urls": { "label": "Pinterest", "format": "array" }, "reddit_urls": { "label": "Reddit", "format": "array" }, "snapchat_urls": { "label": "Snapchat", "format": "array" }, "telegram_urls": { "label": "Telegram", "format": "array" }, "tiktok_urls": { "label": "TikTok", "format": "array" }, "tripadvisor_urls": { "label": "Tripadvisor", "format": "array" }, "trustpilot_urls": { "label": "Trustpilot", "format": "array" }, "twitch_urls": { "label": "Twitch", "format": "array" }, "whatsapp_urls": { "label": "WhatsApp", "format": "array" }, "twitter_urls": { "label": "Twitter", "format": "array" }, "yelp_urls": { "label": "Yelp", "format": "array" }, "youtube_urls": { "label": "YouTube", "format": "array" }, "pages_crawled": { "label": "Source pages", "format": "number" }, "max_depth_reached": { "label": "Depth reached", "format": "number" }, "start_urls": { "label": "Starting URLs", "format": "array" } } ## ⚖️ Legal Consider the legal implications of scraping contact information and verify the legality of scraping each target website. Ensure compliance with applicable laws and website terms of service before using this tool. The authors assume no liability for improper use.

Common Use Cases

Market Research

Gather competitive intelligence and market data

Lead Generation

Extract contact information for sales outreach

Price Monitoring

Track competitor pricing and product changes

Content Aggregation

Collect and organize content from multiple sources

Ready to Get Started?

Try Website Contact & Socials Extractor now on Apify. Free tier available with no credit card required.

Start Free Trial

Actor Information

Developer
embion
Pricing
Paid
Total Runs
29,437
Active Users
650
Apify Platform

Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.

Learn more about Apify

Need Professional Help?

Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.

Find a Specialist

Trusted by millions | Money-back guarantee | 24/7 Support