Indeed Comprehensive Crawler
by memo23
Struggling to get the full picture from Indeed job postings? Most scrapers just grab the basic listing, but you know there's more to the story—salary ...
Opens on Apify.com
About Indeed Comprehensive Crawler
Struggling to get the full picture from Indeed job postings? Most scrapers just grab the basic listing, but you know there's more to the story—salary insights, company reviews, specific application criteria. That's why I built this crawler to go deeper. It systematically extracts the complete dataset from Indeed: job titles, descriptions, company names, locations, and salaries. But it also pulls the often-missed details like employee review snippets, remote/work-from-home tags, and direct links to the company's Indeed profile. You get everything in a structured format (JSON, CSV, Excel) that's ready for your database or analysis. I use it for two main things. First, for competitive intelligence and market research—tracking what skills are in demand, what companies are hiring for, and what salary bands they're offering in real time. Second, for lead generation; it's perfect for recruiters or sales teams building targeted lists of companies that are actively hiring and growing. The setup is straightforward. You provide search queries, locations, and filters just like you would on Indeed.com. Then, let it run. It handles the pagination, avoids blocks with smart request patterns, and compiles all the data into a single, clean dataset. Stop piecing together incomplete information. Get the comprehensive Indeed data you actually need to make decisions.
What does this actor do?
Indeed Comprehensive Crawler is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.
Key Features
- Cloud-based execution - no local setup required
- Scalable infrastructure for large-scale operations
- API access for integration with your applications
- Built-in proxy rotation and anti-blocking measures
- Scheduled runs and webhooks for automation
How to Use
- Click "Try This Actor" to open it on Apify
- Create a free Apify account if you don't have one
- Configure the input parameters as needed
- Run the actor and download your results
Documentation
Overview The Ultimate Indeed Scraper is a powerful tool designed for the Apify platform, tailored to bypass protections and efficiently extract job listing information directly from Indeed. It's engineered to navigate and parse data from Indeed's job listings and company profiles, offering users a seamless experience in collecting valuable job market insights. ## Features - Customizable Searches: Input URLs directly from Indeed to target specific job searches or categories. - Comprehensive Data Extraction: Gathers detailed information from job listings, including job titles, company names, locations, salary estimates, job descriptions, and more. - Company Intelligence: - CEO information with photos - Company founding year and revenue - Employee size range and industry classification - Active job categories and locations ## How to Use 1. Set Up: Ensure you have an Apify account and access to the Indeed Scraper Browser Automation actor. 2. Input Configuration: - Start URLs: Enter your target Indeed job listing URLs. For example: https://de.indeed.com/jobs?q=medizinische+Fachangestellte&sc=0bf%3Aexrec()%2Ck - Max jobs to scrape (optional): Set the maximum number of job listings to scrape. Default is 20. - Enable company details scraping (new option) 3. Proxy Configuration: Configure proxy settings for enhanced reliability and to avoid IP blocks. 4. Run the Scraper: Launch the scraper on the Apify platform. 5. Data Collection: Extracted data will be available in your preferred format (JSON, CSV, etc.). ## Input Example json { "startUrls": [ { "url": "https://de.indeed.com/jobs?q=medizinische+Fachangestellte&sc=0bf%3Aexrec()%2Ck" } ], "includeCompanyDetails": true, "maxJobsToScrape": 20, "proxyConfiguration": { "useApifyProxy": true } } ## Output Structure The scraper's output includes detailed information for each job listing. Here's a sample output with explanations for each field: json { "source_link": "", // Original source link if the job is aggregated from another site "adBlob": "...", // Encoded string containing additional job information "adId": "433403349", // Unique identifier for the advertisement "additionalLocationLink": { "label": "+3 Orte", // Label for additional locations "url": "..." // URL to view jobs in additional locations }, "advn": "625754421564269", // Advertiser identification number "company": "Ober Scharrer Gruppe GmbH'", // Name of the company offering the job "companyRating": 0, // Company rating on Indeed (if available) "companyReviewCount": 0, // Number of company reviews on Indeed "createDate": 1721054703000, // Timestamp of when the job was created "displayTitle": "MFA / Medizinische Fachangestellte (w/m/d) für den OP", // Job title as displayed "formattedLocation": "76532 Baden-Baden", // Formatted job location "formattedRelativeTime": "vor 30+ Tagen", // Relative time since job was posted "jobkey": "ea2f2dbc75f8a438", // Unique identifier for the job listing "jobTypes": [ // Types of employment offered "Vollzeit", "Teilzeit" ], "locationCount": 4, // Number of locations this job is available in "pubDate": 1721019600000, // Timestamp of when the job was published "salarySnippet": { // Salary information (if available) "currency": "EUR", "salaryTextFormatted": false }, "snippet": "...", // Brief excerpt or summary of the job description "title": "MFA / Medizinische Fachangestellte (w/m/d) für den OP" // Job title } ## Field Explanations - source_link: If the job is aggregated from another site, this field contains the original link. - adBlob: An encoded string that may contain additional job information. - adId: A unique identifier for the job advertisement. - additionalLocationLink: Information about other locations where this job is available. - advn: The advertiser's identification number. - company: The name of the company offering the job. - companyRating: The company's rating on Indeed, if available. - companyReviewCount: The number of reviews the company has on Indeed. - createDate: The timestamp when the job listing was created. - displayTitle: The job title as it's displayed on the listing. - formattedLocation: The job's location in a formatted string. - formattedRelativeTime: How long ago the job was posted, in a human-readable format. - jobkey: A unique identifier for the specific job listing. - jobTypes: An array of job types (e.g., full-time, part-time) offered for this position. - locationCount: The number of locations where this job is available. - pubDate: The timestamp when the job was published. - salarySnippet: Information about the salary, if provided. - snippet: A brief excerpt or summary of the job description. - title: The title of the job position. ## 📋 Enhanced Output Structure ### Company Details Output (New Section) json "companyDetails": { "aboutSectionViewModel": { "aboutCeo": { "name": "Sophie Bellon", "photoUrls": { "48px": "https://...photo48.jpg", "96px": "https://...photo96.jpg", "512px": "https://...photo512.jpg" } }, "aboutCompany": { "founded": 1966, "revenue": "RRv1_OVER_10B", "employeeRange": "ERv1_10000_PLUS", "industry": "Catering & Verpflegungsdienstleistungen", "description": "Die Sodexo Group...", "headquarters": "Issy-les-Moulineaux" } }, "jobsSectionViewModel": { "totalJobCount": 65, "jobCategories": [ { "displayName": "Reinigungsdienste", "jobCount": 47, "key": "sanitation" } ], "activeListings": [ { "title": "Reinigungskraft", "location": "Wien, W", "jobTypes": ["Teilzeit"], "postDate": "vor 30+ Tagen" } ] } } ### New Output Fields Explanation - CEO Details: - Name and multiple resolution photos - Executive leadership insights - Company Metadata: - Founding year and revenue bracket - Employee count range and industry classification - Physical headquarters location - Organizational Insights: - Job distribution by category - Geographic presence analysis - Active job postings catalog ## Benefits (Enhanced) - Competitive Analysis: Compare company hiring patterns and organizational structures - Talent Acquisition: Identify companies with high hiring activity in specific sectors - Market Research: Analyze industry distributions and company growth indicators - Leadership Networking: Access CEO and executive team information ## Support - For issues or feature requests, please use the Issues section of this actor. - If you need customization or have questions, feel free to contact the author: - Author's website: https://muhamed-didovic.github.io/ - Email: muhamed.didovic@gmail.com - My Apify Actors/Scrapers: https://apify.com/memo23 ## Additional Services - Request customization or whole dataset: muhamed.didovic@gmail.com - If you need anything else scraped, or this actor customized, email: muhamed.didovic@gmail.com - For API services of this scraper (no Apify fee, just usage fee for the API), contact: muhamed.didovic@gmail.com
Categories
Common Use Cases
Market Research
Gather competitive intelligence and market data
Lead Generation
Extract contact information for sales outreach
Price Monitoring
Track competitor pricing and product changes
Content Aggregation
Collect and organize content from multiple sources
Ready to Get Started?
Try Indeed Comprehensive Crawler now on Apify. Free tier available with no credit card required.
Start Free TrialActor Information
- Developer
- memo23
- Pricing
- Paid
- Total Runs
- 20,275
- Active Users
- 161
Related Actors
Company Employees Scraper
by build_matrix
🔥 LinkedIn Jobs Scraper
by bebity
Linkedin Company Detail (No Cookies)
by apimaestro
Linkedin Profile Details Batch Scraper + EMAIL (No Cookies)
by apimaestro
Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.
Learn more about ApifyNeed Professional Help?
Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.
Trusted by millions | Money-back guarantee | 24/7 Support