Y Combinator Scraper
by shahidirfan
Discover the Y Combinator Scraper, an efficient actor for scraping the YC companies directory. Easily extract detailed profiles, founder info, and bat...
Opens on Apify.com
About Y Combinator Scraper
Discover the Y Combinator Scraper, an efficient actor for scraping the YC companies directory. Easily extract detailed profiles, founder info, and batch data. Ideal for market research, lead generation, or investment analysis. For best results and to avoid blocks, residential proxies are best.
What does this actor do?
Y Combinator Scraper is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.
Key Features
- Cloud-based execution - no local setup required
- Scalable infrastructure for large-scale operations
- API access for integration with your applications
- Built-in proxy rotation and anti-blocking measures
- Scheduled runs and webhooks for automation
How to Use
- Click "Try This Actor" to open it on Apify
- Create a free Apify account if you don't have one
- Configure the input parameters as needed
- Run the actor and download your results
Documentation
Y Combinator Companies Scraper
## Overview Discover and extract comprehensive data from the Y Combinator companies directory with this powerful scraper. Whether you're conducting market research, analyzing startup trends, or scouting talent, this tool provides structured access to YC-funded companies, their founders, and open job opportunities. Perfect for investors, recruiters, and analysts seeking insights into the startup ecosystem. Keywords: Y Combinator scraper, YC companies scraper, startup directory scraper, scrape Y Combinator companies, YC batch scraper, founder data extraction, job listings scraper. ## Features - Complete Company Profiles: Extract detailed information about Y Combinator-backed startups, including company names, descriptions, locations, funding batches, and status. - Founder Insights: Optionally scrape founder details with names and social media links for deeper analysis. - Job Market Intelligence: Access open positions at YC companies, including job titles, descriptions, locations, and salary ranges. - Batch-Specific Scraping: Target specific Y Combinator batches (e.g., Summer 2025, Winter 2024) or scrape the entire directory. - Flexible Data Collection: Customize what data to collect based on your research needs. - Reliable Extraction: Built-in mechanisms to handle dynamic content and ensure comprehensive data retrieval. - Structured JSON Output: Clean, consistent data format ready for analysis or integration. ## How It Works This scraper navigates the Y Combinator companies directory, systematically collecting data from company profiles. It handles pagination automatically and can visit individual company pages for additional details like founders and jobs. The process is optimized for efficiency while respecting website guidelines. ## Input Parameters Configure the scraper using the following parameters:
| Parameter | Type | Description | Default |
|---|---|---|---|
url | string | Specific Y Combinator directory URL to start scraping (e.g., https://www.ycombinator.com/companies?batch=Summer%202025). Overrides default if provided. | - |
scrape_all_companies | boolean | Scrape companies from the entire directory if true, or use provided URL/default page if false. | false |
scrape_founders | boolean | Extract founder information from individual company pages. | true |
scrape_open_jobs | boolean | Extract open job listings from company pages. | true |
results_wanted | integer | Maximum number of companies to collect. Use high number for all available. | 100 |
max_pages | integer | Maximum directory pages to visit as a safety limit. | 20 |
proxyConfiguration | object | Proxy settings for reliable scraping. Use Apify Proxy for best results. | {"useApifyProxy": true, "apifyProxyGroups": ["RESIDENTIAL"]} |
company_image: Company logo URL - company_id: Unique company identifier - company_name: Official company name - url: Y Combinator profile URL - short_description: Brief company overview - long_description: Detailed company description - batch: YC batch (e.g., "S25", "W24") - status: Company status (Active, Acquired, etc.) - tags: Industry/technology tags array - company_location: Geographic location - year_founded: Founding year - team_size: Number of team members - primary_partner: Associated YC partner - website: Official website URL - company_linkedin: LinkedIn profile - company_x: X (Twitter) profile ### Founders Data (when enabled) - founders: Array of founder objects with id, name, linkedin, x ### Jobs Data (when enabled) - open_jobs: Array of job objects with id, title, description_url, description, location, salary, years_experience ### Sample Output Record json { "company_image": "https://example.com/logo.png", "company_id": "company-123", "company_name": "Example Startup", "url": "https://www.ycombinator.com/companies/example-startup", "short_description": "Building the future of AI", "long_description": "Example Startup is revolutionizing artificial intelligence with cutting-edge machine learning solutions.", "batch": "S25", "status": "Active", "tags": ["AI", "Machine Learning", "SaaS"], "company_location": "San Francisco, CA", "year_founded": "2023", "team_size": "15", "primary_partner": "John Doe", "website": "https://examplestartup.com", "company_linkedin": "https://linkedin.com/company/example-startup", "company_x": "https://x.com/example_startup", "founders": [ { "id": "founder-1", "name": "Jane Smith", "linkedin": "https://linkedin.com/in/jane-smith", "x": "https://x.com/janesmith" } ], "open_jobs": [ { "id": "job-456", "title": "Senior Software Engineer", "description_url": "https://www.ycombinator.com/companies/example-startup/jobs/senior-software-engineer", "description": "We are looking for a talented software engineer to join our growing team...", "location": "Remote", "salary": "$150k - $200k", "years_experience": "5+" } ] } ## Usage ### Running on Apify Platform 1. Navigate to the Y Combinator Companies Scraper on Apify. 2. Click "Run" to start the actor. 3. Configure input parameters in the form. 4. Monitor progress and view results in the dataset. ### API Usage Use the Apify API to run the scraper programmatically: bash curl -X POST "https://api.apify.com/v2/acts/shahidirfan100~ycombinator-companies-scraper/runs?token=YOUR_API_TOKEN" \ -H "Content-Type: application/json" \ -d '{ "scrape_all_companies": true, "scrape_founders": true, "results_wanted": 500 }' ### Configuration Examples Scrape Specific Batch with Founders
json { "url": "https://www.ycombinator.com/companies?batch=Summer%202025", "scrape_founders": true, "scrape_open_jobs": false, "results_wanted": 50 } Comprehensive Directory Scan
json { "scrape_all_companies": true, "scrape_founders": true, "scrape_open_jobs": true, "results_wanted": 1000, "max_pages": 50 } Quick Company Overview
json { "scrape_founders": false, "scrape_open_jobs": false, "results_wanted": 100 } Categories
Common Use Cases
Market Research
Gather competitive intelligence and market data
Lead Generation
Extract contact information for sales outreach
Price Monitoring
Track competitor pricing and product changes
Content Aggregation
Collect and organize content from multiple sources
Ready to Get Started?
Try Y Combinator Scraper now on Apify. Free tier available with no credit card required.
Start Free TrialActor Information
- Developer
- shahidirfan
- Pricing
- Paid
- Total Runs
- 60
- Active Users
- 10
Related Actors
🏯 Tweet Scraper V2 - X / Twitter Scraper
by apidojo
Google Search Results Scraper
by apify
Instagram Profile Scraper
by apify
Tweet Scraper|$0.25/1K Tweets | Pay-Per Result | No Rate Limits
by kaitoeasyapi
Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.
Learn more about ApifyNeed Professional Help?
Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.
Trusted by millions | Money-back guarantee | 24/7 Support