Github Scraper | $2 / 1k | All In One

Name: Github Scraper | $2 / 1k | All In One
Author: fatihtahta

by fatihtahta

Scrape GitHub at real scale with no cap. Get the richest data on repos, issues, PRs, users and orgs including stars, forks, topics, tech stack, users,...

32 runs

3 users

Try This Actor

Opens on Apify.com

About Github Scraper | $2 / 1k | All In One

Scrape GitHub at real scale with no cap. Get the richest data on repos, issues, PRs, users and orgs including stars, forks, topics, tech stack, users, owners and more. Great for market intel, dev products, lead lists, talent scouting and big, clean datasets.

What does this actor do?

Github Scraper | $2 / 1k | All In One is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.

Key Features

Cloud-based execution - no local setup required
Scalable infrastructure for large-scale operations
API access for integration with your applications
Built-in proxy rotation and anti-blocking measures
Scheduled runs and webhooks for automation

How to Use

Click "Try This Actor" to open it on Apify
Create a free Apify account if you don't have one
Configure the input parameters as needed
Run the actor and download your results

Documentation

Github Scraper | $2 / 1k | All In One ## Overview Github.com hosts millions of repositories, issues, pull requests, discussions, users, and packages that reflect real development activity and technology adoption. Github Scraper | $2 / 1k | All In One captures this public data at scale so you can monitor projects, communities, and marketplaces without manual browsing. Configure searches or paste direct URLs, and the actor automates collection with consistent, reliable results to save time on every run. ## Why Use This Actor - Market and product research: Track trending repositories, forks, stars, and topics to understand technology momentum and competitor movement. - Lead and talent discovery: Surface active maintainers, organizations, and contributors for outreach or partnership research. - Directory and dataset building: Export structured lists of repos, issues, pull requests, discussions, users, commits, and marketplace listings for enrichment or analytics. - Operational efficiency: Schedule repeat runs, keep data fresh, and avoid repetitive manual checks on Github. ## Input Parameters | Parameter | Type | Description | Default | | --- | --- | --- | --- | | `startUrls` | array of strings | Fully formed Github URLs for search results, repositories, issues, pull requests, discussions, users/organizations, or marketplace listings. Each URL is crawled exactly as provided. | Prefilled example provided | | `queries` | array of strings | Keyword queries turned into Github search URLs using your selected result type and filters. | — | | `searchType` | string | Github search vertical for query-based runs: repositories, issues, pull requests, discussions, users, commits, registry packages, wikis, topics, or marketplace listings. | `repositories` | | `language` | string | Limit results to a specific programming language. | — | | `owner` | string | Restrict matches to a single user or organization namespace. | — | | `followers` | string | Filter users or organizations by follower counts using numbers, ranges, or comparisons. | — | | `forks` | string | Filter repositories by fork counts using numbers, ranges, or comparisons. | — | | `stars` | string | Target repositories by star counts using numbers, ranges, or comparisons. | — | | `topic` | string | Require repositories to include a specific topic tag. | — | | `license` | string | Filter repositories by license keyword or SPDX identifier. | — | | `created` | string | Filter repositories by creation date using exact dates, ranges, or comparisons. | — | | `pushed` | string | Filter repositories by most recent commit date using exact dates, ranges, or comparisons. | — | | `size` | string | Filter repositories by repository size in kilobytes using numbers, ranges, or comparisons. | — | | `limit` | integer | Maximum number of listings to save across all inputs. | `50000` | | `proxyConfiguration` | object | Configure Apify proxy settings to distribute requests and keep networking stable. Residential proxies are preselected. | `{"useApifyProxy": true, "apifyProxyGroups": ["RESIDENTIAL"]}` | ## Example Input `json { "queries": ["apify scraping"], "searchType": "repositories", "stars": ">100", "language": "JavaScript", "limit": 200 }` ## Example Output Each dataset item represents one Github result with key descriptive fields. json { "archived": false, "brand": "vlang", "color": "#4f87c4", "description": "Simple, fast, safe, compiled language for developing maintainable software. Compiles itself in <1s with zero library dependencies. Suppor…", "followers": 37115, "good_first_issue_issues_count": 1, "has_funding_file": true, "has_issues": true, "help_wanted_issues_count": 0, "id": "169677297", "language": "V", "mirror": false, "owner_id": 46413578, "owner_login": "vlang", "owned_by_organization": true, "public": true, "repo_id": 169677297, "repo_name": "v", "sponsorable": false, "starred_by_current_user": false, "title": "vlang/ v", "topics": [ "language", "programming-language", "compiler", "v" ], "type": "Public", "updated_at": "2025-12-10T14:41:12.039Z", "url": "https://github.com/vlang/v" } - `archived` — Whether the repository is archived. - `brand`, `color` — Project branding details when available. - `description` — Repository summary from Github. - `followers`, `stars`, `forks` (when present) — Popularity and engagement indicators. - `owner_` and `repo_` fields — Unique identifiers and ownership metadata. - `topics` — Tagged topics for the repository. - `updated_at` — ISO timestamp of the latest observed update. - `url` — Direct link to the Github page captured. ## Notes & Limitations - Use this actor responsibly and only for lawful purposes. Review and respect Github’s terms of service and any applicable policies before collecting or using data. - Public data may include personal information; ensure you have a legal basis to process it in your jurisdiction. - Start with moderate limits when testing new queries or URLs to keep runs efficient. ## Support Questions or custom needs? Open an issue on the Issues tab of the actor page in Apify Console and it will be resolved around the clock. Happy Scraping, - Fatih

Common Use Cases

Market Research

Gather competitive intelligence and market data

Lead Generation

Extract contact information for sales outreach

Price Monitoring

Track competitor pricing and product changes

Content Aggregation

Collect and organize content from multiple sources

Ready to Get Started?

Try Github Scraper | $2 / 1k | All In One now on Apify. Free tier available with no credit card required.

Start Free Trial

Actor Information

Developer: fatihtahta
Pricing: Paid
Total Runs: 32
Active Users: 3

Related Actors

Web Scraper

by apify

Cheerio Scraper

by apify

Website Content Crawler

by apify

Legacy PhantomJS Crawler

by apify

Browse All Actors

Apify Platform

Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.

Learn more about Apify

Need Professional Help?

Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.

Find a Specialist

Trusted by millions | Money-back guarantee | 24/7 Support

Github Scraper | $2 / 1k | All In One

About Github Scraper | $2 / 1k | All In One

What does this actor do?

Key Features

How to Use

Documentation

Categories

Common Use Cases

Market Research

Lead Generation

Price Monitoring

Content Aggregation

Ready to Get Started?

Actor Information

Related Actors

Need Professional Help?