Github Scraper | $2 / 1k | All In One
by fatihtahta
Scrape GitHub at real scale with no cap. Get the richest data on repos, issues, PRs, users and orgs including stars, forks, topics, tech stack, users,...
Opens on Apify.com
About Github Scraper | $2 / 1k | All In One
Scrape GitHub at real scale with no cap. Get the richest data on repos, issues, PRs, users and orgs including stars, forks, topics, tech stack, users, owners and more. Great for market intel, dev products, lead lists, talent scouting and big, clean datasets.
What does this actor do?
Github Scraper | $2 / 1k | All In One is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.
Key Features
- Cloud-based execution - no local setup required
- Scalable infrastructure for large-scale operations
- API access for integration with your applications
- Built-in proxy rotation and anti-blocking measures
- Scheduled runs and webhooks for automation
How to Use
- Click "Try This Actor" to open it on Apify
- Create a free Apify account if you don't have one
- Configure the input parameters as needed
- Run the actor and download your results
Documentation
Github Scraper | $2 / 1k | All In One ## Overview Github.com hosts millions of repositories, issues, pull requests, discussions, users, and packages that reflect real development activity and technology adoption. Github Scraper | $2 / 1k | All In One captures this public data at scale so you can monitor projects, communities, and marketplaces without manual browsing. Configure searches or paste direct URLs, and the actor automates collection with consistent, reliable results to save time on every run. ## Why Use This Actor - Market and product research: Track trending repositories, forks, stars, and topics to understand technology momentum and competitor movement. - Lead and talent discovery: Surface active maintainers, organizations, and contributors for outreach or partnership research. - Directory and dataset building: Export structured lists of repos, issues, pull requests, discussions, users, commits, and marketplace listings for enrichment or analytics. - Operational efficiency: Schedule repeat runs, keep data fresh, and avoid repetitive manual checks on Github. ## Input Parameters | Parameter | Type | Description | Default | | --- | --- | --- | --- | | startUrls | array of strings | Fully formed Github URLs for search results, repositories, issues, pull requests, discussions, users/organizations, or marketplace listings. Each URL is crawled exactly as provided. | Prefilled example provided | | queries | array of strings | Keyword queries turned into Github search URLs using your selected result type and filters. | — | | searchType | string | Github search vertical for query-based runs: repositories, issues, pull requests, discussions, users, commits, registry packages, wikis, topics, or marketplace listings. | repositories | | language | string | Limit results to a specific programming language. | — | | owner | string | Restrict matches to a single user or organization namespace. | — | | followers | string | Filter users or organizations by follower counts using numbers, ranges, or comparisons. | — | | forks | string | Filter repositories by fork counts using numbers, ranges, or comparisons. | — | | stars | string | Target repositories by star counts using numbers, ranges, or comparisons. | — | | topic | string | Require repositories to include a specific topic tag. | — | | license | string | Filter repositories by license keyword or SPDX identifier. | — | | created | string | Filter repositories by creation date using exact dates, ranges, or comparisons. | — | | pushed | string | Filter repositories by most recent commit date using exact dates, ranges, or comparisons. | — | | size | string | Filter repositories by repository size in kilobytes using numbers, ranges, or comparisons. | — | | limit | integer | Maximum number of listings to save across all inputs. | 50000 | | proxyConfiguration | object | Configure Apify proxy settings to distribute requests and keep networking stable. Residential proxies are preselected. | {"useApifyProxy": true, "apifyProxyGroups": ["RESIDENTIAL"]} | ## Example Input json { "queries": ["apify scraping"], "searchType": "repositories", "stars": ">100", "language": "JavaScript", "limit": 200 } ## Example Output Each dataset item represents one Github result with key descriptive fields. json { "archived": false, "brand": "vlang", "color": "#4f87c4", "description": "Simple, fast, safe, compiled language for developing maintainable software. Compiles itself in <1s with zero library dependencies. Suppor…", "followers": 37115, "good_first_issue_issues_count": 1, "has_funding_file": true, "has_issues": true, "help_wanted_issues_count": 0, "id": "169677297", "language": "V", "mirror": false, "owner_id": 46413578, "owner_login": "vlang", "owned_by_organization": true, "public": true, "repo_id": 169677297, "repo_name": "v", "sponsorable": false, "starred_by_current_user": false, "title": "vlang/ v", "topics": [ "language", "programming-language", "compiler", "v" ], "type": "Public", "updated_at": "2025-12-10T14:41:12.039Z", "url": "https://github.com/vlang/v" } - archived — Whether the repository is archived. - brand, color — Project branding details when available. - description — Repository summary from Github. - followers, stars, forks (when present) — Popularity and engagement indicators. - owner_* and repo_* fields — Unique identifiers and ownership metadata. - topics — Tagged topics for the repository. - updated_at — ISO timestamp of the latest observed update. - url — Direct link to the Github page captured. ## Notes & Limitations - Use this actor responsibly and only for lawful purposes. Review and respect Github’s terms of service and any applicable policies before collecting or using data. - Public data may include personal information; ensure you have a legal basis to process it in your jurisdiction. - Start with moderate limits when testing new queries or URLs to keep runs efficient. ## Support Questions or custom needs? Open an issue on the Issues tab of the actor page in Apify Console and it will be resolved around the clock. Happy Scraping, - Fatih
Categories
Common Use Cases
Market Research
Gather competitive intelligence and market data
Lead Generation
Extract contact information for sales outreach
Price Monitoring
Track competitor pricing and product changes
Content Aggregation
Collect and organize content from multiple sources
Ready to Get Started?
Try Github Scraper | $2 / 1k | All In One now on Apify. Free tier available with no credit card required.
Start Free TrialActor Information
- Developer
- fatihtahta
- Pricing
- Paid
- Total Runs
- 32
- Active Users
- 3
Related Actors
Web Scraper
by apify
Cheerio Scraper
by apify
Website Content Crawler
by apify
Legacy PhantomJS Crawler
by apify
Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.
Learn more about ApifyNeed Professional Help?
Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.
Trusted by millions | Money-back guarantee | 24/7 Support