Reddit Scraper

by macrocosmos

Scrape Reddit data by URL, subreddit, keyword, or username.

385,918 runs
41 users

About Reddit Scraper

What does this actor do?

Reddit Scraper is an actor on the Apify platform that extracts posts and comments from public Reddit content. It runs in the cloud, so no local setup is required.

Key Features

  • Cloud-based execution - no local setup required
  • Scalable infrastructure for large-scale operations
  • API access for integration with your applications
  • Built-in proxy rotation and anti-blocking measures
  • Scheduled runs and webhooks for automation

How to Use

  1. Click "Try This Actor" to open it on Apify
  2. Create a free Apify account if you don't have one
  3. Configure the input parameters as needed
  4. Run the actor and download your results
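
Step 3 amounts to assembling a small JSON input. As a minimal sketch, the helper below builds a subreddit-mode input and checks it against the limits stated in this listing's documentation (`subreddits` required, `limit` between 1 and 100 with a default of 10, `sort` one of "new", "hot", "top", optional `keyword`). The function name `build_subreddit_input` and the stripping of a leading `r/` are assumptions made for illustration; they are not part of the actor.

```python
import json

# Sort orders listed in the actor's documentation.
ALLOWED_SORTS = ("new", "hot", "top")


def build_subreddit_input(subreddits, limit=10, sort="new", keyword=None):
    """Assemble a subreddit-mode input dict (hypothetical helper)."""
    if isinstance(subreddits, str):
        raise TypeError("subreddits must be a list of names, not a single string")

    # The documentation asks for names without the "r/" prefix. Stripping it
    # here is a convenience of this sketch, not documented actor behaviour.
    names = [name.strip().removeprefix("r/") for name in subreddits]
    names = [name for name in names if name]
    if not names:
        raise ValueError("subreddits is required and must not be empty")

    if not 1 <= limit <= 100:
        raise ValueError("limit must be between 1 and 100")
    if sort not in ALLOWED_SORTS:
        raise ValueError(f"sort must be one of {ALLOWED_SORTS}")

    run_input = {"subreddits": names, "limit": limit, "sort": sort}
    if keyword:
        # keyword is optional, so it is only included when provided.
        run_input["keyword"] = keyword
    return run_input


if __name__ == "__main__":
    payload = build_subreddit_input(["r/python", "MachineLearning"], limit=25)
    print(json.dumps(payload, indent=2))
```

The printed JSON can be pasted into the actor's input editor. Validating locally first catches an out-of-range `limit` or a misspelled `sort` before a paid run is started.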

Documentation

# Reddit Scraper

Fast and reliable Reddit scraper that extracts posts and comments from any public subreddit.

## 🚀 Key Features

- Simple Setup - Start scraping immediately with minimal configuration
- Fast & Efficient - Optimized for speed and reliability
- Rich Data - Extracts titles, text, scores, media, and engagement metrics
- Flexible Search - Browse subreddits, search by keyword, or scrape specific URLs
- Media Extraction - Automatically captures images and videos from posts

## 📋 What Data Can You Extract?

Each Reddit post/comment includes:

| Field | Description |
|-------|-------------|
| url | Permanent Reddit URL |
| id | Reddit unique identifier |
| title | Post title (posts only) |
| body | Text content |
| username | Author username |
| communityName | Subreddit name (e.g., r/python) |
| dataType | Content type: "post" or "comment" |
| score | Upvotes minus downvotes |
| upvote_ratio | Ratio of upvotes (0-1) |
| num_comments | Number of comments (posts only) |
| createdAt | ISO 8601 timestamp |
| isNsfw | NSFW flag |
| media | Array of image/video URLs (if present) |

## 🎯 Use Cases

- Market Research - Monitor discussions about products, brands, or topics
- Sentiment Analysis - Analyze community opinions and trends
- Content Aggregation - Collect posts for newsletters or content curation
- Academic Research - Gather data for social media studies
- Trend Detection - Track emerging topics and viral content

## 💡 Input Configuration

### Subreddit Scraping

Scrape posts from specific subreddits:

```json
{
  "subreddits": ["python", "MachineLearning", "datascience"],
  "limit": 25,
  "sort": "new"
}
```

Parameters:

- `subreddits` (array, required) - List of subreddit names (without r/ prefix)
- `limit` (number, optional) - Posts per subreddit (1-100, default: 10)
- `sort` (string, optional) - Sort order: "new", "hot", "top" (default: "new")
- `keyword` (string, optional) - Filter posts containing this keyword

### Single URL Scraping

Scrape a specific post or comment by URL:

```json
{
  "url": "https://www.reddit.com/r/python/comments/abc123/title/"
}
```

Parameters:

- `url` (string, required) - Full Reddit post or comment URL

## 📊 Output Example

```json
[
  {
    "url": "https://www.reddit.com/r/bittensor_/comments/1or4vcv/james_altucher_on_bittensor/",
    "id": "t3_1or4vcv",
    "title": "James Altucher on Bittensor: The Only Tech That Matters Since Bitcoin",
    "body": "# Who Is James Altucher\n\nJames Altucher is not a typical crypto influencer...",
    "username": "Internal-Patience533",
    "communityName": "r/bittensor_",
    "dataType": "post",
    "parentId": null,
    "score": 29,
    "upvote_ratio": 1.0,
    "num_comments": 3,
    "createdAt": "2025-11-07T20:10:31Z",
    "isNsfw": false,
    "media": ["https://i.redd.it/2iyvqtsu6wzf1.png"]
  }
]
```

## 🔧 Advanced Usage

### Keyword Filtering

Search for posts containing specific keywords:

```json
{
  "subreddits": ["technology"],
  "keyword": "artificial intelligence",
  "limit": 50,
  "sort": "top"
}
```

### Multiple Subreddits

Scrape from multiple communities in one run:

```json
{
  "subreddits": ["python", "javascript", "golang", "rust"],
  "limit": 20,
  "sort": "hot"
}
```

## 📈 Performance

- Reliability: Automatic retry on failures
- Proxy Support: Built-in residential proxy rotation

## ❓ Common Questions

Q: Can I scrape private subreddits?
A: No, only public subreddits and posts are accessible.

Q: What about rate limits?
A: The actor uses residential proxies to avoid rate limiting issues.

Q: Can I scrape comments?
A: Yes. The actor extracts posts and can also scrape individual comments via direct URL.

Q: How fresh is the data?
A: Data is fetched in real-time directly from Reddit.

---

Built with ❤️ by Macrocosmos

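The records described in the documentation are plain JSON objects, so post-processing needs nothing beyond the standard library. The sketch below separates posts from comments using `dataType` and sorts by `createdAt`. The first sample record is trimmed from the documented output example; the second (a comment) is invented for illustration, as are the helper names `parse_created_at`, `split_by_type`, and `newest_first`.

```python
import json
from datetime import datetime, timezone

# First record: trimmed from the documented output example.
# Second record: a made-up comment, included only to exercise the code.
SAMPLE_JSON = """
[
  {"id": "t3_1or4vcv", "dataType": "post", "communityName": "r/bittensor_",
   "score": 29, "num_comments": 3, "createdAt": "2025-11-07T20:10:31Z"},
  {"id": "t1_example", "dataType": "comment", "communityName": "r/bittensor_",
   "score": 4, "createdAt": "2025-11-07T21:00:00Z"}
]
"""


def parse_created_at(value):
    """Parse the ISO 8601 `createdAt` string into a UTC-aware datetime."""
    # Older Python versions reject a trailing "Z" in fromisoformat,
    # so it is rewritten as an explicit UTC offset first.
    parsed = datetime.fromisoformat(value.replace("Z", "+00:00"))
    return parsed.astimezone(timezone.utc)


def split_by_type(records):
    """Return (posts, comments) based on the `dataType` field."""
    posts = [r for r in records if r.get("dataType") == "post"]
    comments = [r for r in records if r.get("dataType") == "comment"]
    return posts, comments


def newest_first(records):
    """Sort records by `createdAt`, most recent first."""
    return sorted(
        records,
        key=lambda r: parse_created_at(r["createdAt"]),
        reverse=True,
    )


if __name__ == "__main__":
    records = json.loads(SAMPLE_JSON)
    posts, comments = split_by_type(records)
    print(f"{len(posts)} post(s), {len(comments)} comment(s)")
    for record in newest_first(records):
        print(record["createdAt"], record["dataType"], record["id"])
```

Filtering on `dataType` before reading `title` or `num_comments` matters because the field table marks those two fields as posts only.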
Common Use Cases

Market Research

Gather competitive intelligence and market data

Lead Generation

Extract contact information for sales outreach

Price Monitoring

Track competitor pricing and product changes

Content Aggregation

Collect and organize content from multiple sources

Ready to Get Started?

Try Reddit Scraper now on Apify. Free tier available with no credit card required.

Actor Information

Developer: macrocosmos
Pricing: Paid
Total Runs: 385,918
Active Users: 41

Apify Platform

Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.
