Reddit Posts Scraper
by api-empire
Extract structured Reddit post data at scale. Get titles, scores, authors, dates, and engagement stats for analysis, marketing, or development projects.
About Reddit Posts Scraper
Need to pull clean, structured data from Reddit without getting bogged down in manual scraping? This Reddit Posts Scraper is what I use. It's a straightforward actor that visits any subreddit you point it to and pulls out the core data you actually need: post titles, upvote scores, authors, publication dates, and key engagement stats like comment counts. It is built to run at scale, so you can collect hundreds of posts or threads in one run, with proxy rotation helping to reduce rate limiting and blocking.
I find it's a good fit for a few key jobs. If you're in marketing, it's great for tracking brand mentions or understanding which topics are trending in your niche. Analysts can feed the structured JSON output into dashboards or sentiment analysis tools. For developers, it's a dependable source of data for building datasets, training models, or powering other automation workflows.
You get the raw Reddit insights without having to parse messy HTML or maintain your own scraper. Just configure your target subreddit and let it run.
What does this actor do?
Reddit Posts Scraper is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.
Key Features
- Cloud-based execution - no local setup required
- Scalable infrastructure for large-scale operations
- API access for integration with your applications
- Built-in proxy rotation and anti-blocking measures
- Scheduled runs and webhooks for automation
How to Use
- Click "Try This Actor" to open it on Apify
- Create a free Apify account if you don't have one
- Configure the input parameters as needed
- Run the actor and download your results
Documentation
Overview
This Apify actor scrapes public data from Reddit, including posts, comments, and metadata. It outputs structured data in JSON, CSV, or XLSX formats for use in analysis, research, or integration into other systems. It's designed for developers, researchers, and marketers who need to programmatically collect Reddit data at scale.
Key Features
- Extracts comprehensive data: Captures post titles, bodies, URLs, timestamps, upvotes, authors, subreddit names, top comments, media links, and awards.
- Flexible sourcing: Scrapes from specific subreddits, custom search queries, or direct post URLs.
- Configurable filters: Supports sorting (hot, top, new, controversial), date ranges, and post limits.
- Integration-ready: Outputs clean JSON for easy use with databases, NLP models, dashboards, or APIs. Compatible with automation platforms like n8n, Zapier, and Google Sheets.
- No-code operation: Fully functional via the Apify console, while also providing a developer-friendly interface for custom workflows.
How to Use
- Access the Actor: Log into your Apify account and find the "Reddit Posts Scraper" in the store.
- Configure Input: In the actor's input field, specify your target data using one or more methods:
  - Subreddit names (e.g., technology)
  - Direct post URLs
  - Reddit search queries
- Set Options: Adjust parameters like sort order, post limit, and whether to include timestamps or comments.
- Run: Start the actor. It will process the requests and collect the data.
- Get Results: Download the dataset in your preferred format (JSON, CSV, XLSX) from the Apify dataset storage or access it via API.
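The API route mentioned in the last step can be sketched with nothing but the Python standard library and Apify's `run-sync-get-dataset-items` endpoint, which starts a run and returns the dataset items in one call. This is a minimal sketch under assumptions: the listing does not state the actor's exact ID, so `actor_id` is left as a parameter, and the `{"url": ...}` shape of each `startUrls` entry follows common Apify conventions rather than anything confirmed here. Check the actor's input schema before relying on either.

```python
import json
import urllib.request

API_BASE = "https://api.apify.com/v2"


def build_run_request(actor_id: str, token: str, run_input: dict) -> urllib.request.Request:
    """Build a POST request that runs an actor and returns its dataset items."""
    # In API URLs, Apify writes "username/actor-name" as "username~actor-name".
    url = f"{API_BASE}/acts/{actor_id.replace('/', '~')}/run-sync-get-dataset-items"
    return urllib.request.Request(
        url,
        data=json.dumps(run_input).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


def run_actor(actor_id: str, token: str, run_input: dict, timeout: float = 300) -> list:
    """Send the request and decode the JSON array of scraped posts."""
    request = build_run_request(actor_id, token, run_input)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


# Hypothetical usage; the actor ID and token are placeholders, not real values:
# posts = run_actor(
#     "<username>/<actor-name>",
#     "<YOUR_APIFY_TOKEN>",
#     {"startUrls": [{"url": "https://www.reddit.com/r/technology/"}]},
# )
```

The synchronous endpoint suits short runs; for large scrapes it is usually safer to start the run asynchronously and read the dataset once it finishes.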
Input/Output
Input Configuration:
The actor primarily uses the startUrls field. You can provide:
* Subreddit URLs: https://www.reddit.com/r/SUBREDDIT_NAME/
* Search URLs: https://www.reddit.com/search/?q=YOUR_QUERY
* Direct post URLs: https://www.reddit.com/r/.../comments/...
Additional options in the input schema control sorting, limits, and which data fields to extract.
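To make the three URL patterns above concrete, here is a small sketch that assembles a `startUrls` input from subreddit names, search queries, and post URLs. The helper names are hypothetical, and wrapping each URL as `{"url": ...}` is an assumption based on how many Apify actors accept start URLs. The sorting and limit options are omitted because their field names are not given in this listing.

```python
from urllib.parse import quote, urlencode

REDDIT = "https://www.reddit.com"


def subreddit_url(name: str) -> str:
    """Turn 'technology', 'r/technology', or '/r/technology/' into a subreddit URL."""
    name = name.strip().strip("/").removeprefix("r/")
    return f"{REDDIT}/r/{quote(name)}/"


def search_url(query: str) -> str:
    """Build a Reddit search URL, percent-encoding the query string."""
    return f"{REDDIT}/search/?{urlencode({'q': query})}"


def build_input(subreddits=(), queries=(), post_urls=()) -> dict:
    """Combine all sources into one input object (assumed shape, see lead-in)."""
    urls = [subreddit_url(s) for s in subreddits]
    urls += [search_url(q) for q in queries]
    urls += list(post_urls)  # direct post URLs are passed through unchanged
    return {"startUrls": [{"url": u} for u in urls]}


actor_input = build_input(subreddits=["technology"], queries=["open source"])
```

Keeping URL construction in one place avoids hand-typed encoding mistakes in search queries that contain spaces or punctuation.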
Output Data:
The actor returns an array of items, where each item represents a Reddit post. A typical item includes the following fields in structured JSON:
{
  "title": "Post Title",
  "body": "Full text of the post",
  "url": "https://www.reddit.com/...",
  "timestamp": "2023-10-26T14:30:00.000Z",
  "score": 150,
  "upvotes": 150,
  "author": "username",
  "subreddit": "subredditName",
  "comments": [...],
  "media": [...],
  "awards": [...]
}
This dataset can be downloaded directly or accessed programmatically via the Apify API.
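Once the items are downloaded, a few lines of Python are enough to rank posts or flatten them for a spreadsheet. The sketch below assumes items shaped like the sample above and treats a missing `score` as 0. The nested `comments`, `media`, and `awards` arrays are left out of the CSV because their internal structure is not documented here.

```python
import csv
import io

# Scalar fields taken from the sample item; nested arrays are skipped.
FLAT_FIELDS = ["title", "author", "subreddit", "score", "timestamp", "url"]


def top_posts(items: list, n: int = 10) -> list:
    """Return the n highest-scoring posts, treating a missing score as 0."""
    return sorted(items, key=lambda item: item.get("score") or 0, reverse=True)[:n]


def to_csv(items: list) -> str:
    """Flatten posts into CSV text with one row per post."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FLAT_FIELDS)
    writer.writeheader()
    for item in items:
        writer.writerow({field: item.get(field, "") for field in FLAT_FIELDS})
    return buffer.getvalue()


# Made-up sample data for illustration only.
sample_items = [
    {"title": "First post", "author": "user_a", "subreddit": "technology",
     "score": 42, "timestamp": "2023-10-26T14:30:00.000Z",
     "url": "https://www.reddit.com/..."},
    {"title": "Second post", "author": "user_b", "subreddit": "technology",
     "score": 150, "timestamp": "2023-10-27T09:00:00.000Z",
     "url": "https://www.reddit.com/..."},
]
best = top_posts(sample_items, n=1)
csv_text = to_csv(sample_items)
```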
Common Use Cases
Market Research
Gather competitive intelligence and market data
Lead Generation
Extract contact information for sales outreach
Price Monitoring
Track competitor pricing and product changes
Content Aggregation
Collect and organize content from multiple sources
Ready to Get Started?
Try Reddit Posts Scraper now on Apify. Free tier available with no credit card required.
Actor Information
- Developer: api-empire
- Pricing: Paid
- Total Runs: 20
- Active Users: 3