Reddit Posts Scraper

by api-empire

Extract structured Reddit post data at scale. Get titles, scores, authors, dates, and engagement stats for analysis, marketing, or development projects.

20 runs

3 users

Try This Actor

Opens on Apify.com

About Reddit Posts Scraper

Need to pull clean, structured data from Reddit without getting bogged down in manual scraping? This Reddit Posts Scraper is what I use. It's a straightforward actor that visits any subreddit you point it to and pulls out the core data you actually need: post titles, upvote scores, the author, publication dates, and key engagement stats like comment counts. It runs reliably at scale, so you can collect data from hundreds of posts or threads in one go without hitting rate limits or getting blocked. I find it's perfect for a few key jobs. If you're in marketing, it's great for tracking brand mentions or understanding what topics are trending in your niche. Analysts can use the structured JSON output to feed into dashboards or sentiment analysis tools. For developers, it's a solid, dependable source of data for building datasets, training models, or powering other automation workflows. You get the raw Reddit insights without having to parse messy HTML or maintain your own scraper. Just configure your target subreddit and let it run.

What does this actor do?

Reddit Posts Scraper is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.

Key Features

Cloud-based execution - no local setup required
Scalable infrastructure for large-scale operations
API access for integration with your applications
Built-in proxy rotation and anti-blocking measures
Scheduled runs and webhooks for automation

How to Use

Click "Try This Actor" to open it on Apify
Create a free Apify account if you don't have one
Configure the input parameters as needed
Run the actor and download your results

Documentation

Reddit Posts Scraper

Overview

This Apify actor scrapes public data from Reddit, including posts, comments, and metadata. It outputs structured data in JSON, CSV, or XLSX formats for use in analysis, research, or integration into other systems. It's designed for developers, researchers, and marketers who need to programmatically collect Reddit data at scale.

Key Features

Extracts comprehensive data: Captures post titles, bodies, URLs, timestamps, upvotes, authors, subreddit names, top comments, media links, and awards.
Flexible sourcing: Scrapes from specific subreddits, custom search queries, or direct post URLs.
Configurable filters: Supports sorting (hot, top, new, controversial), date ranges, and post limits.
Integration-ready: Outputs clean JSON for easy use with databases, NLP models, dashboards, or APIs. Compatible with automation platforms like n8n, Zapier, and Google Sheets.
No-code operation: Fully functional via the Apify console, while also providing a developer-friendly interface for custom workflows.

How to Use

Access the Actor: Log into your Apify account and find the "Reddit Posts Scraper" in the store.
Configure Input: In the actor's input field, specify your target data using one or more methods:
- Subreddit names (e.g., technology)
- Direct post URLs
- Reddit search queries
Set Options: Adjust parameters like sort order, post limit, and whether to include timestamps or comments.
Run: Start the actor. It will process the requests and collect the data.
Get Results: Download the dataset in your preferred format (JSON, CSV, XLSX) from the Apify dataset storage or access it via API.

Input/Output

Input Configuration:
The actor primarily uses the startUrls field. You can provide:
* Subreddit URLs: https://www.reddit.com/r/SUBREDDIT_NAME/
* Search URLs: https://www.reddit.com/search/?q=YOUR_QUERY
* Direct post URLs: https://www.reddit.com/r/.../comments/...

Additional options in the input schema control sorting, limits, and which data fields to extract.

Output Data:
The actor returns an array of items, where each item represents a Reddit post. A typical item includes the following fields in structured JSON:

{
  "title": "Post Title",
  "body": "Full text of the post",
  "url": "https://www.reddit.com/...",
  "timestamp": "2023-10-26T14:30:00.000Z",
  "score": 150,
  "upvotes": 150,
  "author": "username",
  "subreddit": "subredditName",
  "comments": [...],
  "media": [...],
  "awards": [...]
}

This dataset can be downloaded directly or accessed programmatically via the Apify API.

Common Use Cases

Market Research

Gather competitive intelligence and market data

Lead Generation

Extract contact information for sales outreach

Price Monitoring

Track competitor pricing and product changes

Content Aggregation

Collect and organize content from multiple sources

Ready to Get Started?

Try Reddit Posts Scraper now on Apify. Free tier available with no credit card required.

Start Free Trial

Actor Information

Developer: api-empire
Pricing: Paid
Total Runs: 20
Active Users: 3

Related Actors

Video Transcript Scraper: Youtube, X, Facebook, Tiktok, etc.

by invideoiq

Linkedin Profile Details Scraper + EMAIL (No Cookies Required)

by apimaestro

Twitter (X.com) Scraper Unlimited: No Limits

by apidojo

Content Checker

by jakubbalada

Browse All Actors

Apify Platform

Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.

Learn more about Apify

Need Professional Help?

Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.

Find a Specialist

Trusted by millions | Money-back guarantee | 24/7 Support

Reddit Posts Scraper

About Reddit Posts Scraper

What does this actor do?

Key Features

How to Use

Documentation

Reddit Posts Scraper

Overview

Key Features

How to Use

Input/Output

Categories

Common Use Cases

Market Research

Lead Generation

Price Monitoring

Content Aggregation

Ready to Get Started?

Actor Information

Related Actors

Need Professional Help?