Reddit Media Search Scraper

Reddit Media Search Scraper

by easyapi

Scrape images and videos from Reddit search results with full metadata. Perfect for building AI datasets, tracking trends, or curating content.

417 runs
22 users
Try This Actor

Opens on Apify.com

About Reddit Media Search Scraper

Need to pull images and videos from Reddit for a project? This scraper does exactly that. It digs into Reddit's search results to find posts with media, grabbing not just the files but all the useful context around them—think titles, upvotes, comments, and subreddit info. You can choose to download images or videos, and for videos, you often have a few quality options to pick from. I've used it to gather training data for machine learning models, where having the associated text is a huge bonus. It's also become my go-to for tracking visual trends, researching memes at their source, or just curating interesting content from specific communities. It saves you the headache of manual collection and lets you focus on actually using the media, whether that's for analysis, a dataset, or creative inspiration. If your work involves Reddit's massive, ever-growing library of pictures and clips, this tool turns a complex task into a simple, automated process.

What does this actor do?

Reddit Media Search Scraper is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.

Key Features

  • Cloud-based execution - no local setup required
  • Scalable infrastructure for large-scale operations
  • API access for integration with your applications
  • Built-in proxy rotation and anti-blocking measures
  • Scheduled runs and webhooks for automation

How to Use

  1. Click "Try This Actor" to open it on Apify
  2. Create a free Apify account if you don't have one
  3. Configure the input parameters as needed
  4. Run the actor and download your results

Documentation

Reddit Media Search Scraper

Scrapes Reddit search results specifically for posts containing images or videos. It extracts the media files along with comprehensive post metadata, handling pagination automatically.

Key Features

  • Media-Focused Search: Searches across all subreddits for posts containing images or videos.
  • Multi-Quality Video Support: Downloads videos in multiple resolutions (220p, 240p, 360p, 480p, 720p).
  • Rich Metadata: Extracts post details (title, score, timestamps), author info, subreddit data, and engagement metrics.
  • Structured Output: Returns consistent, well-organized data for easy processing.

How to Use

Configure the actor using an input object. The main required parameter is the search query.

Input Configuration

Here’s the basic input structure. Only the query field is mandatory.

{
  "query": "ai",
  "sort": "relevance",
  "time": "all",
  "safeSearch": "0",
  "maxItems": 45
}
  • query (string, required): Your search term (e.g., "ai", "memes").
  • sort (string): Sort results by relevance, hot, top, new, or comments. Default is relevance.
  • time (string): Filter top posts by time: hour, day, week, month, year, all. Default is all.
  • safeSearch (string): "0" for standard search, "1" to filter NSFW content. Default is "0".
  • maxItems (integer): The maximum number of posts to scrape. Default is 100.

Input/Output

Output Data Structure

Results are saved to the dataset in your Apify storage. Each item is a post object containing:

  • Post Info: post_id, title, created_timestamp, score, comment_count, url, flags for nsfw/spoiler.
  • Context: subreddit object (id, name), author object (id), position in search results.
  • Media Data: media_type ("image" or "video") and a media object.
    • For images: media.src (URL) and media.alt.
    • For videos: media.poster (thumbnail URL), media.duration, and media.sources - an array of video files at different qualities (220p to 720p).

Output Example

[
  {
    "post_id": "t3_1epeshq",
    "title": "These are all AI",
    "created_timestamp": "2024-08-11T07:12:08.989Z",
    "score": 23117,
    "comment_count": 3483,
    "nsfw": false,
    "spoiler": false,
    "url": "https://www.reddit.com/r/ChatGPT/comments/1epeshq/these_are_all_ai/",
    "subreddit": { "id": "t5_7hqomg", "name": "ChatGPT", "nsfw": false },
    "author": { "id": "t2_rpv4m" },
    "position": 0,
    "media_type": "image",
    "media": {
      "src": "https://preview.redd.it/these-are-all-ai-v0-o95bc7fqizhd1.png?...",
      "alt": "r/ChatGPT - These are all AI"
    }
  }
]

Data can be downloaded as JSON, JSONL, CSV, Excel, HTML, or XML from the Storage tab.

Related Actors

Categories

Common Use Cases

Market Research

Gather competitive intelligence and market data

Lead Generation

Extract contact information for sales outreach

Price Monitoring

Track competitor pricing and product changes

Content Aggregation

Collect and organize content from multiple sources

Ready to Get Started?

Try Reddit Media Search Scraper now on Apify. Free tier available with no credit card required.

Start Free Trial

Actor Information

Developer
easyapi
Pricing
Paid
Total Runs
417
Active Users
22
Apify Platform

Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.

Learn more about Apify

Need Professional Help?

Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.

Find a Specialist

Trusted by millions | Money-back guarantee | 24/7 Support