Reddit Media Search Scraper
by easyapi
Scrape images and videos from Reddit search results with full metadata. Perfect for building AI datasets, tracking trends, or curating content.
About Reddit Media Search Scraper
Need to pull images and videos from Reddit for a project? This scraper does exactly that. It searches Reddit for posts with media and collects not just the files but the context around them: titles, upvotes, comment counts, and subreddit info. You can choose to download images or videos, and videos usually come with several quality options to pick from.
I've used it to gather training data for machine learning models, where having the associated text is a huge bonus. It's also become my go-to for tracking visual trends, researching memes at their source, or curating interesting content from specific communities. It saves you the headache of manual collection and lets you focus on actually using the media, whether that's for analysis, a dataset, or creative inspiration. If your work involves Reddit's massive, ever-growing library of pictures and clips, this tool turns a tedious manual task into an automated one.
What does this actor do?
Reddit Media Search Scraper is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.
Key Features
- Cloud-based execution - no local setup required
- Scalable infrastructure for large-scale operations
- API access for integration with your applications
- Built-in proxy rotation and anti-blocking measures
- Scheduled runs and webhooks for automation
How to Use
- Click "Try This Actor" to open it on Apify
- Create a free Apify account if you don't have one
- Configure the input parameters as needed
- Run the actor and download your results
Documentation
Reddit Media Search Scraper
Scrapes Reddit search results specifically for posts containing images or videos. It extracts the media files along with comprehensive post metadata, handling pagination automatically.
Key Features
- Media-Focused Search: Searches across all subreddits for posts containing images or videos.
- Multi-Quality Video Support: Downloads videos in multiple resolutions (220p, 240p, 360p, 480p, 720p).
- Rich Metadata: Extracts post details (title, score, timestamps), author info, subreddit data, and engagement metrics.
- Structured Output: Returns consistent, well-organized data for easy processing.
How to Use
Configure the actor using an input object. The only required parameter is the search query.
Input Configuration
Here’s the basic input structure. Only the query field is mandatory.
{
  "query": "ai",
  "sort": "relevance",
  "time": "all",
  "safeSearch": "0",
  "maxItems": 45
}
- query (string, required): Your search term (e.g., "ai", "memes").
- sort (string): Sort results by relevance, hot, top, new, or comments. Default is relevance.
- time (string): Filter top posts by time: hour, day, week, month, year, all. Default is all.
- safeSearch (string): "0" for standard search, "1" to filter NSFW content. Default is "0".
- maxItems (integer): The maximum number of posts to scrape. Default is 100.
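The parameter rules above can be checked before a run is started. The following is a minimal sketch, not part of the actor: `build_input` is a hypothetical helper, and the requirement that maxItems be a positive integer is an assumption, since the listing only gives its default.

```python
import json

# Allowed values, taken from the parameter descriptions above.
SORT_OPTIONS = {"relevance", "hot", "top", "new", "comments"}
TIME_OPTIONS = {"hour", "day", "week", "month", "year", "all"}


def build_input(query, sort="relevance", time="all", safe_search="0", max_items=100):
    """Return an input dict for the actor, raising ValueError on bad values.

    Hypothetical helper for illustration; it is not provided by the actor.
    """
    if not isinstance(query, str) or not query.strip():
        raise ValueError("query is required and must be a non-empty string")
    if sort not in SORT_OPTIONS:
        raise ValueError(f"sort must be one of {sorted(SORT_OPTIONS)}")
    if time not in TIME_OPTIONS:
        raise ValueError(f"time must be one of {sorted(TIME_OPTIONS)}")
    if safe_search not in ("0", "1"):
        raise ValueError('safeSearch must be the string "0" or "1"')
    # Assumption: maxItems must be a positive integer (bool is rejected explicitly).
    if isinstance(max_items, bool) or not isinstance(max_items, int) or max_items < 1:
        raise ValueError("maxItems must be a positive integer")
    return {
        "query": query.strip(),
        "sort": sort,
        "time": time,
        "safeSearch": safe_search,
        "maxItems": max_items,
    }


if __name__ == "__main__":
    # Reproduces the example input shown above.
    print(json.dumps(build_input("ai", max_items=45), indent=2))
```

Note that safeSearch is passed as a string, not a number or boolean, which matches the example input.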
Input/Output
Output Data Structure
Results are saved to the dataset in your Apify storage. Each item is a post object containing:
- Post Info: post_id, title, created_timestamp, score, comment_count, url, flags for nsfw/spoiler.
- Context: subreddit object (id, name), author object (id), position in search results.
- Media Data: media_type ("image" or "video") and a media object.
  - For images: media.src (URL) and media.alt.
  - For videos: media.poster (thumbnail URL), media.duration, and media.sources, an array of video files at different qualities (220p to 720p).
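To make the structure above concrete, here is a small sketch that reduces one post object to a flat record. `flatten_post` is a hypothetical helper. Because the fields inside each media.sources entry are not listed here, the sketch only counts the sources and uses media.poster as the URL for videos rather than guessing at field names.

```python
def flatten_post(post):
    """Reduce one dataset item to a flat dict with a single primary URL.

    Hypothetical helper; it relies only on the fields listed above.
    """
    media = post.get("media") or {}
    media_type = post.get("media_type")
    if media_type == "image":
        primary_url = media.get("src")
    elif media_type == "video":
        # The layout of media.sources entries is not documented here,
        # so fall back to the poster (thumbnail) URL.
        primary_url = media.get("poster")
    else:
        primary_url = None
    return {
        "post_id": post.get("post_id"),
        "title": post.get("title"),
        "subreddit": (post.get("subreddit") or {}).get("name"),
        "score": post.get("score"),
        "media_type": media_type,
        "primary_url": primary_url,
        "source_count": len(media.get("sources") or []),
    }


# Made-up items for illustration only; URLs are placeholders.
IMAGE_POST = {
    "post_id": "t3_example1",
    "title": "An image post",
    "score": 10,
    "subreddit": {"id": "t5_example", "name": "pics"},
    "media_type": "image",
    "media": {"src": "https://example.com/a.png", "alt": "r/pics - An image post"},
}
VIDEO_POST = {
    "post_id": "t3_example2",
    "title": "A video post",
    "score": 5,
    "subreddit": {"id": "t5_example", "name": "videos"},
    "media_type": "video",
    "media": {"poster": "https://example.com/p.jpg", "duration": 12, "sources": [{}, {}]},
}

if __name__ == "__main__":
    for item in (IMAGE_POST, VIDEO_POST):
        print(flatten_post(item))
```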
Output Example
[
  {
    "post_id": "t3_1epeshq",
    "title": "These are all AI",
    "created_timestamp": "2024-08-11T07:12:08.989Z",
    "score": 23117,
    "comment_count": 3483,
    "nsfw": false,
    "spoiler": false,
    "url": "https://www.reddit.com/r/ChatGPT/comments/1epeshq/these_are_all_ai/",
    "subreddit": { "id": "t5_7hqomg", "name": "ChatGPT", "nsfw": false },
    "author": { "id": "t2_rpv4m" },
    "position": 0,
    "media_type": "image",
    "media": {
      "src": "https://preview.redd.it/these-are-all-ai-v0-o95bc7fqizhd1.png?...",
      "alt": "r/ChatGPT - These are all AI"
    }
  }
]
Data can be downloaded as JSON, JSONL, CSV, Excel, HTML, or XML from the Storage tab.
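Once a run's results are exported, they can be processed with standard tooling. The sketch below assumes a JSONL export (one post object per line) and counts media types among posts above a score threshold. `read_jsonl` and `summarize` are hypothetical helpers, and the sample lines are made up for illustration.

```python
import io
import json
from collections import Counter


def read_jsonl(stream):
    """Yield one post dict per non-empty line of a JSONL export."""
    for line in stream:
        line = line.strip()
        if line:
            yield json.loads(line)


def summarize(posts, min_score=0):
    """Count media types among posts whose score is at least min_score."""
    return Counter(
        post.get("media_type")
        for post in posts
        if (post.get("score") or 0) >= min_score
    )


# Made-up sample standing in for a downloaded export file.
SAMPLE = "\n".join([
    json.dumps({"post_id": "t3_a", "score": 250, "media_type": "image"}),
    json.dumps({"post_id": "t3_b", "score": 40, "media_type": "video"}),
    "",
    json.dumps({"post_id": "t3_c", "score": 900, "media_type": "video"}),
])

if __name__ == "__main__":
    counts = summarize(read_jsonl(io.StringIO(SAMPLE)), min_score=100)
    print(dict(counts))
```

With a real export, replace the `io.StringIO(SAMPLE)` stream with an opened file, for example `open("export.jsonl", encoding="utf-8")`, where the filename is whatever you saved the download as.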
Related Actors
- Reddit Posts Search Scraper - Scrapes general Reddit post search results (not media-specific).
- Reddit Comments Scraper - Extracts comment threads and replies.
Common Use Cases
- Market Research: Gather competitive intelligence and market data
- Lead Generation: Extract contact information for sales outreach
- Price Monitoring: Track competitor pricing and product changes
- Content Aggregation: Collect and organize content from multiple sources
Actor Information
- Developer: easyapi
- Pricing: Paid
- Total Runs: 417
- Active Users: 22
Related Actors
- 🏯 Tweet Scraper V2 - X / Twitter Scraper (by apidojo)
- Instagram Scraper (by apify)
- TikTok Scraper (by clockworks)
- Instagram Profile Scraper (by apify)