Wordpress Post Scraper - NEW
by eloquent_mountain
This actor scrapes WordPress blog posts of one or more websites, cleans the HTML content, and pushes flattened JSON data (collects all data it can fin...
Opens on Apify.com
About Wordpress Post Scraper - NEW
This actor scrapes WordPress blog posts of one or more websites, cleans the HTML content, and pushes flattened JSON data (collects all data it can find in the post). It uses Selenium to handle pages requiring JavaScript rendering.
What does this actor do?
Wordpress Post Scraper - NEW is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.
Key Features
- Cloud-based execution - no local setup required
- Scalable infrastructure for large-scale operations
- API access for integration with your applications
- Built-in proxy rotation and anti-blocking measures
- Scheduled runs and webhooks for automation
How to Use
- Click "Try This Actor" to open it on Apify
- Create a free Apify account if you don't have one
- Configure the input parameters as needed
- Run the actor and download your results
Documentation
WordPress Scraper Actor The WordPress Scraper Actor allows you to easily scrape content from (multiple) WordPress websites, including blogs, articles, author details, categories, comments and media. It uses the WordPress REST API, Requests library and if necessary Selenium for accurate data extraction. Only works on WP sites that accept REST API calls ## Features - Extract blog posts, articles, author information, products, categories, comments and images from WordPress websites. - Uses REST API and Selenium for complete data extraction. - Outputs cleaned HTML content as plain text in JSON format. - Supports pagination for comprehensive scraping. ## How It Works The actor takes a single or multiple website URLs as input, interacts with the REST API to gather data, and uses Selenium to handle JavaScript-rendered pages. The scraped data is cleaned and formatted as structured JSON. ### Input Parameters - start_urls (required): List of website URLs to scrape (company1.com,company2.com,etc). - max_results (optional): Maximum number of posts to retrieve per site. Set to 'all' for all posts. - scrape_mode (required, default is 'posts'): Choose the data you wish to scrape, you can choose from 'posts', 'media', 'categories','comments' ### Output The actor outputs (cleaned) JSON data for each post, including: - Title - Cleaned Content - Metadata (author, publication date, tags, categories) - Media Links - All post data: All the raw post data in the "All fields" tab ## Getting Started 1. Create an Actor Task: On Apify, create a new actor task and provide the list of URLs to scrape. 2. Input Configuration: Set start_urls and optionally max_results. 3. Run the Actor: Execute the actor to start scraping. 4. Review Results: Download the results as a JSON file. ## Use Cases - Content Aggregation: Collect articles or blog posts from multiple WordPress sites. - Market Research: Scrape product descriptions and reviews from WordPress-powered e-commerce sites. - Data Analysis: Gather articles for analysis or summarization. ## Important Notes - Respecting Site Policies: Always ensure you have permission to scrape data from a website, and respect the site's robots.txt policies. ## Actor Input Example json { "start_urls": [ { "url": "https://example.com" }, { "url": "https://another-example.com" } ], "max_results": "all" } ## Actor Output Example (CLEANED) json { "title": "Sample Blog Post", "cleaned_content": "This is the content of the blog post, without HTML tags.", "date_published": "2023-10-01", }
Categories
Common Use Cases
Market Research
Gather competitive intelligence and market data
Lead Generation
Extract contact information for sales outreach
Price Monitoring
Track competitor pricing and product changes
Content Aggregation
Collect and organize content from multiple sources
Ready to Get Started?
Try Wordpress Post Scraper - NEW now on Apify. Free tier available with no credit card required.
Start Free TrialActor Information
- Developer
- eloquent_mountain
- Pricing
- Paid
- Total Runs
- 837
- Active Users
- 162
Related Actors
Video Transcript Scraper: Youtube, X, Facebook, Tiktok, etc.
by invideoiq
Linkedin Profile Details Scraper + EMAIL (No Cookies Required)
by apimaestro
Twitter (X.com) Scraper Unlimited: No Limits
by apidojo
Content Checker
by jakubbalada
Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.
Learn more about ApifyNeed Professional Help?
Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.
Trusted by millions | Money-back guarantee | 24/7 Support