WordPress Articles Scraper
by extremescrapes
Effortlessly scrape posts, authors, tags & images from any WordPress site via its REST API. Perfect for content aggregation, research, and migrations.
About WordPress Articles Scraper
Need to pull content from a WordPress site without a headache? This scraper taps directly into the WordPress REST API, which is the cleanest and most reliable way to get structured data out of a WordPress site that exposes the API publicly. You give it a starting URL, and it works through all the posts, handling pagination automatically. Beyond the article text, it grabs the accompanying metadata: author info, categories, tags, and the URLs of featured images. That makes it a good fit for building content aggregators, researching competitors' publishing habits, or migrating content to a new platform without losing SEO and organizational data. I've used it to feed article data into a custom dashboard, and the JSON output is consistently clean and ready for analysis or import. It saves you from writing and maintaining your own API integration scripts, so you can focus on using the data.
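To make "handling pagination automatically" concrete, here is a minimal Python sketch of how paging through a WordPress posts collection generally works. It is not the actor's source code. WordPress core reports the number of pages in the `X-WP-TotalPages` response header and accepts `page` and `per_page` query parameters; the `fetch_page` callable and the `max_pages` safety cap are hypothetical names introduced for this illustration.

```python
from typing import Callable, List, Tuple

# Hypothetical callable: takes a 1-based page number and returns
# (posts_on_that_page, total_page_count). In practice it would issue a GET
# to the posts endpoint and read the X-WP-TotalPages header.
FetchPage = Callable[[int], Tuple[List[dict], int]]


def fetch_all_posts(fetch_page: FetchPage, max_pages: int = 1000) -> List[dict]:
    """Collect posts page by page until the reported last page is reached."""
    posts: List[dict] = []
    page, total_pages = 1, 1
    while page <= min(total_pages, max_pages):
        batch, total_pages = fetch_page(page)
        if not batch:  # defensive stop if the site returns an empty page
            break
        posts.extend(batch)
        page += 1
    return posts
```

Passing the fetcher in as an argument keeps the loop independent of any HTTP library, so it can be exercised with canned pages before it is pointed at a live site.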
What does this actor do?
WordPress Articles Scraper is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.
Key Features
- Cloud-based execution - no local setup required
- Scalable infrastructure for large-scale operations
- API access for integration with your applications
- Built-in proxy rotation and anti-blocking measures
- Scheduled runs and webhooks for automation
How to Use
- Click "Try This Actor" to open it on Apify
- Create a free Apify account if you don't have one
- Configure the input parameters as needed
- Run the actor and download your results
Documentation
WordPress Articles Scraper
Overview
This Apify actor extracts posts and metadata from WordPress sites using the WordPress REST API. It's built for developers, researchers, and content aggregators who need structured article data. The actor handles pagination automatically and can filter results by keyword.
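For readers curious what a keyword-filtered request to the WordPress REST API looks like, the sketch below builds the URL of the standard posts collection. The `/wp-json/wp/v2/posts` route and the `page`, `per_page`, and `search` parameters are part of WordPress core; the function name is hypothetical, and whether the actor builds its requests this way is an assumption.

```python
from typing import Optional
from urllib.parse import urlencode


def build_posts_url(site_url: str, page: int = 1, per_page: int = 100,
                    search: Optional[str] = None) -> str:
    """Return the posts-collection URL for one page of results."""
    per_page = max(1, min(per_page, 100))  # WordPress caps per_page at 100
    params = {"per_page": per_page, "page": page}
    if search:
        params["search"] = search  # keyword filter applied by WordPress itself
    return f"{site_url.rstrip('/')}/wp-json/wp/v2/posts?{urlencode(params)}"
```

Sites that use plain permalinks serve the same route through the `?rest_route=/wp/v2/posts` query form instead, which this sketch does not cover.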
Key Features
- Fetches posts from any public WordPress site.
- Automatically manages pagination to retrieve all available posts.
- Filters posts based on a search term.
- Extracts full post metadata: author, categories, tags, and featured image URLs.
- Outputs clean, structured JSON.
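The metadata extraction listed above can be pictured as a flattening step. When a WordPress post is requested with the `_embed` parameter, the author, taxonomy terms, and featured media arrive nested under `_embedded`. The sketch below is one way to flatten that into simple fields; the function name and the fallback behavior for missing data are assumptions for illustration, not the actor's actual code.

```python
from typing import Any, Dict, List


def flatten_post(raw: Dict[str, Any]) -> Dict[str, Any]:
    """Flatten a raw WordPress REST post (fetched with _embed) into simple fields."""
    embedded = raw.get("_embedded", {})
    authors = embedded.get("author") or [{}]
    media = embedded.get("wp:featuredmedia") or [{}]
    # "wp:term" is a list of lists: one inner list per taxonomy.
    terms = [t for group in embedded.get("wp:term", []) for t in group]

    def names(taxonomy: str) -> List[str]:
        return [t.get("name", "") for t in terms if t.get("taxonomy") == taxonomy]

    return {
        "id": raw["id"],
        "title": raw.get("title", {}).get("rendered", ""),
        "author": authors[0].get("name"),
        "categories": names("category"),
        "tags": names("post_tag"),
        "featured_image": media[0].get("source_url"),
    }
```

Posts without an embedded author or featured image come back with `None` in those fields rather than raising an error.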
How to Use
Provide a WordPress site URL to start scraping. You can run the actor directly on Apify or via its API.
Using the Apify Interface:
1. Go to the actor's page on Apify.
2. Enter the required startUrls parameter.
3. Click Run.
Using the Apify API:
curl -X POST -H "Content-Type: application/json" \
  -d '{"startUrls": [{"url": "https://example.com", "method": "GET"}]}' \
  "https://api.apify.com/v2/acts/YOUR_ACTOR_ID/runs?token=YOUR_API_TOKEN"
Input/Output
Input Parameters:
| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| startUrls | Array | Yes | List of WordPress site URLs to scrape (e.g., [{"url": "https://example.com"}]). |
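The `startUrls` input can also be submitted from code. The sketch below is a Python counterpart to the curl call shown earlier, using only the standard library. It builds the request without sending it; the function name is hypothetical, and `YOUR_ACTOR_ID` and `YOUR_API_TOKEN` remain placeholders you must fill in.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request


def build_run_request(actor_id: str, token: str, site_urls: list) -> Request:
    """Build (but do not send) the POST request that starts an actor run."""
    payload = {"startUrls": [{"url": u, "method": "GET"} for u in site_urls]}
    query = urlencode({"token": token})
    url = f"https://api.apify.com/v2/acts/{actor_id}/runs?{query}"
    return Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
```

To start the run, pass the returned object to `urllib.request.urlopen`. Keeping construction separate from sending makes it easy to inspect the payload first.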
Output Format:
The actor returns a JSON dataset. Each item represents a post with the following structure:
{
  "id": 123,
  "date": "2025-03-28T12:00:00",
  "modified": "2025-03-28T14:00:00",
  "slug": "example-post",
  "link": "https://example.com/example-post",
  "title": "Example Post Title",
  "content": "<p>Post content...</p>",
  "excerpt": "Short summary...",
  "author": "Author Name",
  "categories": ["Category1", "Category2"],
  "tags": ["tag1", "tag2"],
  "featured_image": "https://example.com/image.jpg"
}
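If you consume the dataset in Python, a small typed wrapper can make the items easier to work with. The sketch below assumes every item carries the fields shown in the sample above and that `date` and `modified` are ISO 8601 strings without a timezone, as in that sample; the `Post` class and its `from_item` method are hypothetical helpers, not part of the actor.

```python
from dataclasses import dataclass, field
from datetime import datetime
from typing import List, Optional


@dataclass
class Post:
    id: int
    slug: str
    link: str
    title: str
    date: datetime
    modified: datetime
    author: Optional[str] = None
    categories: List[str] = field(default_factory=list)
    tags: List[str] = field(default_factory=list)
    featured_image: Optional[str] = None

    @classmethod
    def from_item(cls, item: dict) -> "Post":
        """Convert one dataset item (a dict) into a Post with parsed dates."""
        return cls(
            id=item["id"],
            slug=item["slug"],
            link=item["link"],
            title=item["title"],
            date=datetime.fromisoformat(item["date"]),
            modified=datetime.fromisoformat(item["modified"]),
            author=item.get("author"),
            categories=list(item.get("categories", [])),
            tags=list(item.get("tags", [])),
            featured_image=item.get("featured_image"),
        )
```

With dates parsed into `datetime` objects, questions such as "how long after publishing was this post last edited" become simple subtractions.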
Common Use Cases:
- Aggregating content from multiple blogs.
- SEO research and content analysis.
- Creating datasets for NLP or data science projects.
- Backing up blog content.
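For the NLP and content-analysis use cases, the `content` field holds HTML, so a common first step is reducing it to plain text. The sketch below does this with Python's standard `html.parser` module. The set of tags treated as block-level is an assumption chosen for illustration, and the helper names are hypothetical.

```python
from html.parser import HTMLParser

# Tags after which a word break is inserted (illustrative, not exhaustive).
BLOCK_TAGS = {"p", "br", "div", "li", "h1", "h2", "h3", "h4", "h5", "h6"}


class _TextExtractor(HTMLParser):
    def __init__(self) -> None:
        super().__init__(convert_charrefs=True)  # decode entities like &amp;
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if tag in BLOCK_TAGS:
            self.parts.append(" ")

    def handle_endtag(self, tag):
        if tag in BLOCK_TAGS:
            self.parts.append(" ")

    def handle_data(self, data):
        self.parts.append(data)


def html_to_text(html: str) -> str:
    """Strip tags from post HTML and collapse runs of whitespace."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()  # flush any text still buffered by the parser
    return " ".join("".join(parser.parts).split())
```

Inline tags such as `<b>` do not introduce spaces, so words that are only partly emphasized stay intact.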
For issues or contributions, refer to the actor's page on Apify.
Actor Information
- Developer: extremescrapes
- Pricing: Paid
- Total Runs: 708
- Active Users: 108