WordPress Articles Scraper

WordPress Articles Scraper

by extremescrapes

Effortlessly scrape posts, authors, tags & images from any WordPress site via its REST API. Perfect for content aggregation, research, and migrations.

708 runs
108 users
Try This Actor

Opens on Apify.com

About WordPress Articles Scraper

Need to pull content from a WordPress site without a headache? This scraper is your go-to. It taps directly into the WordPress REST API, which is the cleanest and most reliable way to get structured data out of any modern WP site. You just give it a starting URL, and it systematically works through all the posts, handling pagination automatically so you don't have to. Beyond just the article text, it grabs the full metadata suite: author info, categories, tags, and the URLs for featured images. This makes it perfect for building content aggregators, conducting market research by analyzing competitors' publishing habits, or migrating content to a new platform without losing all that valuable SEO and organizational data. I've used it to feed article data into a custom dashboard, and the JSON output is consistently clean and ready for analysis or import. It saves you from writing and maintaining your own API integration scripts, letting you focus on actually using the data.

What does this actor do?

WordPress Articles Scraper is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.

Key Features

  • Cloud-based execution - no local setup required
  • Scalable infrastructure for large-scale operations
  • API access for integration with your applications
  • Built-in proxy rotation and anti-blocking measures
  • Scheduled runs and webhooks for automation

How to Use

  1. Click "Try This Actor" to open it on Apify
  2. Create a free Apify account if you don't have one
  3. Configure the input parameters as needed
  4. Run the actor and download your results

Documentation

WordPress Articles Scraper

Overview

This Apify actor extracts posts and metadata from WordPress sites using the WordPress REST API. It's built for developers, researchers, and content aggregators who need structured article data. The actor handles pagination automatically and can filter results by keyword.

Key Features

  • Fetches posts from any public WordPress site.
  • Automatically manages pagination to retrieve all available posts.
  • Filters posts based on a search term.
  • Extracts full post metadata: author, categories, tags, and featured image URLs.
  • Outputs clean, structured JSON.

How to Use

Provide a WordPress site URL to start scraping. You can run the actor directly on Apify or via its API.

Using the Apify Interface:
1. Go to the actor's page on Apify.
2. Enter the required startUrls parameter.
3. Click Run.

Using the Apify API:

curl -X POST -H "Content-Type: application/json" \
-d '{"startUrls": [{"url": "https://example.com", "method": "GET"}]}' \
"https://api.apify.com/v2/acts/YOUR_ACTOR_ID/runs?token=YOUR_API_TOKEN"

Input/Output

Input Parameters:
| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| startUrls | Array | Yes | List of WordPress site URLs to scrape (e.g., [{"url": "https://example.com"}]). |

Output Format:
The actor returns a JSON dataset. Each item represents a post with the following structure:

{
  "id": 123,
  "date": "2025-03-28T12:00:00",
  "modified": "2025-03-28T14:00:00",
  "slug": "example-post",
  "link": "https://example.com/example-post",
  "title": "Example Post Title",
  "content": "<p>Post content...</p>",
  "excerpt": "Short summary...",
  "author": "Author Name",
  "categories": ["Category1", "Category2"],
  "tags": ["tag1", "tag2"],
  "featured_image": "https://example.com/image.jpg"
}

Common Use Cases:
* Aggregating content from multiple blogs.
* SEO research and content analysis.
* Creating datasets for NLP or data science projects.
* Backing up blog content.

For issues or contributions, refer to the actor's page on Apify.

Common Use Cases

Market Research

Gather competitive intelligence and market data

Lead Generation

Extract contact information for sales outreach

Price Monitoring

Track competitor pricing and product changes

Content Aggregation

Collect and organize content from multiple sources

Ready to Get Started?

Try WordPress Articles Scraper now on Apify. Free tier available with no credit card required.

Start Free Trial

Actor Information

Developer
extremescrapes
Pricing
Paid
Total Runs
708
Active Users
108
Apify Platform

Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.

Learn more about Apify

Need Professional Help?

Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.

Find a Specialist

Trusted by millions | Money-back guarantee | 24/7 Support