Substack Leaderboard Scraper

by saswave

Automatically scrape Substack leaderboard data—authors, subscribers, categories, and rankings—into a structured format for analysis and lead generation.

394 runs
21 users

About Substack Leaderboard Scraper

Want to know who's really winning on Substack? It's one thing to browse the leaderboards, but getting that data into a spreadsheet for real analysis is a whole different story. That's why I built this scraper. It pulls down everything you see on the Substack leaderboard pages—author names, their subscription tiers and prices, estimated subscriber counts, categories, and content formats. It even breaks down rankings by language and grabs those all-important "Show" links. I use it to spot trends, find potential collaborators, and understand what topics are gaining traction in different niches. You get clean, structured JSON or CSV output ready to import into your database or analytics tool. It runs on Apify's platform, so you can schedule it to run weekly or just fire it off whenever you need a fresh dataset. If you've ever wasted an afternoon manually copying data from Substack, this automates that entire tedious process and lets you focus on the insights instead.

What does this actor do?

Substack Leaderboard Scraper is an Apify actor that extracts Substack leaderboard data (authors, subscription plans, subscriber counts, categories, and rankings) as structured records. It runs in the cloud on the Apify platform, so there is nothing to install locally.

Key Features

  • Cloud-based execution - no local setup required
  • Scalable infrastructure for large-scale operations
  • API access for integration with your applications
  • Built-in proxy rotation and anti-blocking measures
  • Scheduled runs and webhooks for automation

How to Use

  1. Open the actor's page on Apify
  2. Create a free Apify account if you don't have one
  3. Configure the input parameters as needed
  4. Run the actor and download your results

Documentation

Overview

This actor scrapes detailed subscription and author data from Substack leaderboards at scale. It's built for market research, newsletter analytics, and competitive intelligence.

Key Features

  • Author & Newsletter Data: Collects author name, handle, bio, photo, newsletter name, URL, category, language, and type (newsletter, podcast, etc.).
  • Subscription Plans: Extracts plan details including ID, nickname, currency, interval (monthly/yearly), amount, founding/premium tiers, free trials, and paywall settings.
  • Subscriber Metrics: Retrieves paid and free subscriber counts, global/language rankings, and subscriber estimate magnitudes.
  • Podcast & Content Info: Captures podcast flags, feed URLs, episode availability, community features, post settings, and moderation details.
  • Technical & Tracking Data: Gathers Stripe account info, theme variables, cover/hero images, and optional tracking pixels (GA, Twitter, Facebook).
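
Because plans come in both monthly and yearly intervals, comparing prices across newsletters usually means normalizing them first. The sketch below is one way to do that. It is not the actor's own schema: the `Plan` field names, the "month"/"year" interval strings, and the assumption that amounts are stored in minor currency units (cents) are all hypothetical and should be checked against real output.

```python
from dataclasses import dataclass

# Hypothetical mapping from interval label to number of months.
MONTHS_PER_INTERVAL = {"month": 1, "year": 12}


@dataclass(frozen=True)
class Plan:
    """Illustrative plan record; field names are assumptions, not the actor's schema."""
    plan_id: str
    nickname: str
    currency: str
    interval: str  # assumed to be "month" or "year"
    amount: int    # assumed to be in minor units, e.g. cents for USD


def monthly_equivalent(plan: Plan) -> float:
    """Return the plan price per month in major currency units.

    Assumes a two-decimal currency; zero-decimal currencies such as JPY
    would need a different divisor.
    """
    months = MONTHS_PER_INTERVAL.get(plan.interval)
    if months is None:
        raise ValueError(f"Unsupported interval: {plan.interval!r}")
    return round(plan.amount / 100 / months, 2)


monthly = Plan("plan_m", "Monthly", "usd", "month", 800)
yearly = Plan("plan_y", "Yearly", "usd", "year", 8000)
print(monthly_equivalent(monthly), monthly_equivalent(yearly))
```

With the sample values above, the yearly plan works out cheaper per month than the monthly plan, which is the kind of comparison the normalized figure makes possible.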

How to Use

  1. Configure the actor's run settings in the Apify console or via API.
  2. Input the target Substack leaderboard URL(s) or search parameters.
  3. The actor will navigate the pages, extract the structured data, and handle pagination automatically.
  4. Retrieve the results in JSON format from the actor's dataset.
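
The API route in step 1 can be sketched with the Python standard library alone. This is a minimal sketch, not the actor's documented interface: the actor ID is a placeholder, and the `startUrls` input field is an assumption, since the input schema is not given here. It uses Apify's synchronous "run and get dataset items" endpoint with the token passed as a Bearer header.

```python
import json
import urllib.request

API_BASE = "https://api.apify.com/v2"

# Placeholders: replace with the real actor ID (username~actor-name) and input.
EXAMPLE_ACTOR_ID = "USERNAME~ACTOR-NAME"
# Hypothetical input shape; check the actor's input schema before using it.
EXAMPLE_INPUT = {"startUrls": [{"url": "https://substack.com/leaderboard"}]}


def build_run_request(actor_id: str, token: str, run_input: dict) -> urllib.request.Request:
    """Build (without sending) a request that runs an actor and returns its dataset items."""
    url = f"{API_BASE}/acts/{actor_id}/run-sync-get-dataset-items"
    return urllib.request.Request(
        url,
        data=json.dumps(run_input).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


def run_actor(actor_id: str, token: str, run_input: dict, timeout: float = 300.0) -> list:
    """Send the request and parse the JSON array of dataset items."""
    request = build_run_request(actor_id, token, run_input)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Building the request makes no network call; run_actor() does.
    request = build_run_request(EXAMPLE_ACTOR_ID, "YOUR_APIFY_TOKEN", EXAMPLE_INPUT)
    print(request.get_method(), request.full_url)
```

The synchronous endpoint suits short runs. For longer scrapes it is usually safer to start the run asynchronously and read the dataset once it finishes.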

Input/Output

Input: Typically a Substack leaderboard URL (e.g., https://substack.com/leaderboard). Configuration options may include search filters or category selections.

Output: A structured JSON dataset containing an array of newsletter objects. The example below shows a subset of the fields in one object.

{
  "author_id": 105393068,
  "name": "RocaNews",
  "language": "en",
  "paid_subscription_benefits": ["Subscriber-only posts and full archive", "Post comments and join the community"],
  "free_subscription_benefits": ["Occasional public posts"],
  "paywall_free_trial_enabled": true,
  "podcast_enabled": false,
  "community_enabled": true,
  "cover_photo_url": "https://substack-post-media.s3.amazonaws.com/public/images/8576b80d-19ea-407a-a9b7-a4ccb853a672_300x300.png"
}
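
To get records like the one above into a spreadsheet, the nested lists need flattening first. The sketch below filters records and writes selected columns to CSV, using only field names that appear in the example. The helper names `select` and `to_csv` are illustrative, and joining list values with "; " is one choice among several.

```python
import csv
import io
import json

# One record, trimmed to a few fields from the example above.
SAMPLE_JSON = """
[
  {
    "author_id": 105393068,
    "name": "RocaNews",
    "language": "en",
    "paid_subscription_benefits": [
      "Subscriber-only posts and full archive",
      "Post comments and join the community"
    ],
    "podcast_enabled": false,
    "community_enabled": true
  }
]
"""


def select(records: list, **criteria) -> list:
    """Keep records whose fields equal every given criterion."""
    return [
        record for record in records
        if all(record.get(key) == value for key, value in criteria.items())
    ]


def to_csv(records: list, columns: list) -> str:
    """Render the chosen columns as CSV text, joining list values with '; '."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=columns, lineterminator="\n")
    writer.writeheader()
    for record in records:
        row = {}
        for column in columns:
            value = record.get(column, "")
            if isinstance(value, list):
                value = "; ".join(str(item) for item in value)
            row[column] = value
        writer.writerow(row)
    return buffer.getvalue()


records = json.loads(SAMPLE_JSON)
english = select(records, language="en", community_enabled=True)
print(to_csv(english, ["author_id", "name", "language", "paid_subscription_benefits"]))
```

Missing fields are written as empty cells, so the same column list can be reused even if some records lack a field.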

Common Use Cases

  • Market Research: Gather competitive intelligence and market data
  • Lead Generation: Extract contact information for sales outreach
  • Price Monitoring: Track competitor pricing and product changes
  • Content Aggregation: Collect and organize content from multiple sources

Actor Information

Developer: saswave
Pricing: Paid
Total Runs: 394
Active Users: 21

Apify Platform

Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.
