BlueSky Feed Scraper

BlueSky Feed Scraper

by harvest

Scrapes data from a specified BlueSky feed URL and outputs detailed information about the posts, including metadata, authors, embedded media, and stat...

451 runs
35 users
Try This Actor

Opens on Apify.com

About BlueSky Feed Scraper

Scrapes data from a specified BlueSky feed URL and outputs detailed information about the posts, including metadata, authors, embedded media, and statistics such as likes, replies, and reposts.

What does this actor do?

BlueSky Feed Scraper is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.

Key Features

  • Cloud-based execution - no local setup required
  • Scalable infrastructure for large-scale operations
  • API access for integration with your applications
  • Built-in proxy rotation and anti-blocking measures
  • Scheduled runs and webhooks for automation

How to Use

  1. Click "Try This Actor" to open it on Apify
  2. Create a free Apify account if you don't have one
  3. Configure the input parameters as needed
  4. Run the actor and download your results

Documentation

Bluesky Feed Scraper for Apify This is an Apify actor that scrapes data from a specified Bluesky feed URL and outputs detailed information about the posts, including metadata, authors, embedded media, and statistics such as likes, replies, and reposts. ## Features - Scrapes Bluesky feed posts from a given feed URL. - Extracts detailed post data, including: - Author details (DID, handle, display name, avatar URL, etc.). - Post text, tags, and languages. - Embedded images, with metadata (alt text, aspect ratio, URLs). - Engagement statistics (likes, replies, reposts, quotes). - Thread and reply information. - Record metadata, including creation and indexing timestamps. ## Input The actor requires the following input: | Field | Type | Description | |---------------|--------|-----------------------------------------------| | url | String | The URL of the Bluesky feed you want to scrape. Example: https://bsky.app/profile/username/feed. | ### Example Input json { "url": "https://bsky.app/profile/c3rmen.bsky.social/feed" } ## Output The actor produces a JSON array where each object represents a post from the feed. The structure includes: - uri and cid: Unique identifiers for the post. - author: Details about the author (DID, handle, avatar, etc.). - record: Post text, tags, languages, and embedded media. - embed: View-ready image metadata (e.g., thumbnails, full-size URLs). - Engagement metrics (replyCount, repostCount, likeCount, quoteCount). - Thread and reply-related data. - Timestamps (createdAt, indexedAt). ### Example Output json [ { "uri": "at://did:plc:z72i7hdynmk6r22z27h6tvur/app.bsky.feed.post/3lbsizxfxa22r", "cid": "bafyreifohcetdw6e5mudaz6anigzsm5ssjpm3oreyxu4a2l665k7hpxo4q", "author": { "did": "did:plc:z72i7hdynmk6r22z27h6tvur", "handle": "bsky.app", "displayName": "Bluesky", "avatar": "https://cdn.bsky.app/img/avatar/plain/did:plc:z72i7hdynmk6r22z27h6tvur/bafkreihagr2cmvl2jt4mgx3sppwe2it3fwolkrbtjrhcnwjk4jdijhsoze@jpeg", "associated": { "chat": { "allowIncoming": "none" } }, "labels": [], "createdAt": "2023-04-12T04:53:57.057Z" }, "record": { "createdAt": "2024-11-25T21:52:30.840Z", "embed": { "external": { "description": "Bluesky is social media as it should be. Find your community among millions of users, unleash your creativity, and have some fun again. https://bsky.app", "thumb": { "ref": { "$link": "bafkreihh7dthuxfqel6zwcmxapcu47tr34rat7thjtxlfmrwidvxfsmqne" }, "mimeType": "image/jpeg", "size": 384236, "$type": "blob" }, "title": "BlueskySocial - Twitch", "uri": "https://www.twitch.tv/blueskysocial" }, "$type": "app.bsky.embed.external" }, "facets": [ { "features": [ { "did": "did:plc:qjeavhlw222ppsre4rscd3n2", "$type": "app.bsky.richtext.facet#mention" } ], "index": { "byteEnd": 55, "byteStart": 40 }, "$type": "app.bsky.richtext.facet" }, { "features": [ { "did": "did:plc:ragtjsm2j2vknwkz3zp4oxrd", "$type": "app.bsky.richtext.facet#mention" } ], "index": { "byteEnd": 76, "byteStart": 64 }, "$type": "app.bsky.richtext.facet" }, { "features": [ { "did": "did:plc:4ewnpnebeh7zuk5pbardaxqz", "$type": "app.bsky.richtext.facet#mention" } ], "index": { "byteEnd": 226, "byteStart": 203 }, "$type": "app.bsky.richtext.facet" } ], "langs": [ "en" ], "text": "Join us for another livestream with COO @rose.bsky.team and CTO @pfrazee.com, where they'll share team updates, the story of how Bluesky began, and what’s next. \n\nPlus, a special guest appearance from @flavorflav.bsky.social! πŸŽ‰\n\nToday 11/25 @ 5 pm PT / 8 pm ET / 1 am GMT / 10am JST", "$type": "app.bsky.feed.post" }, "embed": { "external": { "uri": "https://www.twitch.tv/blueskysocial", "title": "BlueskySocial - Twitch", "description": "Bluesky is social media as it should be. Find your community among millions of users, unleash your creativity, and have some fun again. https://bsky.app", "thumb": "https://cdn.bsky.app/img/feed_thumbnail/plain/did:plc:z72i7hdynmk6r22z27h6tvur/bafkreihh7dthuxfqel6zwcmxapcu47tr34rat7thjtxlfmrwidvxfsmqne@jpeg" }, "$type": "app.bsky.embed.external#view" }, "replyCount": 324, "repostCount": 1041, "likeCount": 9147, "quoteCount": 84, "indexedAt": "2024-11-25T21:52:35.058Z", "labels": [] }, // ...more posts ] ## Usage 1. Deploy the Actor: Use the Apify console to set up and deploy this actor. 2. Provide Input: Supply the url in the input configuration. 3. Run the Actor: Start the actor, and it will scrape the feed URL and return the posts as JSON. ## Notes - Ensure the url is publicly accessible. - The actor fetches only visible posts; private or restricted feeds will not be included. Feel free to suggest additional features or report any issues! πŸš€

Common Use Cases

Market Research

Gather competitive intelligence and market data

Lead Generation

Extract contact information for sales outreach

Price Monitoring

Track competitor pricing and product changes

Content Aggregation

Collect and organize content from multiple sources

Ready to Get Started?

Try BlueSky Feed Scraper now on Apify. Free tier available with no credit card required.

Start Free Trial

Actor Information

Developer
harvest
Pricing
Paid
Total Runs
451
Active Users
35
Apify Platform

Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.

Learn more about Apify

Need Professional Help?

Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.

Find a Specialist

Trusted by millions | Money-back guarantee | 24/7 Support