BlueSky Feed Scraper
by harvest
About BlueSky Feed Scraper
Scrapes data from a specified Bluesky feed URL and outputs detailed information about the posts, including metadata, authors, embedded media, and statistics such as likes, replies, and reposts.
What does this actor do?
BlueSky Feed Scraper is an actor on the Apify platform. It runs in the cloud, takes a Bluesky feed URL as input, and returns the posts in that feed as JSON.
Key Features
- Cloud-based execution - no local setup required
- Scalable infrastructure for large-scale operations
- API access for integration with your applications
- Built-in proxy rotation and anti-blocking measures
- Scheduled runs and webhooks for automation
How to Use
- Click "Try This Actor" to open it on Apify
- Create a free Apify account if you don't have one
- Configure the input parameters as needed
- Run the actor and download your results
Documentation
# Bluesky Feed Scraper for Apify

This is an Apify actor that scrapes data from a specified Bluesky feed URL and outputs detailed information about the posts, including metadata, authors, embedded media, and statistics such as likes, replies, and reposts.

## Features

- Scrapes Bluesky feed posts from a given feed URL.
- Extracts detailed post data, including:
  - Author details (DID, handle, display name, avatar URL, etc.).
  - Post text, tags, and languages.
  - Embedded images, with metadata (alt text, aspect ratio, URLs).
  - Engagement statistics (likes, replies, reposts, quotes).
  - Thread and reply information.
  - Record metadata, including creation and indexing timestamps.

## Input

The actor requires the following input:

| Field | Type   | Description |
|-------|--------|-------------|
| `url` | String | The URL of the Bluesky feed you want to scrape. Example: `https://bsky.app/profile/username/feed`. |

### Example Input

```json
{ "url": "https://bsky.app/profile/c3rmen.bsky.social/feed" }
```

## Output

The actor produces a JSON array where each object represents a post from the feed. The structure includes:

- `uri` and `cid`: Unique identifiers for the post.
- `author`: Details about the author (DID, handle, avatar, etc.).
- `record`: Post text, tags, languages, and embedded media.
- `embed`: View-ready image metadata (e.g., thumbnails, full-size URLs).
- Engagement metrics (`replyCount`, `repostCount`, `likeCount`, `quoteCount`).
- Thread and reply-related data.
- Timestamps (`createdAt`, `indexedAt`).
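As an illustration of how the output fields described above fit together, the following Python sketch flattens one post object into a single summary row. The function name `summarize_post` and the output keys are hypothetical and chosen only for this example. The nesting (`createdAt` inside `record`, `indexedAt` at the top level) is assumed from the example output in this README, and treating a missing counter as zero is likewise an assumption.

```python
from typing import Any


def summarize_post(post: dict[str, Any]) -> dict[str, Any]:
    """Flatten one scraped post into a flat summary row (hypothetical helper)."""
    author = post.get("author", {})
    record = post.get("record", {})
    return {
        "uri": post.get("uri"),
        "handle": author.get("handle"),
        "display_name": author.get("displayName"),
        "text": record.get("text", ""),
        "langs": record.get("langs", []),
        # Assumed layout: createdAt lives on the record, indexedAt on the post.
        "created_at": record.get("createdAt"),
        "indexed_at": post.get("indexedAt"),
        # Assumption: a counter missing from the output is treated as zero.
        "replies": post.get("replyCount", 0),
        "reposts": post.get("repostCount", 0),
        "likes": post.get("likeCount", 0),
        "quotes": post.get("quoteCount", 0),
    }


if __name__ == "__main__":
    sample = {
        "uri": "at://did:plc:z72i7hdynmk6r22z27h6tvur/app.bsky.feed.post/3lbsizxfxa22r",
        "author": {"handle": "bsky.app", "displayName": "Bluesky"},
        "record": {"text": "Join us", "langs": ["en"],
                   "createdAt": "2024-11-25T21:52:30.840Z"},
        "replyCount": 324,
        "repostCount": 1041,
        "likeCount": 9147,
        "quoteCount": 84,
        "indexedAt": "2024-11-25T21:52:35.058Z",
    }
    print(summarize_post(sample))
```

A flat row like this is usually easier to load into a spreadsheet or database than the nested post object, at the cost of dropping the embed and facet details.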
### Example Output

```json
[
  {
    "uri": "at://did:plc:z72i7hdynmk6r22z27h6tvur/app.bsky.feed.post/3lbsizxfxa22r",
    "cid": "bafyreifohcetdw6e5mudaz6anigzsm5ssjpm3oreyxu4a2l665k7hpxo4q",
    "author": {
      "did": "did:plc:z72i7hdynmk6r22z27h6tvur",
      "handle": "bsky.app",
      "displayName": "Bluesky",
      "avatar": "https://cdn.bsky.app/img/avatar/plain/did:plc:z72i7hdynmk6r22z27h6tvur/bafkreihagr2cmvl2jt4mgx3sppwe2it3fwolkrbtjrhcnwjk4jdijhsoze@jpeg",
      "associated": { "chat": { "allowIncoming": "none" } },
      "labels": [],
      "createdAt": "2023-04-12T04:53:57.057Z"
    },
    "record": {
      "createdAt": "2024-11-25T21:52:30.840Z",
      "embed": {
        "external": {
          "description": "Bluesky is social media as it should be. Find your community among millions of users, unleash your creativity, and have some fun again. https://bsky.app",
          "thumb": {
            "ref": { "$link": "bafkreihh7dthuxfqel6zwcmxapcu47tr34rat7thjtxlfmrwidvxfsmqne" },
            "mimeType": "image/jpeg",
            "size": 384236,
            "$type": "blob"
          },
          "title": "BlueskySocial - Twitch",
          "uri": "https://www.twitch.tv/blueskysocial"
        },
        "$type": "app.bsky.embed.external"
      },
      "facets": [
        {
          "features": [
            { "did": "did:plc:qjeavhlw222ppsre4rscd3n2", "$type": "app.bsky.richtext.facet#mention" }
          ],
          "index": { "byteEnd": 55, "byteStart": 40 },
          "$type": "app.bsky.richtext.facet"
        },
        {
          "features": [
            { "did": "did:plc:ragtjsm2j2vknwkz3zp4oxrd", "$type": "app.bsky.richtext.facet#mention" }
          ],
          "index": { "byteEnd": 76, "byteStart": 64 },
          "$type": "app.bsky.richtext.facet"
        },
        {
          "features": [
            { "did": "did:plc:4ewnpnebeh7zuk5pbardaxqz", "$type": "app.bsky.richtext.facet#mention" }
          ],
          "index": { "byteEnd": 226, "byteStart": 203 },
          "$type": "app.bsky.richtext.facet"
        }
      ],
      "langs": [ "en" ],
      "text": "Join us for another livestream with COO @rose.bsky.team and CTO @pfrazee.com, where they'll share team updates, the story of how Bluesky began, and what’s next. \n\nPlus, a special guest appearance from @flavorflav.bsky.social!\n\nToday 11/25 @ 5 pm PT / 8 pm ET / 1 am GMT / 10am JST",
      "$type": "app.bsky.feed.post"
    },
    "embed": {
      "external": {
        "uri": "https://www.twitch.tv/blueskysocial",
        "title": "BlueskySocial - Twitch",
        "description": "Bluesky is social media as it should be. Find your community among millions of users, unleash your creativity, and have some fun again. https://bsky.app",
        "thumb": "https://cdn.bsky.app/img/feed_thumbnail/plain/did:plc:z72i7hdynmk6r22z27h6tvur/bafkreihh7dthuxfqel6zwcmxapcu47tr34rat7thjtxlfmrwidvxfsmqne@jpeg"
      },
      "$type": "app.bsky.embed.external#view"
    },
    "replyCount": 324,
    "repostCount": 1041,
    "likeCount": 9147,
    "quoteCount": 84,
    "indexedAt": "2024-11-25T21:52:35.058Z",
    "labels": []
  },
  // ...more posts
]
```

## Usage

1. Deploy the Actor: Use the Apify console to set up and deploy this actor.
2. Provide Input: Supply the `url` in the input configuration.
3. Run the Actor: Start the actor, and it will scrape the feed URL and return the posts as JSON.

## Notes

- Ensure the `url` is publicly accessible.
- The actor fetches only visible posts; private or restricted feeds will not be included.

Feel free to suggest additional features or report any issues!
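Once a run has finished and the posts are available as JSON, two small post-processing steps are often useful: turning a post's `at://` URI into a link that opens in the Bluesky web app, and pulling the mentioned handles out of the post text. The Python sketch below shows one way to do both. The helper names `post_web_url` and `extract_mentions` are hypothetical. The web URL pattern (`https://bsky.app/profile/<handle>/post/<record key>`) is an assumption based on how Bluesky post links commonly look. The `byteStart` and `byteEnd` values are treated as UTF-8 byte offsets, which is why the text is encoded before slicing.

```python
from typing import Any

MENTION_TYPE = "app.bsky.richtext.facet#mention"


def post_web_url(post: dict[str, Any]) -> str:
    """Build a bsky.app link for a post (assumed URL pattern, see lead-in)."""
    # The record key is the last path segment of the at:// URI.
    record_key = post["uri"].rsplit("/", 1)[-1]
    handle = post["author"]["handle"]
    return f"https://bsky.app/profile/{handle}/post/{record_key}"


def extract_mentions(record: dict[str, Any]) -> list[str]:
    """Return the mention substrings referenced by the record's facets."""
    # Facet indices count UTF-8 bytes, not characters, so slice the bytes.
    raw = record.get("text", "").encode("utf-8")
    mentions = []
    for facet in record.get("facets", []):
        features = facet.get("features", [])
        if not any(f.get("$type") == MENTION_TYPE for f in features):
            continue
        index = facet["index"]
        chunk = raw[index["byteStart"]:index["byteEnd"]]
        mentions.append(chunk.decode("utf-8", errors="replace"))
    return mentions


if __name__ == "__main__":
    post = {
        "uri": "at://did:plc:z72i7hdynmk6r22z27h6tvur/app.bsky.feed.post/3lbsizxfxa22r",
        "author": {"handle": "bsky.app"},
        "record": {
            "text": "Join us for another livestream with COO @rose.bsky.team",
            "facets": [
                {
                    "features": [{"did": "did:plc:qjeavhlw222ppsre4rscd3n2",
                                  "$type": MENTION_TYPE}],
                    "index": {"byteStart": 40, "byteEnd": 55},
                }
            ],
        },
    }
    print(post_web_url(post))
    print(extract_mentions(post["record"]))
```

Slicing the decoded string instead of the encoded bytes would give wrong results as soon as the text contains a multi-byte character (an emoji or a curly apostrophe, for example) before the mention.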
Common Use Cases
Market Research
Gather competitive intelligence and market data
Lead Generation
Extract contact information for sales outreach
Price Monitoring
Track competitor pricing and product changes
Content Aggregation
Collect and organize content from multiple sources
Ready to Get Started?
Try BlueSky Feed Scraper now on Apify. Free tier available with no credit card required.
Actor Information
- Developer: harvest
- Pricing: Paid
- Total Runs: 451
- Active Users: 35
Related Actors
Tweet Scraper V2 - X / Twitter Scraper
by apidojo
Instagram Scraper
by apify
TikTok Scraper
by clockworks
Instagram Profile Scraper
by apify