Threads Post Scraper

by sinam7

Scrape Threads posts and comments, post author only.

72 runs
7 users
Try This Actor

What does this actor do?

Threads Post Scraper is a web scraping actor on the Apify platform. Given a single Threads post URL, it extracts the original post and its related replies, and it runs in the cloud rather than on your machine.

Key Features

  • Cloud-based execution - no local setup required
  • Scalable infrastructure for large-scale operations
  • API access for integration with your applications
  • Built-in proxy rotation and anti-blocking measures
  • Scheduled runs and webhooks for automation

How to Use

  1. Click "Try This Actor" to open it on Apify
  2. Create a free Apify account if you don't have one
  3. Configure the input parameters as needed
  4. Run the actor and download your results

Documentation

# Threads Post Scraper

Scrape Threads posts and comments from a single post URL. Filters out profile images and recommended posts, returning only the original post and its related replies.

## What this actor does

- Accepts a single Threads post URL
- Extracts post content, author info, timestamps, images, and comments
- Automatically filters out:
  - Profile thumbnails (`s150x150`, `s42x42`)
  - Unrelated recommended posts from the Threads feed
- Returns only posts from the original author or replies to the original post

## Input

### Example input

```json
{
  "url": "https://www.threads.net/@username/post/ABC123"
}
```

### Input fields

| Field | Type | Required | Notes |
|-------|------|----------|-------|
| url | string | Yes | Full Threads post URL (e.g., https://www.threads.net/@username/post/ABC123) |

## Output

The actor outputs an array of post objects to the dataset.

### Example output items

```json
[
  {
    "authorId": "/@hyle_edu.kr",
    "authorName": null,
    "content": "hyle_edu.kr\nDesign Threads\nCursor 디자인 총괄이 말아주는(낋여주는) 스타트업 랜딩 페이지 리뷰 영상\nUI/UX 디자이너분들께 실전 레이아웃·카피 피드백이 꽤 도움 될 것 같습니다.\n5가지 핵심 교훈:\n1) 첫 화면에서 한 줄로 정체를 밝혀라.\n2) 누구를 위한 제품인지 콕 집어 말해라.\n3) 한 화면에 CTA는 딱 하나만 둬라.\n4) 보라 그라디언트·AI 템플릿 느낌을 버려라.\n5) 말로 돌리지 말고, 바로 데모를 보여줘라. \nTranslate",
    "timestamp": "2025-11-27T08:04:22.000Z",
    "images": [
      "https://scontent-iad3-2.cdninstagram.com/v/t51.2885-15/588000954_17852930637589530_3115163017750846664_n.jpg?stp=dst-jpg_e35_tt6&efg=eyJ2ZW5jb2RlX3RhZyI6InRocmVhZHMuRkVFRC5pbWFnZV91cmxnZW4uMjE2MHgxMjA2LnNkci5mODI3ODcuZGVmYXVsdF9pbWFnZS5jMiJ9&_nc_ht=scontent-iad3-2.cdninstagram.com&_nc_cat=106&_nc_oc=Q6cZ2QHgrxrOBEi_eqXMrmfiDdX3j_o7E4JnDU2yJd9VGNapB_a-senBjgZIZW_Il9lVgJI&_nc_ohc=WeCMxiN7Zc4Q7kNvwGggxko&_nc_gid=CZmmu7zoN4syiEHB2bNy1A&edm=APs17CUBAAAA&ccb=7-5&ig_cache_key=Mzc3NDk2Mjg2NDU1NDcxNzQwMg%3D%3D.3-ccb7-5&oh=00_Afm3SeH-ljp6D7EhRYx0phszFpE84lTMXT6l5lDUhMFC8Q&oe=69360721&_nc_sid=10d13b"
    ],
    "postUrl": "https://www.threads.net/@hyle_edu.kr/post/DRjXLmGEzDa",
    "isOriginalAuthor": true
  },
  {
    "authorId": "/@hyle_edu.kr",
    "authorName": null,
    "content": "hyle_edu.kr\nDesign Threads\nAuthor\nyoutube.com/watch…\nyoutube.com\nCursor Head of Design Reviews Startup Websites",
    "timestamp": "2025-11-27T08:04:27.000Z",
    "images": [
      "https://external-iad3-1.xx.fbcdn.net/emg1/v/t13/4510717797100347293?stp=dst-src&url=https%3A%2F%2Fi.ytimg.com%2Fvi%2FRynySryqM_0%2Fmaxresdefault.jpg&utld=ytimg.com&_nc_gid=CZmmu7zoN4syiEHB2bNy1A&_nc_oc=AdnnBnpQiJ-ImFHLB1zQ6KdkwE3eYgejFX751eNz-RdBMIC6qxpnkF8Fk2HROXnpQZk&ccb=13-1&oh=06_Q3-5AXm8t92Bc1cF_D9RD01869Pwh9uLvqptfXeRGYMGpgoL&oe=6931F0AD&_nc_sid=1d65fc"
    ],
    "postUrl": "https://www.threads.net/@hyle_edu.kr/post/DRjXMNbE5Ds",
    "isOriginalAuthor": true
  }
]
```

### Output fields

| Field | Type | Description |
|-------|------|-------------|
| authorId | string | Author's Threads ID (e.g., "/@username") |
| authorName | string or null | Author's display name (often null in current implementation) |
| content | string | Post text content, newline-separated |
| timestamp | string | ISO 8601 timestamp |
| images | string[] | Array of image URLs (excludes profile pictures) |
| postUrl | string | Full URL to the post |
| isOriginalAuthor | boolean | Whether this post is from the original thread author |

## Features

### Smart filtering

Profile image exclusion: Automatically removes profile thumbnails by detecting thumbnail size patterns (`s150x150`, `s42x42`) in URLs.

Feed recommendation filtering: Threads shows recommended posts mixed with replies. This actor filters them out by:

- Identifying the original post author from the URL
- Keeping only posts from that author or replies to their posts
- Excluding unrelated recommended content

### Stable selectors

Uses DOM attributes that are less likely to change:

- `div[data-pressable-container="true"]` for post containers
- `a[href^="/@"]` for author links
- `time` elements for timestamps
- `span[dir="auto"]` for text content

Avoids obfuscated React Native class names like `x1lliihq` that change frequently.

## Usage

### Via Apify Console

1. Open the actor in Apify Console
2. Click Run
3. Enter a Threads post URL in the input
4. Start the run
5. View results in the Dataset tab

### Via API

#### Start a run

```bash
curl -X POST "https://api.apify.com/v2/acts/YOUR_ACTOR_ID/runs?token=YOUR_TOKEN&waitForFinish=120" \
  -H "Content-Type: application/json" \
  -d '{ "url": "https://www.threads.net/@username/post/ABC123" }'
```

The `waitForFinish=120` parameter asks the API to hold the request open for up to 120 seconds while the run finishes (synchronous mode). The scraped items themselves are read from the dataset in the next step.

#### Fetch results from dataset

```bash
curl "https://api.apify.com/v2/datasets/DATASET_ID/items?token=YOUR_TOKEN"
```

### Integration with n8n

This actor is designed to work with n8n Cloud via a sub-workflow:

1. HTTP Request node calls the actor
2. HTTP Request node fetches results from the dataset
3. Results are passed to AI for summarization

See the blog post for a detailed n8n integration guide.

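The two HTTP steps described above (start a run, then read the dataset) can also be sketched in Python with only the standard library. This is a minimal sketch, not the author's integration code: the function names are hypothetical, and reading the dataset ID from a `defaultDatasetId` field on the returned run object is an assumption about the Apify API response that this documentation does not spell out.

```python
import json
import urllib.parse
import urllib.request

API_BASE = "https://api.apify.com/v2"


def build_run_request(actor_id, token, post_url, wait_secs=120):
    """Build the POST request that starts a run (mirrors the curl call)."""
    query = urllib.parse.urlencode({"token": token, "waitForFinish": wait_secs})
    body = json.dumps({"url": post_url}).encode("utf-8")
    return urllib.request.Request(
        f"{API_BASE}/acts/{actor_id}/runs?{query}",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def dataset_items_url(dataset_id, token):
    """Build the URL used to read items from a dataset."""
    query = urllib.parse.urlencode({"token": token})
    return f"{API_BASE}/datasets/{dataset_id}/items?{query}"


def scrape_post(actor_id, token, post_url):
    """Start a run, then fetch its dataset items (performs network calls)."""
    with urllib.request.urlopen(build_run_request(actor_id, token, post_url)) as resp:
        run = json.load(resp)["data"]
    # Assumption: the run object carries the dataset ID as "defaultDatasetId".
    items_url = dataset_items_url(run["defaultDatasetId"], token)
    with urllib.request.urlopen(items_url) as resp:
        return json.load(resp)
```

Calling `scrape_post("YOUR_ACTOR_ID", "YOUR_TOKEN", "https://www.threads.net/@username/post/ABC123")` would return the list of post objects, provided the run finishes within the wait window. A run that is still in progress when the request returns may leave the dataset incomplete, so a production client would likely check the run status first.
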
## Limitations

- Public content only: No login/cookies support
- DOM structure dependency: Threads may change their HTML structure at any time
- Rate limiting: Respect Apify's rate limits and Threads' terms of service
- No pagination: Fetches only initially loaded comments (typically first 10-20)

## Notes

- The actor waits 3 seconds after page load to ensure dynamic content is rendered
- `authorName` is often null because Threads stores author names in complex nested structures
- Image URLs are CDN links that may expire after some time
- This actor is optimized for single post scraping, not bulk operations
- Update the size filters (`s150x150`, `s42x42`) in the code
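
For readers who want to apply the same two rules to data they already hold, here is a rough Python sketch of thumbnail exclusion and original-author detection. It is not the actor's source code, which is not shown here: the function names and the `THUMBNAIL_SIZES` tuple are hypothetical, and substring matching on the size markers is an assumption about how the size filter works.

```python
import re
from urllib.parse import urlparse

# Size markers named in the notes above; extend the tuple if new sizes appear.
THUMBNAIL_SIZES = ("s150x150", "s42x42")


def is_profile_thumbnail(image_url):
    """True if the URL contains one of the known thumbnail size markers."""
    return any(size in image_url for size in THUMBNAIL_SIZES)


def author_from_post_url(post_url):
    """Return '/@username' from a Threads post URL, or None if absent."""
    match = re.match(r"^/(@[^/]+)/post/", urlparse(post_url).path)
    return f"/{match.group(1)}" if match else None


def clean_items(items, post_url):
    """Drop thumbnail images and flag items written by the original author."""
    original = author_from_post_url(post_url)
    cleaned = []
    for item in items:
        images = [u for u in item.get("images", []) if not is_profile_thumbnail(u)]
        cleaned.append({
            **item,
            "images": images,
            "isOriginalAuthor": item.get("authorId") == original,
        })
    return cleaned
```

Keeping the size markers in one tuple means a new thumbnail size only needs a one-line change. Deciding whether an item is a reply to the original post needs page context that the output fields do not carry, so this sketch does not attempt it.
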

Actor Information

Developer: sinam7
Pricing: Paid
Total Runs: 72
Active Users: 7