Shein Product Scraper

by ruly_optimism

723 runs
3 users
Try This Actor

Opens on Apify.com

About Shein Product Scraper

What does this actor do?

Shein Product Scraper is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.

Key Features

  • Cloud-based execution - no local setup required
  • Scalable infrastructure for large-scale operations
  • API access for integration with your applications
  • Built-in proxy rotation and anti-blocking measures
  • Scheduled runs and webhooks for automation

How to Use

  1. Click "Try This Actor" to open it on Apify
  2. Create a free Apify account if you don't have one
  3. Configure the input parameters as needed
  4. Run the actor and download your results

Documentation

🛍️ SHEIN Product Scraper The most reliable scraper for extracting comprehensive product data from SHEIN - one of the world's largest fashion e-commerce platforms with over 150 million active users. ## ✨ Why Choose This Scraper? - High Success Rate - 95%+ success rate with 3-layer anti-blocking system - Complete Product Data - Get everything: prices, sizes, colors, specs, images, SKUs - Fast & Efficient - Optimized for speed (~10-30s per product) - Smart Recovery - Automatic browser restart on crashes, continues processing - Proxy Included - Premium residential proxy for Israel domain included - Reliable - 3 captcha bypass strategies + automatic retries ## 🛡️ Anti-Blocking System This scraper includes a sophisticated 3-strategy captcha bypass system: | Strategy | Description | Speed | |----------|-------------|-------| | Strategy 1 | Main domain redirect | ~2-3s | | Strategy 2 | Wait & retry | ~4-5s | | Strategy 3 | IP rotation (new proxy) | ~40-50s | The system automatically tries each strategy in order until success. ## 📊 What Data You Get Each product returns comprehensive, structured data: json { "sku": "sz2503138148213601", "product_id": "70811510", "title": "SHEIN BAE Sexy Lace Sheer Camisole Top", "main_image": "https://img.shein.com/images/product1.jpg", "images": ["https://...", "https://..."], "color": "Black", "retail_price": { "amount": 58.00, "amount_with_symbol": "₪58.00", "usd_amount": 15.50, "usd_amount_with_symbol": "$15.50" }, "sale_price": { "amount": 29.00, "amount_with_symbol": "₪29.00", "usd_amount": 7.75, "usd_amount_with_symbol": "$7.75" }, "has_discount": true, "discount_percentage": 50, "sizes": [ { "attr_value_name": "S", "is_sold_out": false, "size_chart": [ {"attr_name_value_key": "Bust", "attr_name_value_cm": "86 cm"} ] } ], "specs": [ {"name": "Material", "value": "Polyester"}, {"name": "Style", "value": "Sexy"}, {"name": "Pattern Type", "value": "Solid"} ], "variants": [...], "url": "https://il.shein.com/...", "scrape_time_seconds": 12.5, "scraped_at": "2024-12-15T14:30:00Z" } ## 🎯 Perfect For | Use Case | Description | |----------|-------------| | Price Monitoring | Track SHEIN prices and discounts over time | | Competitor Analysis | Compare products, pricing, and inventory | | Market Research | Analyze fashion trends and product availability | | Dropshipping | Get accurate product data for your listings | | E-commerce Integration | Sync SHEIN products to your store | | Data Analytics | Build comprehensive fashion industry datasets | ## ⚙️ Input Configuration | Field | Type | Description | Default | |-------|------|-------------|---------| | urls | array | List of SHEIN product URLs (max 5 per run) | Required | | useProxy | boolean | Enable premium proxy (recommended) | true | | maxRetries | number | Retries per URL (1-10) | 3 | | maxCaptchaRetries | number | Max captcha bypass attempts | 5 | | includeImages | boolean | Include all image URLs | true | | includeReviews | boolean | Include ratings | true | | timeout | number | Page timeout in seconds | 30 | | delayBetweenRequests | number | Delay between requests (ms) | 1000 | ### Example Input json { "urls": [ "https://il.shein.com/SHEIN-BAE-Sexy-Lace-p-70811510.html", "https://il.shein.com/Another-Product-p-12345678.html" ], "useProxy": true, "maxRetries": 3, "timeout": 30 } ## 📈 Performance | Metric | Value | |--------|-------| | Average scrape time (no captcha) | 10-15s per product | | Average scrape time (with captcha bypass) | 20-50s per product | | Success rate | >95% | | Memory usage | ~1GB | | Max URLs per run | 5 (recommended) | ### Performance Optimizations - ⚡ Fast Chrome startup - Disabled unnecessary Chrome features - ⚡ Eager page load - Don't wait for all resources - ⚡ Images disabled - Skip image loading for speed - ⚡ Quick polling - 0.5s data check interval - ⚡ Browser timeouts - 60s page load, 30s script timeout - ⚡ Auto recovery - Browser restarts on crash ## 🌍 Supported Regions Currently optimized for: - Israel (il.shein.com) - Full support with premium proxy More regions coming soon! ## 💡 Tips for Best Results 1. Use Proxy - Keep useProxy: true for reliable scraping 2. Batch Size - Process 3-5 URLs per run for best stability 3. Valid URLs - Ensure URLs are valid SHEIN product pages ending in .html 4. Reasonable Delay - Use 1000ms+ delay between requests 5. Monitor Logs - Watch for strategy indicators to understand performance ## 📊 Output Statistics Each run provides detailed statistics: json { "total": 5, "success": 5, "failed": 0, "captcha_bypasses": 2, "duration_seconds": 65 } ## 🔧 Error Handling | Error | Action | |-------|--------| | Captcha detected | Auto-bypass with 3 strategies | | Page timeout | Skip URL, restart browser, continue | | Browser crash | Auto-restart, continue with next URL | | Network error | Retry with new proxy | ## 📝 Changelog ### v1.2 (Latest) - ⚡ Optimized Chrome startup (~4-6s faster) - ⚡ Reduced captcha bypass times - 🛡️ Added browser crash recovery - 🛡️ Added page load timeouts (60s max) - 📊 Strategy indicators in logs ### v1.1 - Added 3-strategy captcha bypass system - IP rotation with Smartproxy - Improved error handling ### v1.0 - Initial release - Full product data extraction - Premium proxy integration ## 🔒 Compliance - This scraper is designed for legitimate business purposes - Please respect SHEIN's terms of service - Use responsibly and don't overload their servers ## 🤝 Support Need help or have questions? - Open an issue on the actor page - Contact via Apify messaging --- ⭐ If this scraper helps your business, please leave a review!

Common Use Cases

Market Research

Gather competitive intelligence and market data

Lead Generation

Extract contact information for sales outreach

Price Monitoring

Track competitor pricing and product changes

Content Aggregation

Collect and organize content from multiple sources

Ready to Get Started?

Try Shein Product Scraper now on Apify. Free tier available with no credit card required.

Start Free Trial

Actor Information

Developer
ruly_optimism
Pricing
Paid
Total Runs
723
Active Users
3
Apify Platform

Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.

Learn more about Apify

Need Professional Help?

Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.

Find a Specialist

Trusted by millions | Money-back guarantee | 24/7 Support