Sephora Scraper
by autofacts
Sephora scraper to fetch product or products of category.
Opens on Apify.com
About Sephora Scraper
Sephora scraper to fetch product or products of category.
What does this actor do?
Sephora Scraper is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.
Key Features
- Cloud-based execution - no local setup required
- Scalable infrastructure for large-scale operations
- API access for integration with your applications
- Built-in proxy rotation and anti-blocking measures
- Scheduled runs and webhooks for automation
How to Use
- Click "Try This Actor" to open it on Apify
- Create a free Apify account if you don't have one
- Configure the input parameters as needed
- Run the actor and download your results
Documentation
Sephora Scraper Sephora Scraper is a professional-grade Apify actor designed to extract high-fidelity product data from Sephora's international stores. Built for reliability and scale, it intelligently handles anti-scraping measures, manages sessions, and delivers detailed structured data for your e-commerce analytics. --- ## 🚀 Features - Deep Product Extraction: Retrieves detailed product specifications, including: - Multiple variants (shades, sizes) with individual pricing and stock status. - High-resolution image galleries and media assets. - Rich HTML descriptions, ingredients, and "how to use" guides. - Review counts, ratings, and "loves" metrics. - Smart URL Parsing: Automatically converts standard web URLs (e.g., www.sephora.com/shop/...) into efficient API calls. - Multi-Region Support: Seamlessly supports en-US, en-CA, and fr-CA stores based on input URLs. - Anti-Blocking Architecture: - Built-in Apify Proxy integration with session rotation. - Smart retries (up to 5 times) for failed requests. - Mimics real-user behavior to bypass security checks. - Scalable Performance: Configurable concurrency to balance speed and stability. --- ## 📋 Input Parameters The actor accepts a JSON object with the following configuration: | Field | Type | Description | Default | | :--- | :--- | :--- | :--- | | startUrls | Array | Required. A list of Sephora URLs to scrape. Supports:
• Category Pages: https://www.sephora.com/shop/...
• Product Pages: https://www.sephora.com/product/...
• International URLs: sephora.com/ca/en/... | [] | | proxy | Object | Proxy configuration. Residential proxies are highly recommended for Sephora. | { "useApifyProxy": true } | | maxConcurrency | Integer | Maximum number of parallel requests. Reduce this if you encounter blocking. | 5 | | maxRequestsPerCrawl | Integer | Limit the total number of requests. Set to 0 for unlimited crawling. | 0 | ### Input Example json { "startUrls": [ { "url": "https://www.sephora.com/shop/clean-makeup" }, { "url": "https://www.sephora.com/product/glossier-cloud-paint-gel-cream-blush-P468600" } ], "proxy": { "useApifyProxy": true, "apifyProxyGroups": ["RESIDENTIAL"] }, "maxConcurrency": 2, "maxRequestsPerCrawl": 100 } --- ## 📦 Output Data Data is stored in the default dataset in JSON format. Each item represents a unique product. ### Data Schema | Field | Type | Description | | :--- | :--- | :--- | | source | Object | Metadata about the crawl (ID, URL, Retailer, Currency). | | brand | String | Product brand name. | | title | String | Product title. | | description | String | Full HTML description. | | variants | Array | List of all SKUs/variants for this product. | | price | Object | Pricing details inside each variant (current, original, stock). | | medias | Array | All images and media associated with the product. | | stats | Object | Review count, rating, and loves count. | View Sample Output JSON
json { "source": { "id": "P420440", "crawlUrl": "https://www.sephora.com/product/make-no-mistake-foundation-concealer-stick-P420440", "canonicalUrl": "https://www.sephora.com/product/make-no-mistake-foundation-concealer-stick-P420440", "retailer": "SEPHORA", "currency": "USD" }, "brand": "SEPHORA COLLECTION", "title": "Make No Mistake Foundation & Concealer Stick", "description": "<b>Coverage:</b><br>✔ Medium<br>...", "categories": ["Makeup", "Face", "Foundation"], "variants": [ { "id": "1887405", "sku": "1887405", "price": { "current": 8.00, "original": 20.00, "stockStatus": "OutOfStock" }, "options": ["3 Beechwood", "0.41 oz/ 11.6 g"] } ], "medias": [ { "type": "Image", "url": "https://www.sephora.com/productimages/sku/s1887405-main-zoom.jpg", "variantIds": ["1887405"] } ], "stats": { "reviewCount": 749, "rating": 3.93, "lovesCount": 61136 } } --- ## 🔧 Advanced Configuration ### Proxy Configuration Sephora employs strict anti-scraping measures. - Residential Proxies: Mandatory for consistent success. Datacenter proxies are often blocked. - Session Persistence: The scraper uses intelligent session rotation (cookies & headers) to maintain access. ### Performance Tuning - Default Concurrency: Set to 5 to be safe. - Scaling Up: If using high-quality residential proxies, you can increase maxConcurrency to 10-20 for faster speeds. - Debugging: Set maxRequestsPerCrawl to a low number (e.g., 10) to verify your configuration before a full run. --- ## ❓ FAQ & Troubleshooting Q: I'm getting 403 Forbidden errors. A: This usually means your IP is blocked. Ensure you are using Residential Proxies. If the issue persists, try reducing concurrency. Q: Can I scrape reviews? A: Currently, this actor fetches review stats (count, average rating) but not individual review text. Review text extraction is on the roadmap. Q: Does it support other countries? A: Yes! The scraper automatically detects the region from your input URL (e.g., sephora.com/ca). --- ## TODO / Roadmap - [ ] Video Extraction: Fetch product video URLs. - [ ] Reviews: Extract full text of user reviews. - [ ] Ingredients Analysis: Structured breakdown of ingredients.
View Sample Output JSON
json { "source": { "id": "P420440", "crawlUrl": "https://www.sephora.com/product/make-no-mistake-foundation-concealer-stick-P420440", "canonicalUrl": "https://www.sephora.com/product/make-no-mistake-foundation-concealer-stick-P420440", "retailer": "SEPHORA", "currency": "USD" }, "brand": "SEPHORA COLLECTION", "title": "Make No Mistake Foundation & Concealer Stick", "description": "<b>Coverage:</b><br>✔ Medium<br>...", "categories": ["Makeup", "Face", "Foundation"], "variants": [ { "id": "1887405", "sku": "1887405", "price": { "current": 8.00, "original": 20.00, "stockStatus": "OutOfStock" }, "options": ["3 Beechwood", "0.41 oz/ 11.6 g"] } ], "medias": [ { "type": "Image", "url": "https://www.sephora.com/productimages/sku/s1887405-main-zoom.jpg", "variantIds": ["1887405"] } ], "stats": { "reviewCount": 749, "rating": 3.93, "lovesCount": 61136 } } Categories
Common Use Cases
Market Research
Gather competitive intelligence and market data
Lead Generation
Extract contact information for sales outreach
Price Monitoring
Track competitor pricing and product changes
Content Aggregation
Collect and organize content from multiple sources
Ready to Get Started?
Try Sephora Scraper now on Apify. Free tier available with no credit card required.
Start Free TrialActor Information
- Developer
- autofacts
- Pricing
- Paid
- Total Runs
- 22,200
- Active Users
- 262
Related Actors
Web Scraper
by apify
Cheerio Scraper
by apify
Website Content Crawler
by apify
Legacy PhantomJS Crawler
by apify
Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.
Learn more about ApifyNeed Professional Help?
Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.
Trusted by millions | Money-back guarantee | 24/7 Support