ZaraHome

ZaraHome

by datasaurus

Extract product data from any Zara Home website worldwide. Scrape by category, single item, or the entire site for price tracking and market analysis.

323 runs
19 users
Try This Actor

Opens on Apify.com

About ZaraHome

Need to pull product data from Zara Home's international sites? This actor handles it. I've used it to gather pricing, descriptions, and images from zarahome.com across all its regional versions, whether you're looking at the Spanish, US, or Japanese store. The beauty is in the flexibility: you can run a full site scrape to get everything, or narrow it down to a specific category like bedding or tableware. If you only need data on a few specific items, you can also point it directly at individual product URLs. It's built to be fast and doesn't get bogged down, which saves on compute costs when you're running larger jobs. I typically use the data for competitor price tracking, building product catalogs, or supply chain analysis. You get structured JSON output that's ready to feed into a database or spreadsheet, which saves hours of manual work. Just configure your target country and whether you want a broad crawl or a focused extraction, and it handles the rest.

What does this actor do?

ZaraHome is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.

Key Features

  • Cloud-based execution - no local setup required
  • Scalable infrastructure for large-scale operations
  • API access for integration with your applications
  • Built-in proxy rotation and anti-blocking measures
  • Scheduled runs and webhooks for automation

How to Use

  1. Click "Try This Actor" to open it on Apify
  2. Create a free Apify account if you don't have one
  3. Configure the input parameters as needed
  4. Run the actor and download your results

Documentation

Zara Home Products Scraper

Overview

This actor scrapes product data from Zara Home's website. It supports all country and language versions of the site and is built to handle the site's structure efficiently. The output is structured JSON data suitable for analysis or integration.

Key Features

  • Comprehensive Data Extraction: Scrapes detailed product information including name, description, price, SKU, colors, sizes, category, composition, sustainability info, images, availability date, promotions, and product URL.
  • Flexible Scraping Targets: You can scrape the entire site, specific category pages, or individual product pages.
  • Multi-URL Support: Process multiple start URLs in a single run, even from different country domains (e.g., .com, .fr, .de).
  • Built-in Deduplication: Returns unique products, minimizing duplicates when the same item appears across multiple categories.
  • Cost-Efficient: Configured to use residential proxies (required to avoid blocking) and optimized for data transfer. Example: scraping ~1000 products costs approximately $0.24 and takes about 5 minutes.
  • Structured Output: Provides both a flat summary of key fields (for easy CSV export) and a detailed nested JSON structure containing all variant data (colors, sizes, images).

How to Use

Configure the actor run via the Apify console or API using the input parameters below.

Input Configuration

The actor accepts the following input fields:

  • startUrls (Required): An array of URLs to scrape. You can provide homepage, category, or direct product URLs.
    • Example: ["https://www.zarahome.com/gb/", "https://www.zarahome.com/de/bedroom-n1007"]
  • maxProductsPerUrl: Maximum number of products to scrape from each provided startUrl. Default is 100.
  • maxCategoriesPerUrl: Maximum number of categories to explore from each provided startUrl. Default is 20.

Output

The actor outputs a dataset of items, where each item represents a unique product. The data is in JSON format. Key fields include:

  • name, description, price
  • colors, sizes, category, mainImage (summary fields)
  • colorsSizesImagesJSON (a detailed JSON object containing all variants, images, and specifics for each color/size combination)
  • sku, composition, sustainability, url, and many more.

For integration into spreadsheets, the top-level colors, sizes, category, and mainImage fields provide a flattened view. For complete variant data, use the nested colorsSizesImagesJSON.

Notes & Known Considerations

  • Output Format: JSON is used due to the nested nature of the product data (multiple colors/sizes per item).
  • Result Count: The final number of items may be lower than maxProductsPerUrl. This happens because:
    • The scraper filters out dummy/empty products returned by the website.
    • The website may list color variations as separate products, but the scraper consolidates them into a single "product bundle" containing all variants.
  • Scraping Challenges: Occasionally, requests may be blocked (resulting in 403 errors). If a run fails or stops prematurely, re-running it usually resolves the issue.
  • Deduplication: While the actor deduplicates products, in highly concurrent scrapes a small number of duplicates may rarely slip through.

Resources

Categories

Common Use Cases

Market Research

Gather competitive intelligence and market data

Lead Generation

Extract contact information for sales outreach

Price Monitoring

Track competitor pricing and product changes

Content Aggregation

Collect and organize content from multiple sources

Ready to Get Started?

Try ZaraHome now on Apify. Free tier available with no credit card required.

Start Free Trial

Actor Information

Developer
datasaurus
Pricing
Paid
Total Runs
323
Active Users
19
Apify Platform

Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.

Learn more about Apify

Need Professional Help?

Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.

Find a Specialist

Trusted by millions | Money-back guarantee | 24/7 Support