Ichard Scraper

Name: Ichard Scraper
Author: corent1robert

by corent1robert

Extract comprehensive product data from Ichard.fr e-commerce website. Scrapes sitemap XML to discover all product URLs, then extracts detailed informa...

3 runs

2 users

Try This Actor

Opens on Apify.com

About Ichard Scraper

Extract comprehensive product data from Ichard.fr e-commerce website. Scrapes sitemap XML to discover all product URLs, then extracts detailed information including titles, prices, references, descriptions, categories, brands, compatible vehicle models, availability status, and product images.

What does this actor do?

Ichard Scraper is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.

Key Features

Cloud-based execution - no local setup required
Scalable infrastructure for large-scale operations
API access for integration with your applications
Built-in proxy rotation and anti-blocking measures
Scheduled runs and webhooks for automation

How to Use

Click "Try This Actor" to open it on Apify
Create a free Apify account if you don't have one
Configure the input parameters as needed
Run the actor and download your results

Documentation

Ichard Scraper - Extract Product Data from Ichard.fr Extract comprehensive product data from Ichard.fr e-commerce website. This Actor scrapes the Ichard website to provide you with a complete database of automotive parts and accessories including prices, descriptions, references, compatibility information, and availability status. ## What does Ichard Scraper do? The Ichard Scraper extracts comprehensive data from the Ichard e-commerce website, covering all products available in their catalog. The Actor automatically scrapes the sitemap XML to discover all product URLs, then extracts detailed product information including titles, prices, references, descriptions, categories, brands, compatible vehicle models, availability status, and product images. Simply provide the sitemap URL (or use the default) and the Actor will extract all product data with optimized parallel processing. ## What can this Ichard Scraper do? ### 🚀 Key Features - Complete Product Coverage: Scrapes all products from the sitemap XML automatically - Comprehensive Data Extraction: Extracts all product details including prices, references, descriptions, categories, brands, models, availability, and images - Sitemap-Based Discovery: Automatically discovers all product URLs from the sitemap - no manual URL input required - High Success Rate: Accurate extraction of product information with automatic retry mechanism - Optimized Performance: Processes multiple products in parallel (configurable concurrency) for maximum speed - Complete Product Data: Extracts prices, references, descriptions, categories, brands, compatible vehicle models, availability status, and product images - Structured Output: Clean, normalized data ready for immediate use in CSV, JSON, Excel, or HTML formats ### 🎯 Platform Advantages Your Actor + the Apify platform. They come as a package. This scraper benefits from: - Monitoring & Logs: Real-time execution monitoring with detailed logs to track scraping progress - API Access: Access your data programmatically via Apify API for seamless integration - Scheduling: Set up automated runs on a schedule to keep your product database up-to-date - Integrations: Connect to Make.com, Zapier, Google Sheets, and more for automated workflows - Proxy Rotation: Automatic proxy management for reliable scraping without IP blocking - Scalability: Handle large-scale scraping with cloud infrastructure that scales automatically - Data Storage: Secure dataset storage with multiple export formats (JSON, CSV, Excel, HTML) ## What data can Ichard Scraper extract? The Actor extracts comprehensive data from Ichard products. Here's what you can extract: | Data Category | Fields Extracted | Description | |--------------|------------------|-------------| | Product Information | `url`, `title`, `price`, `priceTaxExcl`, `priceWithoutReduction`, `reference`, `description`, `descriptionFull`, `category`, `categoryName`, `productCategory`, `brand`, `models`, `availability`, `availabilityMessage`, `imageUrl`, `imageUrls`, `quantity`, `idProduct`, `taxRate`, `taxName`, `metaTitle`, `metaDescription`, `lastModified` | Complete product details and information | ### Detailed Field Description #### Product Information - url: Full URL of the product page on Ichard.fr - title: Product title/name (e.g., "Bouchon essence chromé") - price: Product price including tax (TTC) as numeric value (e.g., "26.00") - priceTaxExcl: Product price excluding tax (HT) as numeric value (e.g., "21.67") - priceWithoutReduction: Original price before any discount - reference: Product reference/SKU code (e.g., "5605") - description: Short product description (first paragraph) - descriptionFull: Complete product description with all paragraphs - category: Product category extracted from breadcrumb (e.g., "4CV") - categoryName: Product category name from product data (e.g., "4CV") - productCategory: Product category from features (e.g., "Accessoires - Carrosserie") - brand: Product brand/manufacturer (e.g., "Renault") - models: Compatible vehicle models as comma-separated list (e.g., "4CV, Dauphine, Floride - Caravelle, Renault 8, Renault 10") - availability: Stock availability status code (e.g., "available") - availabilityMessage: Human-readable availability message (e.g., "En stock") - imageUrl: Full URL of the main product image - imageUrls: Comma-separated list of all product image URLs - quantity: Available stock quantity - idProduct: Internal product ID - taxRate: Tax rate percentage (e.g., "20" for 20%) - taxName: Tax name (e.g., "TVA FR 20%") - metaTitle: SEO meta title - metaDescription: SEO meta description - lastModified: Last modification date from sitemap (ISO format) ## How to scrape Ichard.fr products? ### Step-by-Step Tutorial 1. Configure Input: Click on the Input tab and optionally adjust: - `maxConcurrency`: Number of parallel requests (default: 10) - `requestTimeout`: Request timeout in milliseconds (default: 30000) - `sitemapUrl`: Sitemap URL (default: https://ichard.fr/1_fr_0_sitemap.xml) 2. Run the Actor: Click Start to begin scraping 3. Monitor Progress: Watch the logs to see real-time progress and statistics 4. Download Results: Once complete, download your data from the Dataset tab in JSON, CSV, Excel, or HTML format The Actor automatically: - Fetches the sitemap XML from Ichard.fr - Filters product URLs (format: `/category/id-title.html`) - Scrapes each product page in parallel - Extracts all product information - Saves results to the dataset ## How much will it cost to scrape Ichard.fr? Scraping Ichard.fr products is priced based on Compute Units (CUs) consumed during the Actor run. The cost depends on: - Number of products: Each product page requires one request - Concurrency level: Higher concurrency processes more products simultaneously but uses more resources - Request timeouts: Longer timeouts may consume more CUs if pages load slowly Estimated costs: - Free plan: Test with a small number of products - Starter plan: Scrape hundreds of products efficiently - Professional plan: Handle thousands of products with optimal performance The Actor is optimized to minimize CU consumption by using efficient Cheerio parsing (no browser overhead) and configurable concurrency. Most runs will complete in a few minutes depending on the total number of products in the catalog. ## Input Ichard Scraper has the following input options. Click on the Input tab for more information: - maxConcurrency (optional): Maximum number of concurrent requests. Default: `10`. Increase for faster scraping, decrease to reduce server load. - requestTimeout (optional): Request timeout in milliseconds. Default: `30000` (30 seconds). - sitemapUrl (optional): URL of the sitemap XML. Default: `https://ichard.fr/1_fr_0_sitemap.xml`. ## Output You can download the dataset extracted by Ichard Scraper in various formats such as JSON, HTML, CSV, or Excel. ### Output Example json { "url": "https://ichard.fr/4cv/13-bouchon-essence-chrome-renault-4cv-dauphine-r8-r10-floride-caravelle.html", "title": "Bouchon essence chromé", "price": "26.00", "priceTaxExcl": "21.67", "priceWithoutReduction": "26.004", "reference": "5605", "description": "Bouchon de réservoir d'essence chromé pour Renault 4CV - Dauphine R8, R10 - Floride et Caravelle", "descriptionFull": "Bouchon de réservoir d'essence chromé pour Renault 4CV - Dauphine R8, R10 - Floride et Caravelle", "category": "4CV", "categoryName": "4CV", "productCategory": "Accessoires - Carrosserie", "brand": "Renault", "models": "4CV, Dauphine, Floride - Caravelle, Renault 8, Renault 10", "availability": "available", "availabilityMessage": "En stock", "imageUrl": "https://ichard.fr/981-large_default/bouchon-essence-chrome-renault-4cv-dauphine-r8-r10-floride-caravelle.jpg", "imageUrls": "https://ichard.fr/981-large_default/bouchon-essence-chrome-renault-4cv-dauphine-r8-r10-floride-caravelle.jpg", "quantity": "14", "idProduct": "13", "taxRate": "20", "taxName": "TVA FR 20%", "metaTitle": "Bouchon essence chromé Renault 4CV Dauphine r8 r10 Floride Caravelle - Ichard", "metaDescription": "", "lastModified": "2023-08-22T11:27:22+02:00" } ## Tips for Best Results ### Performance Optimization - Adjust Concurrency: Increase `maxConcurrency` (up to 50) for faster scraping if the website can handle it - Timeout Settings: Use default timeout (30000ms) for most cases. Increase only if experiencing timeout errors - Monitor Progress: Check logs regularly to ensure smooth operation ### Data Quality - Complete Extraction: The Actor extracts all available fields. Some products may have missing fields if not available on the page - Price Format: Prices are extracted as numeric values (e.g., "26.00") for easy analysis - Model Compatibility: Compatible vehicle models are extracted and deduplicated automatically ## Is it legal to scrape Ichard.fr? Our scrapers are ethical and do not extract any private user data, such as email addresses, personal information, or payment details. They only extract publicly available product information that is displayed on the website. We therefore believe that our scrapers, when used for ethical purposes by Apify users, are safe. However, you should be aware that: - Product data is publicly available information - The scraper respects robots.txt and rate limits - You should comply with Ichard.fr's terms of service - Personal data protection regulations (GDPR, etc.) may apply depending on your use case If you're unsure whether your use case is legitimate, consult your lawyers. You can also read our blog post on the legality of web scraping. ## FAQ ### How many products can I scrape? The Actor can scrape all products available in the Ichard.fr sitemap. The exact number depends on the current catalog size. The Actor will automatically discover and process all product URLs from the sitemap. ### What if a product page fails to load? The Actor includes automatic retry mechanism (up to 2 retries) for failed requests. If a page still fails after retries, the Actor continues processing other products and logs the error. ### Can I scrape specific categories only? Currently, the Actor scrapes all products from the sitemap. You can filter the results after extraction by category using the dataset filters or by processing the exported data. ### How often should I run this Actor? It depends on your needs. For price monitoring, you might want to run it daily or weekly. For one-time catalog extraction, a single run is sufficient. ### Does the Actor support incremental updates? The Actor includes `lastModified` field from the sitemap, which you can use to identify recently updated products. You can filter results based on this field to get only new or updated products. ### Can I get support or request features? Yes! If you encounter issues or have feature requests, please use the Issues tab on the Actor page. We're open to feedback and continuously improving the Actor based on user needs. ### Need a custom solution? If you need a customized version of this Actor for specific requirements, feel free to contact us through the Actor page. We can create tailored solutions based on your needs. ## Technical Details ### Extraction Process 1. Sitemap Scraping: The Actor first scrapes the sitemap XML to extract all product URLs 2. URL Filtering: Automatically filters URLs to keep only product pages (format: `/category/id-title.html`) 3. Product Page Scraping: For each product, the Actor visits the product page to extract detailed information 4. Data Extraction: Uses Cheerio for efficient HTML parsing and extracts data from structured HTML elements and JSON-LD 5. Parallel Processing: Processes multiple products simultaneously with configurable concurrency for optimal performance ### Error Handling - Automatic retry mechanism for failed requests (up to 2 retries) - Timeout handling for slow-loading pages - Graceful error handling that continues processing other products - Fallback extraction methods for missing data fields ## Limitations - The scraper depends on the structure of the Ichard website. If the website structure changes significantly, the Actor may need updates - Some products may not have all information fields available (e.g., missing description or models) - Rate limiting may apply if scraping too aggressively - use default concurrency settings for best results ## Resources - Ichard.fr Website - Apify Platform Documentation - Web Scraping Basics --- Need help? Check the Issues tab for common problems and solutions, or contact support through the Actor page.

Common Use Cases

Market Research

Gather competitive intelligence and market data

Lead Generation

Extract contact information for sales outreach

Price Monitoring

Track competitor pricing and product changes

Content Aggregation

Collect and organize content from multiple sources

Ready to Get Started?

Try Ichard Scraper now on Apify. Free tier available with no credit card required.

Start Free Trial

Actor Information

Developer: corent1robert
Pricing: Paid
Total Runs: 3
Active Users: 2

Related Actors

Google Maps Reviews Scraper

by compass

Facebook Ads Scraper

by apify

Google Ads Scraper

by silva95gustavo

Facebook marketplace scraper

by curious_coder

Browse All Actors

Apify Platform

Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.

Learn more about Apify

Need Professional Help?

Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.

Find a Specialist

Trusted by millions | Money-back guarantee | 24/7 Support