Geizhals.de Price Comparison Scraper
by barrierefix
Extract product prices, offer counts, categories, and detailed specifications from Geizhals.de - Germany's #2 price comparison website with 7.2M month...
Opens on Apify.com
About Geizhals.de Price Comparison Scraper
Extract product prices, offer counts, categories, and detailed specifications from Geizhals.de - Germany's #2 price comparison website with 7.2M monthly visits. Perfect for price tracking, competitive intelligence, market research, and automation workflows (n8n, Zapier, Make.com).
What does this actor do?
Geizhals.de Price Comparison Scraper is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.
Key Features
- Cloud-based execution - no local setup required
- Scalable infrastructure for large-scale operations
- API access for integration with your applications
- Built-in proxy rotation and anti-blocking measures
- Scheduled runs and webhooks for automation
How to Use
- Click "Try This Actor" to open it on Apify
- Create a free Apify account if you don't have one
- Configure the input parameters as needed
- Run the actor and download your results
Documentation
Geizhals.de Price Comparison Scraper 🇩🇪 The ONLY dedicated Geizhals.de scraper on Apify Marketplace Extract product prices, offer counts, categories, and detailed specifications from Geizhals.de - Germany's #2 price comparison website with 7.2M monthly visits. Perfect for price tracking, competitive intelligence, market research, and automation workflows (n8n, Zapier, Make.com). ## Features ### 📦 Product Data Extraction - Complete product details (name, ID, URL) - Current lowest prices (EUR) - Offer count (number of shops selling each product) - High-quality product images - Category classification (Grafikkarten, Prozessoren, etc.) - Detailed technical specifications (Speicher, Chipsatz, etc.) ### 🎯 Flexible Search Options - Search Query: Enter keywords like "RTX 4090", "iPhone 15", "Samsung Monitor" - Category URL: Scrape entire Geizhals category pages - Product URLs: Target specific product pages ### 📊 Advanced Filters - Price range filtering (EUR) - Maximum items limit (1-1000) - Price history tracking for scheduled runs ### 🛡️ Anti-Bot Protection - Bypasses Cloudflare protection - Uses German residential proxies - Human behavior simulation (mouse movements, scrolling) - 85-95% success rate ## Use Cases ### 💰 Price Tracking & Monitoring Track GPU, CPU, and component prices for gaming PC builds. Set up automated price alerts and monitor historical price changes. ### 🔍 Competitive Intelligence Monitor competitor pricing strategies, track product availability, analyze merchant ratings, and compare specification offerings. ### 📊 Market Research Identify trending tech products in Germany, analyze product specifications trends, study category performance, and understand price distribution. ### 🤖 Automation Workflows Schedule daily/weekly scrapes, receive price drop alerts via webhooks, integrate with n8n/Zapier/Make.com, and feed data to Google Sheets or databases. ## Input | Field | Type | Required | Description | |-------|------|----------|-------------| | query | String | No | Search query (e.g., "RTX 4090") | | categoryUrl | String | No | Direct Geizhals category URL | | productUrls | Array | No | List of specific product URLs | | maxItems | Number | No | Max products (default: 100, max: 1000) | | priceMin | Number | No | Minimum price in EUR | | priceMax | Number | No | Maximum price in EUR | | trackPriceHistory | Boolean | No | Enable price tracking (default: false) | At least one search method (query, categoryUrl, or productUrls) is required. ## Output Perfect for automation tools like n8n, Zapier, Make.com, Google Sheets, and databases. Flat, automation-friendly JSON structure: json { "productId": "112339", "productName": "ASUS GeForce RTX 4090 24GB GDDR6X", "productUrl": "https://geizhals.de/...", "lowestPrice": 2789.99, "currency": "EUR", "offerCount": 9, "imageUrl": "https://...", "category": "Grafikkarten", "specifications": { "Speicher": "24GB GDDR6X" }, "scrapedAt": "2025-10-24T13:34:54.722Z" } ## Quick Start ### Local Testing bash # Install dependencies npm install # Run with test input npm run dev # Or use Apify CLI apify run --input-file test-input.json ### Example Input (test-input.json) json { "query": "RTX 4090", "maxItems": 10, "priceMin": 1000, "priceMax": 2500 } ### Running on Apify 1. Visit Apify Console 2. Create new actor or import this repository 3. Configure input parameters 4. Run the actor 5. Download results as JSON, CSV, or Excel ## Integration Examples ### n8n Workflow 1. Schedule: Daily at 6 AM 2. Apify → Run Geizhals Scraper (query: "RTX 4090") 3. Filter: Price < €1800 4. Compare: With yesterday's prices 5. Telegram: Send notification on drops > 10% 6. Google Sheets: Log results ### Zapier Automation 1. Schedule: Weekly on Monday 2. Apify → Geizhals Scraper (rating > 4.5) 3. Compare: Amazon pricing 4. Airtable: Update inventory 5. Email: Weekly price report ### Make.com Scenario 1. Webhook: Manual/scheduled trigger 2. Apify → Geizhals Scraper (category: Graphics Cards) 3. Calculate: Price-per-performance 4. Discord: Post top 5 deals 5. PostgreSQL: Archive data ## Technical Details ### Anti-Bot Strategy - Method: Playwright + Crawlee fingerprinting + Stealth plugins - Proxies: German residential proxies (MANDATORY) - Rate Limiting: 10-12 requests/minute - Success Rate: 85-95% - Speed: ~10 products/minute ### Architecture - Base Image: apify/actor-node-playwright-chrome:20 - Language: TypeScript (compiled to CommonJS) - Framework: Crawlee (Apify SDK v3) - Validation: Zod schemas - Browser: Playwright (Chromium) ### Cloudflare Bypass Techniques 1. Crawlee's built-in browser fingerprinting 2. German residential proxies (country code: DE) 3. Session warmup (visit homepage first) 4. Human behavior simulation (ghost-cursor) 5. Natural timing patterns (normal distribution) 6. German locale (de-DE) headers ## Pricing This actor uses pay-per-event pricing - you only pay for products successfully scraped. See current pricing in the Apify Console when starting a run. ## Limitations ### Robots.txt Compliance Geizhals.de actively blocks crawlers via robots.txt. This scraper uses residential proxies and human behavior simulation to bypass restrictions ethically while respecting rate limits. ### Rate Limits - Maximum 10-12 products/minute to avoid detection - Sequential processing (no parallel requests) - Natural delays between actions ### Coverage - Supported: Geizhals.de (Germany) - Coming soon: Geizhals.at (Austria) - Not supported: EU-wide (other countries) ### Best For - Electronics, computers, gaming hardware - Tech products with detailed specs - Not ideal for fashion, furniture, or non-tech categories ## Troubleshooting ### Issue: Empty Results Possible causes: 1. Cloudflare challenge triggered 2. Incorrect search query 3. All products filtered out Solutions: - Check storage/key_value_stores/default/search-page.html for debugging - Look for cloudflare-blocked.html to confirm blocking - Adjust filters (minRating, priceMin/Max) - Try different search query ### Issue: "Cloudflare challenge detected" Solutions: - German residential proxies are MANDATORY (automatically configured) - Reduce maxItems to lower detection risk - Add longer delays between runs - Contact support if persists ### Issue: Selector Not Found Cause: Geizhals.de changed their HTML structure Solution: - Check search-page.html in storage - Report issue to kontakt@barrierefix.de - We update selectors within 24 hours ## Development ### Project Structure geizhals-scraper-pro/ ├── src/ │ ├── main.ts # Entry point & scraper logic │ ├── types.ts # TypeScript interfaces + Zod schemas │ └── human-behavior.ts # Anti-bot behavior simulation ├── .actor/ │ ├── actor.json # Apify metadata │ ├── INPUT_SCHEMA.json # Input form definition │ └── DATASET_SCHEMA.json # Output structure ├── Dockerfile # Playwright-enabled container ├── package.json # Dependencies ├── tsconfig.json # TypeScript config └── test-input.json # Example input ### Building Locally bash npm install # Install dependencies + build npm run build # Compile TypeScript npm start # Run compiled code npm run dev # Build + run with Apify CLI ### Testing bash # Test with sample input npm run dev # Test specific query echo '{"query": "RTX 4090", "maxItems": 5}' > INPUT.json apify run --input-file INPUT.json # Check output cat storage/datasets/default/*.json ### Deploying bash # Login to Apify apify login # Push to Apify apify push # Or via Git integration git push origin main ## Roadmap Phase 2 - Coming Soon: - [ ] Historical price tracking dashboard - [ ] Price drop webhook notifications - [ ] Multi-merchant comparison - [ ] Specification normalization - [ ] Geizhals.at support (Austria) - [ ] Bulk product monitoring - [ ] Price alert threshold configuration ## Support - Email: kontakt@barrierefix.de - Response Time: < 24 hours - Bug Reports: Via Apify Support or email - Feature Requests: Email with use case description ## Legal Disclaimer This actor scrapes publicly available data from Geizhals.de. Use responsibly and in compliance with: - German data protection laws (GDPR) - Geizhals.de Terms of Service - Apify Terms of Service Important: - Do not republish copyrighted content (images, descriptions) without permission - Respect rate limits to avoid overloading Geizhals servers - Use for personal/commercial research, not to replicate Geizhals business Barrierefix is not affiliated with Geizhals GmbH. ## License MIT License - See LICENSE file for details ## About Built by Barrierefix - Professional web scraping solutions for German market. --- ## 🔗 Explore More of Our Actors ### 📊 Price Comparison (German Market) | Actor | Description | |-------|-------------| | Billiger.de Scraper | Extract prices from Germany's largest price comparison platform | | Testberichte Scraper Pro | Extract product reviews and ratings from Testberichte.de | | Chip.de Bestenlisten Scraper | Scrape Chip.de best-of lists for tech product rankings | ### 🚗 Automotive (German Market) | Actor | Description | |-------|-------------| | German Car Scraper | Scrape AutoScout24 and Mobile.de for vehicle listings | | eBay Kleinanzeigen Scraper | Extract classified ads from eBay Kleinanzeigen | ### 🍳 German Content & Lifestyle | Actor | Description | |-------|-------------| | Chefkoch Scraper | Extract recipes and cooking instructions from Chefkoch.de | | Gutefrage Scraper | Scrape Q&A content from Gutefrage.net for market research | ### 🛒 E-commerce | Actor | Description | |-------|-------------| | Shopify Scraper Pro | Extract complete Shopify product data with variants and sales estimates | | eBay Scraper (PPR) | Extract eBay products with seller analytics and engagement metrics | --- Last Updated: October 2025 Version: 1.0.0 Maintenance Status: ✅ Actively Maintained ## Legal Disclaimer / Rechtlicher Hinweis EN: This actor is a general-purpose tool for analyzing publicly accessible web data. The user bears sole responsibility for ensuring their specific use complies with: - Applicable laws (GDPR/DSGVO, copyright law) - The target website's Terms of Service - Apify's Terms of Service The provider (barrierefix) expressly disclaims liability for any unauthorized or unlawful use. By using this actor, the user agrees to indemnify the provider against any third-party claims arising from their use of the data. DE: Dieser Actor ist ein allgemeines Werkzeug zur Analyse öffentlich zugänglicher Webdaten. Der Nutzer trägt die alleinige Verantwortung dafür, dass seine spezifische Nutzung den geltenden Gesetzen (DSGVO, Urheberrecht), den Nutzungsbedingungen der Zielwebsite und den Apify-Nutzungsbedingungen entspricht. Der Anbieter (barrierefix) schließt jegliche Haftung für unbefugte oder rechtswidrige Nutzung ausdrücklich aus. Mit der Nutzung dieses Actors erklärt sich der Nutzer bereit, den Anbieter von allen Ansprüchen Dritter freizustellen, die aus seiner Datennutzung entstehen. --- This tool is not affiliated with Geizhals. All trademarks belong to their respective owners.
Categories
Common Use Cases
Market Research
Gather competitive intelligence and market data
Lead Generation
Extract contact information for sales outreach
Price Monitoring
Track competitor pricing and product changes
Content Aggregation
Collect and organize content from multiple sources
Ready to Get Started?
Try Geizhals.de Price Comparison Scraper now on Apify. Free tier available with no credit card required.
Start Free TrialActor Information
- Developer
- barrierefix
- Pricing
- Paid
- Total Runs
- 38
- Active Users
- 3
Related Actors
Google Maps Reviews Scraper
by compass
Facebook Ads Scraper
by apify
Google Ads Scraper
by silva95gustavo
Facebook marketplace scraper
by curious_coder
Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.
Learn more about ApifyNeed Professional Help?
Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.
Trusted by millions | Money-back guarantee | 24/7 Support