Amazon Price Scraper

Amazon Price Scraper

by wilico

Extract product data from Amazon. Scrapes prices, availability, and product details without using the Amazon API.

61 runs
5 users
Try This Actor

Opens on Apify.com

About Amazon Price Scraper

Extract product data from Amazon. Scrapes prices, availability, and product details without using the Amazon API.

What does this actor do?

Amazon Price Scraper is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.

Key Features

  • Cloud-based execution - no local setup required
  • Scalable infrastructure for large-scale operations
  • API access for integration with your applications
  • Built-in proxy rotation and anti-blocking measures
  • Scheduled runs and webhooks for automation

How to Use

  1. Click "Try This Actor" to open it on Apify
  2. Create a free Apify account if you don't have one
  3. Configure the input parameters as needed
  4. Run the actor and download your results

Documentation

Amazon Price Scraper ## What does Amazon Price Scraper do? This Amazon Price Scraper extracts comprehensive product data from Amazon Japan at scale. With this powerful tool, you can get: - Product information: titles, images, ASINs, and detailed metadata - Pricing data: current prices, price ranges, currency information - Availability status: real-time stock information - Review metrics: average ratings and review counts - Promotional data: coupon information and sale status ## Features - Amazon Japan support: Supports Amazon Japan (amazon.co.jp) product pages (other Amazon domains will be supported in the future) - Automatic proxy configuration: Proxy is automatically configured based on the URL's country - Rate limiting: Configurable request delays to avoid blocking - Concurrent processing: Adjustable concurrency for optimal performance - Proxy support: Uses proxy for better reliability and IP rotation ## Input Parameters ### Required Parameters - amazonUrls (array, required): List of Amazon Japan product URLs to scrape - Currently only amazon.co.jp domain is supported (other Amazon domains will be supported in the future) - Example: ["https://www.amazon.co.jp/dp/B0CWGS19RG"] - Maximum: 1000 URLs - Pattern: Must match ^https?://(www\.)?amazon\.co\.jp/.*$ ### Optional Parameters - maxConcurrency (integer, optional): Number of concurrent requests - Default: 5 - Range: 1 - 20 - Higher values increase speed but may increase the risk of blocking - requestDelay (integer, optional): Delay between requests in milliseconds - Default: 1000 (1 second) - Range: 1000 - 10000 - Recommendation: Use 1000ms or more to avoid blocking ## Output Data Structure Each scraped product is saved to the Dataset with the following structure. Here's an example of the data you'll receive: json { "url": "https://www.amazon.co.jp/dp/B0CWGS19RG", "status": "success", "errorMessage": null, "asin": "B0CWGS19RG", "title": "Product Title", "image": "https://m.media-amazon.com/images/...", "current_price": 2999, "price_range": null, "currency": "JPY", "availability": "In Stock", "review_average": 4.5, "review_count": 1234, "coupon": { "amount": 100, "rate": 5, "expiry": "2024-12-31" }, "in_sale": false, "scrapedAt": "2024-01-15T10:30:00.000Z" } ### Field Descriptions - url: Original product URL - status: Processing status - "success" (scraped successfully), "not_found" (404 error), or "error" (other errors) - errorMessage: Error message if status is not "success", otherwise null - asin: Amazon Standard Identification Number - title: Product title/name - image: Product image URL - current_price: Current price as a number (null if not available) - price_range: Price range object for products with variable pricing, contains min and max properties (null if not applicable) - currency: Currency code (e.g., JPY) - availability: Stock availability status (e.g., "In Stock", "Out of Stock") - review_average: Average review rating (0-5, null if not available) - review_count: Total number of reviews (null if not available) - coupon: Coupon information object containing amount (discount amount), rate (discount rate %), and expiry (expiry date), or null if no coupon - in_sale: Boolean indicating if the product is currently on sale - scrapedAt: ISO 8601 timestamp of when the data was scraped ## Use Cases You can use the extracted Amazon data for various purposes: - Price tracking: Monitor price changes and identify the optimal time to purchase - Market research: Analyze competitor pricing and inventory status - Price comparison services: Build price comparison tools with multi-product data - Trend analysis: Track review ratings and trends over time - Stock alerts: Automatically detect when out-of-stock products become available - E-commerce platforms: Build product databases for your own marketplace ## How to Use ### Get Started in 5 Steps 1. Create a free Apify account (free trial available) 2. Open this Actor in Apify Console 3. Add Amazon product URLs (amazon.co.jp domain only) 4. Click "Start" to begin data extraction 5. Download results in JSON, CSV, Excel, or other formats ### Detailed Instructions 1. Open the Actor in Apify Console 2. Configure Input Parameters: - Add Amazon Japan product URLs to the amazonUrls field (amazon.co.jp domain only) - Optionally adjust maxConcurrency and requestDelay for performance tuning 3. Start the Actor: - Click the "Start" button - Monitor the progress in the run console 4. Download Results: - Once completed, access the results from the Dataset tab - Export data in JSON, CSV, Excel, or other formats ### Input Example Here's an example of how to configure the input parameters. You can specify multiple Amazon product URLs and adjust performance settings: json { "amazonUrls": [ "https://www.amazon.co.jp/dp/B0CWGS19RG", "https://www.amazon.co.jp/dp/B0DHG23KQ6", "https://www.amazon.co.jp/dp/B08SVY9V5D" ], "maxConcurrency": 3, "requestDelay": 2000 } ## Configuration Tips ### Performance Optimization - For speed: Increase maxConcurrency (up to 20) and decrease requestDelay (minimum 1000ms) - For reliability: Decrease maxConcurrency (1-3) and increase requestDelay (2000-5000ms) - Note: maxRequestsPerCrawl is automatically calculated based on the number of URLs (URL count × 3) to account for retries, captcha handling, and redirects ### Avoiding Blocks - Always use requestDelay of at least 1000ms - Keep maxConcurrency moderate (3-5) for better success rates - Proxy is automatically configured based on the URL's country for better IP rotation - Consider using residential proxies for high-volume scraping ## Troubleshooting ### Common Issues Issue: Actor returns empty results or status is "error" or "not_found" - Solution: Check if URLs are valid Amazon Japan (amazon.co.jp) product URLs - Solution: Verify that the product pages are accessible - Solution: Check the status field in the output - if it's "not_found", the product page may have been removed - Solution: If status is "error", check the errorMessage field for details - Solution: Check if CAPTCHA detection is triggering (check logs) Issue: Rate limiting / Blocking - Solution: Increase requestDelay to 2000ms or more - Solution: Decrease maxConcurrency to 1-3 - Solution: Ensure Apify Proxy is enabled Issue: Missing price data - Solution: Some products may not have price information (e.g., pre-orders, unavailable items) - Solution: Check the availability field in the output ## FAQ ### How much does it cost to use this Actor? This Actor uses a pay-per-event pricing model. The cost is calculated based on the number of successfully scraped products. Each successfully scraped product counts as one event, and you are charged per event. Apify Store pricing discounts use a tiered approach, where higher subscription plans unlock lower Actor costs. See our subscription plans for more details. | Events | Free | Starter | Scale | Business | | ------------------- | ------------------------------ | ---------------------------------- | ---------------------------------- | -------------------------------- | | Product_scraped | $3.50 / 1,000
(No discount) | $3.50 / 1,000
(Bronze discount) | $3.20 / 1,000
(Silver discount) | $3.00 / 1,000
(Gold discount) | ### How can I use this Actor with the Apify API? The Apify API provides programmatic access to run this Actor. You can use: - Node.js: Use the apify-client NPM package - Python: Use the apify-client PyPI package For full details, check the Apify API documentation or click the API tab for code examples. ### Is it legal to scrape Amazon? Web scraping is generally legal when it involves publicly available information, and this Actor only extracts publicly available product information. It does not collect any private user data such as email addresses or personal information. Important considerations: - Amazon's Terms of Service: Amazon's Terms of Service typically apply when you create an account and agree to them. Since this Actor accesses publicly available product pages without requiring an Amazon account, the Terms of Service restrictions may not apply to this type of automated data collection. - Public information: This Actor only accesses publicly available product information that anyone can view without logging in, which is generally considered legal to scrape. - Regional laws: Data protection laws (such as GDPR in the European Union) may apply depending on your location and how you use the data. - Responsible use: Ensure you use the scraped data responsibly and in compliance with applicable laws. Disclaimer: While accessing publicly available information without an account is generally considered legal, you are responsible for ensuring your use complies with applicable laws. If you're unsure whether your use case is legal, consult your legal counsel. For more information, read our blog post on the legality of web scraping. ### What should I do if I'm getting blocked? If you're experiencing blocking issues: - Increase requestDelay to 2000ms or higher - Decrease maxConcurrency to 1-3 See the Troubleshooting section for more details. ### Can I scrape Amazon domains other than amazon.co.jp? Currently, only Amazon Japan (amazon.co.jp) is supported. Support for other Amazon domains (e.g., amazon.com, amazon.de) will be added in the future. ## Support If you encounter any issues or have questions: - Check the Troubleshooting section above - Review the Apify documentation - Contact support through Apify Console ## Your Feedback We're always working on improving the performance of this Actor. If you have any technical feedback or found a bug, please create an issue on the Actor's Issues tab in Apify Console. Your feedback helps us make this Actor better for everyone! ## License ISC

Common Use Cases

Market Research

Gather competitive intelligence and market data

Lead Generation

Extract contact information for sales outreach

Price Monitoring

Track competitor pricing and product changes

Content Aggregation

Collect and organize content from multiple sources

Ready to Get Started?

Try Amazon Price Scraper now on Apify. Free tier available with no credit card required.

Start Free Trial

Actor Information

Developer
wilico
Pricing
Paid
Total Runs
61
Active Users
5
Apify Platform

Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.

Learn more about Apify

Need Professional Help?

Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.

Find a Specialist

Trusted by millions | Money-back guarantee | 24/7 Support