Amazon Reviews Stats Scraper

Amazon Reviews Stats Scraper

by lalala0095

Extracts review stats from Amazon product pages using Playwright. Get overall rating, total number of ratings, and star distribution (1★–5★). Supports...

177 runs
13 users
Try This Actor

Opens on Apify.com

About Amazon Reviews Stats Scraper

Extracts review stats from Amazon product pages using Playwright. Get overall rating, total number of ratings, and star distribution (1★–5★). Supports multiple URLs, all Amazon domains, and includes robust fallback selectors.

What does this actor do?

Amazon Reviews Stats Scraper is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.

Key Features

  • Cloud-based execution - no local setup required
  • Scalable infrastructure for large-scale operations
  • API access for integration with your applications
  • Built-in proxy rotation and anti-blocking measures
  • Scheduled runs and webhooks for automation

How to Use

  1. Click "Try This Actor" to open it on Apify
  2. Create a free Apify account if you don't have one
  3. Configure the input parameters as needed
  4. Run the actor and download your results

Documentation

Amazon Review Scraper A lightweight and efficient Apify Actor that extracts basic review statistics from Amazon product pages using Playwright and Python. ## Features ✅ Simple & Fast - Focuses only on essential review metrics ✅ Multi-URL Support - Process multiple products in a single run ✅ Cross-Region Support - Works with Amazon domains worldwide ✅ Reliable Extraction - Uses multiple fallback selectors for robust data extraction ## What It Scrapes This scraper extracts the following information from Amazon product pages: - Overall Rating (e.g., 4.5 out of 5) - Total Number of Ratings (e.g., 3,030 global ratings) - Star Distribution: - 5 star percentage - 4 star percentage - 3 star percentage - 2 star percentage - 1 star percentage ## Input Configuration The scraper accepts a simple JSON input with an array of Amazon product URLs: json { "urls": [ "https://www.amazon.co.uk/dp/B081VRSGWF", "https://www.amazon.com/dp/B08N5WRWNW" ] } ### Supported URL Formats - Product pages: https://www.amazon.com/dp/[ASIN] - Alternative format: https://www.amazon.com/gp/product/[ASIN] - All Amazon domains supported (.com, .co.uk, .de, .fr, .es, .it, .ca, etc.) ## Output Format The scraper returns structured data for each product: json { "url": "https://www.amazon.de/dp/B0CB37RQ9Z", "asin": "B0CB37RQ9Z", "success": true, "error": null, "data": { "overall_rating": 4.5, "total_ratings": 148, "star_distribution": { "5_star": 73, "4_star": 17, "3_star": 5, "2_star": 0, "1_star": 5 } } } ### Output Fields - url: The original Amazon product URL - asin: The extracted Amazon Standard Identification Number - success: Boolean indicating if scraping was successful - error: Error message if scraping failed (null on success) - data: The scraped review statistics - overall_rating: Average rating (0-5 scale) - total_ratings: Total number of customer ratings - star_distribution: Percentage breakdown by star rating (values are integers representing percentages) ## Local Development ### Prerequisites - Python 3.11+ - Apify CLI (npm install -g apify-cli) ### Project Structure amazon-review-scraper/ ├── .actor/ │ └── actor.json # Actor configuration ├── src/ │ ├── __init__.py # Package initialization │ ├── __main__.py # Entry point │ └── main.py # Main scraper logic ├── input_schema.json # Input validation schema ├── requirements.txt # Python dependencies ├── Dockerfile # Container configuration └── README.md # Documentation ### Installation 1. Clone or create the project structure 2. Install dependencies: bash pip install -r requirements.txt playwright install chromium ### Running Locally 1. Create an INPUT.json file: json { "urls": ["https://www.amazon.co.uk/dp/B081VRSGWF"] } 2. Run the actor: bash apify run -p ## Deployment to Apify Platform ### Method 1: Via Apify CLI 1. Login to Apify: bash apify login 2. Initialize the actor: bash apify init 3. Push to platform: bash apify push ### Method 2: Via Web Console 1. Go to Apify Console 2. Create a new Actor 3. Choose "Python + Playwright" template 4. Upload the source files 5. Build and run ## Use Cases - Price Monitoring - Track product ratings over time - Market Research - Analyze customer satisfaction across products - Competitor Analysis - Compare review metrics between similar products - Quality Assurance - Monitor your own product reviews - Data Analytics - Gather review data for analysis ## Limitations - Scrapes only publicly available review statistics - Does not extract individual review text or reviewer information - Requires valid Amazon product URLs with ASIN - Rate limited to prevent overwhelming Amazon servers (2-second delay between requests) ## Error Handling The scraper includes robust error handling: - Invalid URLs are reported with error messages - Network timeouts are handled gracefully - Missing elements use fallback selectors - Each URL failure doesn't affect others in the batch ## Performance - Speed: ~5-10 seconds per product page - Memory: ~256-1024 MB depending on batch size - CPU: Minimal usage (0.25-1 CPU) - Concurrency: Sequential processing with delays ## Support For issues, questions, or feature requests: - Check the Apify Documentation - Visit Apify Support - Review the Actor logs for debugging ## License This actor is provided as-is for educational and commercial use. Please ensure compliance with Amazon's Terms of Service and robots.txt when using this scraper. ## Changelog ### Version 0.1.0 - Initial release - Basic review statistics extraction - Multi-URL support - Playwright-based scraping

Common Use Cases

Market Research

Gather competitive intelligence and market data

Lead Generation

Extract contact information for sales outreach

Price Monitoring

Track competitor pricing and product changes

Content Aggregation

Collect and organize content from multiple sources

Ready to Get Started?

Try Amazon Reviews Stats Scraper now on Apify. Free tier available with no credit card required.

Start Free Trial

Actor Information

Developer
lalala0095
Pricing
Paid
Total Runs
177
Active Users
13
Apify Platform

Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.

Learn more about Apify

Need Professional Help?

Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.

Find a Specialist

Trusted by millions | Money-back guarantee | 24/7 Support