Amazon Product Scraper
by neuro-scraper
β‘ Instantly discover Amazonβs best-selling products with one click! π This smart actor fetches real-time prices, ratings, and deals β giving you insig...
Opens on Apify.com
About Amazon Product Scraper
β‘ Instantly discover Amazonβs best-selling products with one click! π This smart actor fetches real-time prices, ratings, and deals β giving you insights in seconds. Trusted by pros for accuracy, speed, and reliability. Run it now and find hidden gems before your competitors do! π
What does this actor do?
Amazon Product Scraper is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.
Key Features
- Cloud-based execution - no local setup required
- Scalable infrastructure for large-scale operations
- API access for integration with your applications
- Built-in proxy rotation and anti-blocking measures
- Scheduled runs and webhooks for automation
How to Use
- Click "Try This Actor" to open it on Apify
- Create a free Apify account if you don't have one
- Configure the input parameters as needed
- Run the actor and download your results
Documentation
π Amazon Search Scraper One-line tagline: Instantly extract product listings and product-page metadata from Amazon search queries β fast, secure, and ready for business use. --- ## π Summary Amazon Search Scraper retrieves product data from Amazon search results and product pages and returns clean, business-ready JSON records for analysis. Designed for speed and reliability, it helps teams discover product details, prices, reviews, images, and availability for competitive research, analytics, and monitoring. --- ## π‘ Use cases / When to use * Competitive price monitoring and alerting * Market/product research and sourcing * Creating product catalogs and feeds * Gathering product images and descriptions for analytics * Quickly prototyping e-commerce dashboards --- ## β‘ Quick Start β Console (one-click) 1. Open this Actor in Apify Console. 2. Fill the Queries input (single keyword or array of keywords). 3. (Optional) Enable Proxy Configuration if scraping at scale. 4. Click Run. Results appear in the default dataset/OUTPUT in seconds. > Friendly microcopy: βPlug in a search term, click Run, and get structured product data instantly.β --- ## βοΈ Quick Start (CLI + API) CLI bash # Run an actor with JSON input via apify-cli apify run --actor <your-actor-id> --input input.example.json Python (apify-client) python from apify_client import ApifyClient client = ApifyClient('<APIFY_TOKEN>') run = client.actor('your-user/amazon-search-scraper').call(run_input={"queries": ["wireless earbuds"]}) print('Started run:', run['id']) --- ## π Inputs (fields & schema) Console JSON input example (see input.example.json file): json { "queries": ["wireless earbuds", "gaming mouse"], "headless": true, "requestDelay": [1.0, 2.0] } Fields * queries β string or array β required β Search keywords or Amazon product URLs. The actor accepts either a search keyword (will run site search) or a direct Amazon product URL for product-page scraping. * headless β boolean β optional β Run browser in headless mode (default: true). * requestDelay β array [min, max] β optional β Delay range (seconds) between product-page requests to reduce rate. --- ## βοΈ Configuration | π Name | π Type | β Required | βοΈ Default | π Example | π§ Notes | | -----------------: | --------------: | ----------: | ---------: | ----------------------: | --------------------------------- | | queries | string/array | β
Yes | β | ["wireless earbuds"] | Search terms or product URLs | | headless | boolean | βοΈ Optional | true | false | Turn off to debug visually | | requestDelay | array (min,max) | βοΈ Optional | [1.0, 2.0] | [0.5, 1.0] | Avoids aggressive scraping | | proxyConfiguration | object | βοΈ Optional | {} | {"useApifyProxy": true} | Use residential proxies for scale | Example Console setup: paste "wireless earbuds" into queries and press Run Actor. --- ## π Outputs (Dataset / KV examples) Each dataset item is a JSON object with attributes similar to: json { "asin": "B09EXAMPLE", "title": "Wireless Earbuds XYZ", "brand_name": "BrandCo", "url": "https://www.amazon.com/...]", "price": "$59.99", "currency": "$", "thumbnail": "https://...jpg", "images": ["https://...jpg", "https://...jpg"], "stars": 4.3, "review_count": "1.2k", "availability": "In Stock", "description": "Key bullet points...", "categories": "Electronics > Audio", "search_keyword": "wireless earbuds" } > Note: The actor pushes results to the default dataset and also returns the full result array in the key-value store under the standard OUTPUT key. --- ## π Environment Variables * <APIFY_TOKEN> β required to call the Actor programmatically via API. * <PROXY_USER:PASS@HOST:PORT> β placeholder for custom proxy credentials. Security note: Store secrets in Apify Console Secrets β do not paste them into input fields. --- ## βΆοΈ How to Run Console 1. Go to the Actor page in Apify Console. 2. Paste your queries (single string or array) into the Input field. 3. (Optional) Configure proxies under the Proxy Configuration editor. 4. Click Run. CLI bash apify run --actor your-user/amazon-search-scraper --input input.example.json API (Python) See Quick Start (above) β use client.actor(...).call(run_input=...) and read the returned run ID. --- ## β° Scheduling & Webhooks * Use Apify Console scheduling to run this Actor at any interval (hourly, daily, weekly). * Configure webhooks on run completion to forward JSON output to your endpoint for real-time processing. --- ## πΎοΈ Logs & Troubleshooting * Check the Console logs for step-by-step run info and any per-item warnings. * Common issues: * No results: verify that queries are valid and spelled correctly. * Request timeouts: increase requestDelay or enable Proxy Configuration. * Selector changes on Amazon: refresh the run and adjust queries β the actor is resilient but web UIs change frequently. --- ## π Permissions & Storage Notes * This Actor collects publicly visible product information only. It does not perform account actions. * Results are stored in Apify Datasets/Key-Value stores in your account and follow Apifyβs standard retention and access controls. --- ## π Changelog / Versioning * v0.1.0 β Initial public release: search + product-page scraping, structured dataset output. --- ## π Notes / TODOs * TODO: Consider adding a CLI flag / input for limiting the number of product pages per query (reason: some queries return many results). * TODO: Add optional CSV export in output settings (reason: convenient for BI ingestion). --- ## π Proxy Configuration If you will run many searches or large-scale scraping, configure Apify Proxy or custom proxies. Enable Apify Proxy (Console): * In the Actor run form, open Proxy configuration and enable Use Apify Proxy (choose RESIDENTIAL for best results). Custom proxy example (as secret): * Use <PROXY_USER:PASS@HOST:PORT> format and store as a Console Secret. Reference it in the Proxy Configuration editor. Environment variables (examples) HTTP_PROXY=<PROXY_USER:PASS@HOST:PORT> HTTPS_PROXY=<PROXY_USER:PASS@HOST:PORT> Reminder: Store proxy credentials in Secrets and do not paste them into public inputs. TODO: Consider proxy rotation for large-scale scraping. --- ## π References 1. Apify Actor README guidelines β https://docs.apify.com/console/actors/README 2. Apify Input/Output schemas β https://docs.apify.com/platform/input-output 3. Apify CLI & API usage β https://docs.apify.com/console/actors/run --- ## π€ What I inferred from main.py * The Actor accepts queries (keywords or Amazon URLs) and uses an automated browser to fetch search results and product pages. * It extracts product metadata, images, prices, ratings, and availability and returns structured JSON items. * It respects throttling delays and can be configured to use proxies for scale. * Results are pushed to the default dataset and the key-value store under OUTPUT. --- --- # input.example.json json { "queries": [ "wireless earbuds", "gaming mouse" ], "headless": true, "requestDelay": [1.0, 2.0] } --- # CONFIG.md (optional) ## Quick config notes * Secrets: Add <APIFY_TOKEN> and any proxy credentials to Console Secrets. * Scaling: For repeated large runs, enable Apify Proxy (RESIDENTIAL) and consider running with scheduling + webhooks to automate downstream processing. ## Suggested settings in Console * Proxy configuration: use Apify Proxy β RESIDENTIAL * Dataset retention: enable automatic export to your storage of choice --- Generated by: Neuro Scraper
Categories
Common Use Cases
Market Research
Gather competitive intelligence and market data
Lead Generation
Extract contact information for sales outreach
Price Monitoring
Track competitor pricing and product changes
Content Aggregation
Collect and organize content from multiple sources
Ready to Get Started?
Try Amazon Product Scraper now on Apify. Free tier available with no credit card required.
Start Free TrialActor Information
- Developer
- neuro-scraper
- Pricing
- Paid
- Total Runs
- 38
- Active Users
- 3
Related Actors
Google Maps Reviews Scraper
by compass
Facebook Ads Scraper
by apify
Google Ads Scraper
by silva95gustavo
Facebook marketplace scraper
by curious_coder
Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.
Learn more about ApifyNeed Professional Help?
Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.
Trusted by millions | Money-back guarantee | 24/7 Support