trayv/my-actor
by trayv
An open-source web scraping actor for e-commerce & SEO, built by a store owner. Handles modern site protections to reliably extract product data, prices, and SEO elements.
Opens on Apify.com
About trayv/my-actor
Running an e-commerce store like elitetreasuresclub.shop means constantly needing clean, structured data for everything from competitor price tracking to SEO audits. I built this actor because I needed a reliable way to pull that data myself. As a Cloudflare Pro user with advanced security and performance rules on my own site, I understand the challenges of scraping modern, protected stores. This tool is designed to handle those complexities. It’s open-source, so you can see exactly how it works and adapt it for your own projects—whether you're monitoring prices, gathering product details for a feed, or analyzing on-page SEO elements across a category. I use it regularly to keep my own store competitive, and it saves me hours of manual work. Think of it as your own data-fetching assistant, built by someone who’s in the trenches with you.
What does this actor do?
trayv/my-actor is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.
Key Features
- Cloud-based execution - no local setup required
- Scalable infrastructure for large-scale operations
- API access for integration with your applications
- Built-in proxy rotation and anti-blocking measures
- Scheduled runs and webhooks for automation
How to Use
- Click "Try This Actor" to open it on Apify
- Create a free Apify account if you don't have one
- Configure the input parameters as needed
- Run the actor and download your results
Documentation
my-actor: Single-Page Web Scraper
Overview
A Node.js template for scraping structured data from a single web page. You provide a URL via input, and the actor fetches the HTML, parses it, and stores the results in an Apify dataset. It's built for ecommerce, SEO, and open-source use cases, but you can modify it to extract any data from a page.
Key Features
- Apify SDK: Provides the scaffolding for building and running the actor.
- Input Schema: Validates the input configuration, primarily the target URL.
- Axios Client: Handles the HTTP request to fetch the page HTML.
- Cheerio: Parses the HTML using a jQuery-like syntax to extract data.
- Dataset Storage: Outputs structured data (like page headings) into a dataset for easy access and export.
Input/Output
Input: The actor expects a JSON input containing the url of the page to scrape.
{
"url": "https://example.com"
}
Output: By default, the actor scrapes all heading elements (h1 through h6). The data is pushed to an Apify dataset, where each item is an object containing the heading text and its tag name. You can modify the Cheerio selector in the code to extract different elements.
How to Use
- Configure Input: Set the target page URL in the actor's input.
- Run the Actor: Start the run via the Apify Console, API, or CLI.
- Access Data: Once finished, retrieve the scraped data from the actor's dataset tab in JSON, CSV, or other formats.
The core scraping logic works as follows:
1. Actor.getInput() retrieves the input configuration.
2. axios.get(url) fetches the HTML of the target page.
3. cheerio.load(response.data) loads the HTML for parsing.
4. A Cheerio selector (e.g., $("h1, h2, h3, h4, h5, h6")) iterates through elements to extract data.
5. Actor.pushData() stores the extracted records into the dataset.
To customize what data is scraped, edit the Cheerio selector within the actor's code.
Local Development
You can pull this actor to your local machine for development using the Apify CLI.
- Install the CLI:
```bash
# Using npm
npm -g install apify-cli
# Using Homebrew
brew install apify-cli
```
- Pull the actor using its unique name or ID:
bash apify pull <ActorId>
Find the Actor ID or unique name (liketrayv/my-actor) in the Apify Console.
Resources
- Web scraping in Node.js with Axios and Cheerio
- Apify SDK for JavaScript documentation
- Apify Platform documentation
- Join our developer community on Discord
For a full guide on building actors, see this article.
Categories
Common Use Cases
Market Research
Gather competitive intelligence and market data
Lead Generation
Extract contact information for sales outreach
Price Monitoring
Track competitor pricing and product changes
Content Aggregation
Collect and organize content from multiple sources
Ready to Get Started?
Try trayv/my-actor now on Apify. Free tier available with no credit card required.
Start Free TrialActor Information
- Developer
- trayv
- Pricing
- Paid
- Total Runs
- 516
- Active Users
- 7
Related Actors
Google Maps Reviews Scraper
by compass
Facebook Ads Scraper
by apify
Google Ads Scraper
by silva95gustavo
Facebook marketplace scraper
by curious_coder
Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.
Learn more about ApifyNeed Professional Help?
Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.
Trusted by millions | Money-back guarantee | 24/7 Support