Haldirams-Apify-Client
by saikarna
An open-source web scraper for Haldiram's products. Extracts titles, links, and prices fast using HTTPX & BeautifulSoup for e-commerce automation and price tracking.
Opens on Apify.com
About Haldirams-Apify-Client
Need to track Haldiram's product prices or build a dataset of their offerings? I built this scraper to do exactly that, and it's become my go-to for pulling clean, structured data from their site. It works by sending asynchronous requests using HTTPX, which makes it seriously fast, and then parses the HTML with BeautifulSoup to grab product titles, links, and current prices. I use it for keeping an eye on competitor pricing and for feeding product catalogs into my own analysis projects. If you're into e-commerce automation, market research, or just need a reliable way to monitor their inventory, this open-source client handles the heavy lifting. You get the data in a straightforward format, ready for a spreadsheet, database, or your custom dashboard. It's one of those simple, focused tools that just works without a fuss.
What does this actor do?
Haldirams-Apify-Client is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.
Key Features
- Cloud-based execution - no local setup required
- Scalable infrastructure for large-scale operations
- API access for integration with your applications
- Built-in proxy rotation and anti-blocking measures
- Scheduled runs and webhooks for automation
How to Use
- Click "Try This Actor" to open it on Apify
- Create a free Apify account if you don't have one
- Configure the input parameters as needed
- Run the actor and download your results
Documentation
Haldirams-Apify-Client
Overview
A Python-based Apify Actor template for scraping data from a single web page. It takes a target URL as input, fetches the HTML using HTTPX, parses it with Beautiful Soup, and stores the extracted data in an Apify dataset. The template is pre-configured to extract page headings but is designed to be easily modified to scrape any desired elements.
Key Features
- Apify SDK for Python: Provides the core framework for building and running the Actor.
- Input Schema: Validates input configuration, primarily the target page URL.
- HTTPX: Handles asynchronous HTTP requests to fetch page content efficiently.
- Beautiful Soup 4: Parses HTML/XML to extract and structure data.
- Apify Storage: Uses a Dataset for structured output and a Request Queue for managing URLs.
- Open Source: The template code is available for modification and local development.
How to Use
Running the Actor
- Provide the target page URL via the Actor's input configuration.
- Run the Actor. It will:
- Fetch the HTML from the provided URL.
- Parse the content (default: extract all heading tags
h1throughh6). - Save the structured results to the Actor's dataset.
Local Development
To modify the scraper logic or run it locally, use the Apify CLI to pull the Actor code:
-
Install the Apify CLI:
```bash
# Using Homebrew
brew install apify-cliUsing npm
npm -g install apify-cli
2. Pull the Actor using its unique name or ID:bash
apify pull
```
Find the Actor ID or unique name in the Apify Console.
Modifying the Scraper
The core data extraction logic is in the main script. To scrape different data, edit the Beautiful Soup parsing section. For example, change the soup.find_all() call to target different HTML elements, classes, or IDs.
Input/Output
- Input: A JSON object containing the
url(string) of the page to scrape. - Output: Data is pushed to an Apify Dataset. The default output is an array of objects, each containing a
headingfield with the extracted text. After modification, the output will reflect your custom data structure.
Resources
Categories
Common Use Cases
Market Research
Gather competitive intelligence and market data
Lead Generation
Extract contact information for sales outreach
Price Monitoring
Track competitor pricing and product changes
Content Aggregation
Collect and organize content from multiple sources
Ready to Get Started?
Try Haldirams-Apify-Client now on Apify. Free tier available with no credit card required.
Start Free TrialActor Information
- Developer
- saikarna
- Pricing
- Paid
- Total Runs
- 308
- Active Users
- 6
Related Actors
Video Transcript Scraper: Youtube, X, Facebook, Tiktok, etc.
by invideoiq
Linkedin Profile Details Scraper + EMAIL (No Cookies Required)
by apimaestro
Twitter (X.com) Scraper Unlimited: No Limits
by apidojo
Content Checker
by jakubbalada
Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.
Learn more about ApifyNeed Professional Help?
Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.
Trusted by millions | Money-back guarantee | 24/7 Support