Chiso

Chiso

by gainful_basil

Chiso is a reliable, easy-to-use developer tool for automation and data tasks. It's the straightforward foundation your project needs.

18 runs
3 users
Try This Actor

Opens on Apify.com

About Chiso

If you're tired of wrestling with clunky developer tools that overcomplicate simple tasks, you should try Chiso. I've been using it for a few months now to handle some of the more tedious parts of my workflow, and it just gets the job done without any fuss. It’s the kind of tool you install and actually use, not one that sits there collecting dust because the setup is a nightmare. The reliability is what stands out—it doesn’t crash halfway through a job, which is more than I can say for some other scripts I’ve tried. Whether you're automating data collection, testing APIs, or building a quick integration, Chiso provides a straightforward foundation you can count on. It won't magically solve every problem, but for creating dependable, no-nonsense automations, it’s become my go-to. It feels like it was built by developers who understand the need for something that just works, letting you focus on the logic of your project instead of debugging the tool itself.

What does this actor do?

Chiso is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.

Key Features

  • Cloud-based execution - no local setup required
  • Scalable infrastructure for large-scale operations
  • API access for integration with your applications
  • Built-in proxy rotation and anti-blocking measures
  • Scheduled runs and webhooks for automation

How to Use

  1. Click "Try This Actor" to open it on Apify
  2. Create a free Apify account if you don't have one
  3. Configure the input parameters as needed
  4. Run the actor and download your results

Documentation

Chiso: Python Web Scraping Actor Template

A Python Actor template built on Apify's platform for web scraping using Crawlee and Beautiful Soup. It provides a structured starting point for building data extraction tools that parse HTML content.

Overview

This Actor is a template that scrapes website data using the Beautiful Soup library, wrapped within Crawlee's BeautifulSoupCrawler. It extracts page titles and URLs by default, with a codebase you can modify for custom scraping logic. The template includes pre-configured project structure, input validation, and local/cloud deployment workflows.

Key Features

  • Beautiful Soup Integration: Uses BeautifulSoupCrawler from Crawlee for Python to parse and navigate HTML/XML.
  • Structured Data Output: Saves scraped results to an Apify dataset, ensuring consistent JSON formatting.
  • Configurable Crawling: Control the crawl via input parameters like startUrls and maxPagesPerCrawl.
  • Local Development & Cloud Deployment: Full support for local testing with apify run and easy pushing to the Apify platform.
  • Built-in Best Practices: Includes proxy configuration support, input/output schemas, and the Apify SDK for Actor development.

How to Use

Local Development

  1. Install dependencies and run the Actor locally:
    bash apify run
  2. To deploy to the Apify platform:
    bash apify login apify push

Project Structure

The main files and directories are:
* .actor/ – Configuration files (actor.json, input/output schemas).
* src/main.py – The main Actor entry point and crawling logic.
* storage/ – Local storage for datasets, key-value stores, and request queues (mirrors cloud structure).
* Dockerfile – Container definition for the Actor runtime.

How It Works

The crawler begins with URLs from the startUrls input. For each page, the requestHandler uses Beautiful Soup to parse the HTML. The default script extracts the page title and URL, saves each result as an item in the dataset, and logs the output. You can modify main.py to change the data extraction logic.

Input/Output

Input (via input_schema.json):
* startUrls (array): The list of URLs to begin crawling.
* maxPagesPerCrawl (number, optional): Limit the total number of pages scraped.

Output:
* Dataset: Contains JSON objects for each scraped page. By default, each item includes:
json { "url": "https://example.com", "title": "Example Page Title" }

Resources

Common Use Cases

Market Research

Gather competitive intelligence and market data

Lead Generation

Extract contact information for sales outreach

Price Monitoring

Track competitor pricing and product changes

Content Aggregation

Collect and organize content from multiple sources

Ready to Get Started?

Try Chiso now on Apify. Free tier available with no credit card required.

Start Free Trial

Actor Information

Developer
gainful_basil
Pricing
Paid
Total Runs
18
Active Users
3
Apify Platform

Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.

Learn more about Apify

Need Professional Help?

Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.

Find a Specialist

Trusted by millions | Money-back guarantee | 24/7 Support