universal-real-estate-extractor

universal-real-estate-extractor

by motivational_nickel

Extract real estate listing data from any website using CSS selectors, automatic detection, Playwright rendering, and pagination support.

53 runs
3 users
Try This Actor

Opens on Apify.com

About universal-real-estate-extractor

Extract real estate listing data from any website using CSS selectors, automatic detection, Playwright rendering, and pagination support.

What does this actor do?

universal-real-estate-extractor is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.

Key Features

  • Cloud-based execution - no local setup required
  • Scalable infrastructure for large-scale operations
  • API access for integration with your applications
  • Built-in proxy rotation and anti-blocking measures
  • Scheduled runs and webhooks for automation

How to Use

  1. Click "Try This Actor" to open it on Apify
  2. Create a free Apify account if you don't have one
  3. Configure the input parameters as needed
  4. Run the actor and download your results

Documentation

Universal Real Estate Extractor Universal Real Estate Extractor is a flexible, production-grade web scraper designed to extract structured real estate data from property listing websites. It supports CSS selector extraction, automatic selector detection, Playwright-based rendering, and pagination crawling. This Actor is ideal for: - Market analysis and research - Lead generation pipelines - Real estate investment scouting - Property monitoring and alerts - Business intelligence workflows --- ## Features ### Multi-site compatibility Works with any listing/search URL. You can configure the Actor using: - A listing selector (listing_selector) - A detail field map (detail_fields) - Optional pagination selector (pagination_selector) ### Automatic listing selector detection If you leave listing_selector empty, the Actor will try a set of common patterns for real estate listing cards. This makes it easier to start quickly on new websites. ### Pagination support If you provide a pagination_selector, the Actor will follow the "next" button or link until: - The maximum number of listings is reached, or - No further page is found ### Playwright-powered rendering The Actor uses Playwright to render JavaScript-heavy sites. This helps with: - Modern real estate portals - Lazy-loaded listings - Client-side rendering frameworks ### Clean, flat, structured output The Actor writes normalized data into the Apify dataset. Default fields include: - url - title - price - address - beds - baths - area - timestamp --- ## Input ### Example input (JSON tab) ```json { "start_urls": ["https://www.zillow.com/san-jose-ca/"], "listing_selector": "", "detail_fields": { "title": "h1.ds-address-container", "price": "span.ds-value", "address": "h1.ds-address-container" }, "pagination_selector": "", "max_listings": 50, "use_playwright": true, "save_html_snapshot": false, "save_screenshot": false } Field descriptions: start_urls (required) List of listing or search URLs where the Actor should start. listing_selector (optional) CSS selector used to locate individual listing links on the start pages. If empty, the Actor will try common patterns automatically. detail_fields (optional) JSON object mapping field names to CSS selectors on the detail page. Example: "price": "span.ds-value". pagination_selector (optional) CSS selector for the “next page” button or link. max_listings (optional, default: 100) Maximum number of listings to scrape before the Actor stops. use_playwright (required, default: true) Enables Playwright browser automation for JavaScript-heavy websites. save_html_snapshot (optional, default: false) If true, saves an HTML snapshot of each detail page to the key-value store. save_screenshot (optional, default: false) If true, saves a full-page screenshot of each detail page. Output Each item in the dataset contains normalized property data. Typical fields: { "url": "https://www.zillow.com/homedetails/123-Main-St-San-Jose-CA-95112/12345678_zpid/", "title": "123 Main St, San Jose, CA 95112", "price": "$1,250,000", "address": "123 Main St, San Jose, CA 95112", "beds": "3", "baths": "2", "area": "1,450 sqft", "timestamp": "2025-12-05T06:11:30.000Z" } You can download results as: JSON CSV Excel Or access them via the Apify API Notes Some real estate websites may block bots or preview requests. This can affect URL previews inside the Apify UI but does not affect the Actor’s functionality. If a site has a different structure, you can adjust listing_selector and detail_fields to match its HTML. Support and customization If you need help adapting this Actor for a specific real estate website, workflow, or integration, you can contact the developer for custom configurations or private extensions.

Common Use Cases

Market Research

Gather competitive intelligence and market data

Lead Generation

Extract contact information for sales outreach

Price Monitoring

Track competitor pricing and product changes

Content Aggregation

Collect and organize content from multiple sources

Ready to Get Started?

Try universal-real-estate-extractor now on Apify. Free tier available with no credit card required.

Start Free Trial

Actor Information

Developer
motivational_nickel
Pricing
Paid
Total Runs
53
Active Users
3
Apify Platform

Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.

Learn more about Apify

Need Professional Help?

Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.

Find a Specialist

Trusted by millions | Money-back guarantee | 24/7 Support