Metadata Extractor
by jancurn
About Metadata Extractor
A small, efficient actor that loads a web page, parses its HTML using the Cheerio library, and extracts metadata from the <HEAD> tag, such as the page title, description, and author.
What does this actor do?
Metadata Extractor is a web scraping tool that runs in the cloud on the Apify platform. Given a list of web page URLs, it loads each page's HTML and returns the page title and the contents of its meta tags as JSON.
Key Features
- Cloud-based execution - no local setup required
- Scalable infrastructure for large-scale operations
- API access for integration with your applications
- Built-in proxy rotation and anti-blocking measures
- Scheduled runs and webhooks for automation
How to Use
- Click "Try This Actor" to open it on Apify
- Create a free Apify account if you don't have one
- Configure the input parameters as needed
- Run the actor and download your results
Documentation
Metadata extractor

The actor takes a list of web page URLs as input, loads the HTML of each page, and extracts metadata from it. The result is stored as JSON in the default dataset.

For example, for https://www.apify.com, the JSON result looks as follows:

    {
      "url": "https://www.apify.com/",
      "title": "Web Scraping, Data Extraction and Automation · Apify",
      "meta": {
        "X-UA-Compatible": "IE=edge,chrome=1",
        "viewport": "width=device-width,minimum-scale=1,initial-scale=1",
        "copyright": "Copyright© 2019 Apify Technologies s.r.o. All rights reserved.",
        "keywords": "web scraper, web crawler, scraping, data extraction, API",
        "robots": "index,follow",
        "referrer": "origin",
        "googlebot": "index,follow",
        "description": "Apify extracts data from websites, crawls lists of URLs and automates workflows on the web. Turn any website into an API in a few minutes!",
        "twitter:card": "summary_large_image",
        "twitter:creator": "@apify",
        "fb:app_id": "1636933253245869",
        "og:url": "https://apify.com/",
        "og:type": "website",
        "og:title": "Web Scraping, Data Extraction and Automation · Apify",
        "og:description": "Apify extracts data from websites, crawls lists of URLs and automates workflows on the web. Turn any website into an API in a few minutes!",
        "og:image": "https://apify.com/img/og-image.png",
        "og:image:alt": "Apify",
        "og:image:width": "1200",
        "og:image:height": "630",
        "og:locale": "en_IE",
        "og:site_name": "Apify",
        "next-head-count": "19"
      }
    }
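To make the extraction step concrete, here is a minimal sketch of how a title and a set of meta tags can be turned into a record shaped like the JSON result above. It is not the actor's code: the actor uses Cheerio, while this sketch swaps in Python's standard html.parser module so that it runs without dependencies. The names HeadMetadataParser, extract_metadata, and sample_html are hypothetical, as is the choice to key each meta tag by its name, property, or http-equiv attribute (inferred from the keys in the sample output). For simplicity, the sketch collects meta tags from the whole document rather than only from the head.

```python
import json
from html.parser import HTMLParser


class HeadMetadataParser(HTMLParser):
    """Collects the <title> text and the content of <meta> tags (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.title = None
        self.meta = {}
        self._in_title = False
        self._title_parts = []

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        if tag == "title":
            self._in_title = True
        elif tag == "meta":
            # Assumption: a meta tag is keyed by whichever of these attributes it has.
            key = attr.get("name") or attr.get("property") or attr.get("http-equiv")
            content = attr.get("content")
            if key and content is not None:
                self.meta[key] = content

    def handle_endtag(self, tag):
        if tag == "title" and self._in_title:
            self._in_title = False
            self.title = "".join(self._title_parts).strip()

    def handle_data(self, data):
        if self._in_title:
            self._title_parts.append(data)


def extract_metadata(url, html):
    """Return a dict with the same top-level keys as the sample result: url, title, meta."""
    parser = HeadMetadataParser()
    parser.feed(html)
    parser.close()
    return {"url": url, "title": parser.title, "meta": parser.meta}


# Hypothetical input page, used only for illustration.
sample_html = """<html><head>
<title>Example page</title>
<meta name="description" content="A short description.">
<meta property="og:type" content="website">
<meta http-equiv="X-UA-Compatible" content="IE=edge">
</head><body></body></html>"""

result = extract_metadata("https://example.com/", sample_html)
print(json.dumps(result, indent=2))
```

The sketch parses an HTML string that is already in memory. Downloading the page and writing the record to a dataset, which the actor handles on the Apify platform, are left out.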
Common Use Cases
- Market Research: Gather competitive intelligence and market data
- Lead Generation: Extract contact information for sales outreach
- Price Monitoring: Track competitor pricing and product changes
- Content Aggregation: Collect and organize content from multiple sources
Ready to Get Started?
Try Metadata Extractor now on Apify. Free tier available with no credit card required.
Actor Information
- Developer: jancurn
- Pricing: Paid
- Total Runs: 1,668,220
- Active Users: 1,265
Related Actors
- Web Scraper by apify
- Cheerio Scraper by apify
- Website Content Crawler by apify
- Legacy PhantomJS Crawler by apify
Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.