Oxford English Dictionary
by focused_berimbau
Extract words, definitions, part of speech, and popularity data from the Oxford English Dictionary. Perfect for research, app development, and content creation.
About Oxford English Dictionary
Need to pull data from the Oxford English Dictionary for a project? This actor is built for that. It's a straightforward scraper that extracts specific word entries, including their definitions, part of speech, and the percentage-based popularity metric. I've used it to build datasets for linguistic analysis and to populate reference databases, and it handles the structure of OED pages reliably. Each run returns clean, structured JSON, which saves hours of manual copying or writing parsing logic. It suits researchers compiling corpora, developers building educational apps, and content creators who need verified definitions at scale. Configure your list of target words, run the actor, and the lexical data is ready for your next step.
What does this actor do?
Oxford English Dictionary is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.
Key Features
- Cloud-based execution - no local setup required
- Scalable infrastructure for large-scale operations
- API access for integration with your applications
- Built-in proxy rotation and anti-blocking measures
- Scheduled runs and webhooks for automation
How to Use
- Click "Try This Actor" to open it on Apify
- Create a free Apify account if you don't have one
- Configure the input parameters as needed
- Run the actor and download your results
Documentation
Oxford English Dictionary Actor
Scrapes word definitions and metadata from the Oxford English Dictionary (OED) advanced search results.
Target URL: https://www.oed.com/search/advanced/Entries
Overview
This actor automates data extraction from the OED's advanced search. It navigates search results and detail pages to collect comprehensive lexical data for given search terms. It's built on the Apify platform using Puppeteer.
Key Features
- Word Data Extraction: Scrapes headword, part of speech, pronunciation, definitions, etymology, and usage examples.
- Pagination Handling: Automatically traverses multiple pages of search results.
- Request Queue Management: Efficiently processes large lists of search queries from the input dataset.
- Structured Output: Returns clean, organized data in JSON format, suitable for databases or analysis.
How to Use
Run the actor with a list of search terms. Configure the run settings (like max results per query) via the input.
Input Configuration
The input is a JSON object. Provide an array of search queries and optionally adjust the maxResultsPerQuery.
{
  "queries": [
    "lexicography",
    "api",
    "serendipity"
  ],
  "maxResultsPerQuery": 50
}
- queries: Required. An array of strings (words or phrases) to look up in the OED.
- maxResultsPerQuery: Optional. Maximum number of word entries to scrape per query. Default is 50.
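The input object above can also be assembled and checked in code before a run. The following is a minimal Python sketch, not part of the actor itself: `build_input` is a hypothetical helper, and the rules it enforces beyond the documented ones (trimming whitespace, dropping duplicate terms, requiring a positive integer for maxResultsPerQuery) are assumptions.

```python
import json

DEFAULT_MAX_RESULTS = 50  # default stated in the input description


def build_input(queries, max_results_per_query=DEFAULT_MAX_RESULTS):
    """Return an input dict with the two documented fields (hypothetical helper)."""
    if isinstance(queries, str):
        raise TypeError("queries must be a list of strings, not a single string")

    cleaned = []
    for query in queries:
        if not isinstance(query, str):
            raise TypeError(f"query must be a string: {query!r}")
        query = query.strip()
        # Assumption: blank and duplicate terms are not useful, so skip them.
        if query and query not in cleaned:
            cleaned.append(query)

    if not cleaned:
        raise ValueError("queries is required and must contain at least one term")

    # Assumption: the limit must be a positive integer (bool is excluded on purpose).
    if (
        isinstance(max_results_per_query, bool)
        or not isinstance(max_results_per_query, int)
        or max_results_per_query < 1
    ):
        raise ValueError("maxResultsPerQuery must be a positive integer")

    return {"queries": cleaned, "maxResultsPerQuery": max_results_per_query}


if __name__ == "__main__":
    run_input = build_input(["lexicography", "api", "serendipity"])
    print(json.dumps(run_input, indent=2))
```

The printed JSON can be pasted into the input editor or sent through the API.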
Running the Actor
- Set your input JSON in the Apify Console or send it via the API.
- Start the run. The actor will open a browser, perform searches, and scrape the resulting pages.
- Download or access the results from the dataset when the run completes.
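For the API route, the steps above might be sketched in Python with only the standard library. This is an illustration under assumptions: it uses the Apify API v2 endpoint that runs an actor synchronously and returns its dataset items, the helper names are hypothetical, and the actor ID in the usage note is a placeholder since this page does not state the exact ID.

```python
import json
import urllib.parse
import urllib.request

API_BASE = "https://api.apify.com/v2"


def build_run_request(actor_id, token, run_input):
    """Build (but do not send) a POST request that runs an actor and
    returns its dataset items in the response body."""
    # The API addresses actors as "username~actor-name".
    path_id = urllib.parse.quote(actor_id.replace("/", "~"), safe="~")
    url = f"{API_BASE}/acts/{path_id}/run-sync-get-dataset-items"
    return urllib.request.Request(
        url,
        data=json.dumps(run_input).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


def run_actor(actor_id, token, run_input, timeout=300):
    """Send the request and return the scraped items as a list of dicts."""
    request = build_run_request(actor_id, token, run_input)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

A call would look like `run_actor("<username>/oxford-english-dictionary", token, {"queries": ["serendipity"]})`, with the username filled in from the actor's page. The synchronous endpoint is only suitable for short runs; since this actor drives a browser, a large query list may be better served by starting the run asynchronously and reading the dataset once it finishes.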
Input/Output
Input Schema
As shown in the How to Use section, the input requires a queries array.
Output Schema
Results are stored in the dataset as individual items, each representing a scraped word entry. The structure includes:
{
  "query": "serendipity",
  "headword": "serendipity, n.",
  "pronunciation": "/ˌsɛrənˈdɪpɪti/",
  "partOfSpeech": "noun",
  "definitions": [
    "The faculty of making happy and unexpected discoveries by accident."
  ],
  "etymology": "Mid 19th century: from Serendip + -ity...",
  "examples": [
    {
      "text": "a fortunate stroke of serendipity",
      "source": "2002, N.Y. Times"
    }
  ],
  "url": "https://www.oed.com/view/Entry/176289"
}
Key Fields:
* query: The original search term.
* headword: The canonical dictionary entry heading.
* pronunciation: Phonetic transcription.
* partOfSpeech: Grammatical category (noun, verb, etc.).
* definitions: Array of meaning definitions.
* etymology: Word origin and history.
* examples: Array of usage example objects with text and source.
* url: Source URL on oed.com.
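Downstream code can load items with the fields listed above into typed records. The sketch below is one possible approach in Python, assuming every field except query may be missing from an item; the class and function names are hypothetical and not part of the actor.

```python
from dataclasses import dataclass, field


@dataclass
class Example:
    text: str
    source: str = ""


@dataclass
class WordEntry:
    query: str
    headword: str = ""
    pronunciation: str = ""
    part_of_speech: str = ""
    definitions: list = field(default_factory=list)
    etymology: str = ""
    examples: list = field(default_factory=list)
    url: str = ""


def parse_entry(item):
    """Convert one dataset item (a dict) into a WordEntry."""
    examples = [
        Example(text=ex.get("text", ""), source=ex.get("source", ""))
        for ex in item.get("examples") or []
    ]
    return WordEntry(
        query=item.get("query", ""),
        headword=item.get("headword", ""),
        pronunciation=item.get("pronunciation", ""),
        part_of_speech=item.get("partOfSpeech", ""),
        definitions=list(item.get("definitions") or []),
        etymology=item.get("etymology", ""),
        examples=examples,
        url=item.get("url", ""),
    )


def definition_rows(entry):
    """Flatten an entry into one (headword, part of speech, definition)
    tuple per definition, e.g. for CSV export or a database insert."""
    return [(entry.headword, entry.part_of_speech, d) for d in entry.definitions]
```

Flattening to one row per definition keeps multi-sense words easy to filter and join in a relational table, at the cost of repeating the headword on each row.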
Find this actor on the Apify platform: https://apify.com/your-actor-profile/oxford-english-dictionary?fpr=python_automation
Actor Information
- Developer: focused_berimbau
- Pricing: Paid
- Total Runs: 725
- Active Users: 20
Related Actors
Similarweb scraper
by curious_coder
Google Ads Scraper
by silva95gustavo
Cheap Google Search Results Scraper
by tuningsearch
G2 Explorer
by jupri
Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.