Clinicaltrials Scraper

Clinicaltrials Scraper

by parseforge

Collect structured trial records on ClinicalTrials.gov. It supports keyword searches, country/location filters, built-in status/phase options, documen...

38 runs
5 users
Try This Actor

Opens on Apify.com

About Clinicaltrials Scraper

Collect structured trial records on ClinicalTrials.gov. It supports keyword searches, country/location filters, built-in status/phase options, document availability indicators, and precise record limits so analysts can create medical research datasets in seconds.

What does this actor do?

Clinicaltrials Scraper is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.

Key Features

  • Cloud-based execution - no local setup required
  • Scalable infrastructure for large-scale operations
  • API access for integration with your applications
  • Built-in proxy rotation and anti-blocking measures
  • Scheduled runs and webhooks for automation

How to Use

  1. Click "Try This Actor" to open it on Apify
  2. Create a free Apify account if you don't have one
  3. Configure the input parameters as needed
  4. Run the actor and download your results

Documentation

ClinicalTrials.gov Scraper πŸš€ Instantly turn the ClinicalTrials.gov search experience into clean, structured datasets ready for analysis, due diligence, or portfolio monitoring. This actor calls the same API that powers the new ClinicalTrials.gov interface, so you get up-to-the-minute trial data without fighting HTML pages or browser automation. Pick any combination of keywords, locations, study statuses, phases, funder types, or document availability filters and receive perfectly flattened JSON that is easy to drop into BI tools. Target Audience: Healthcare investors, market access teams, clinical operations, medical affairs, data vendors. Primary Use Cases: Competitive landscaping, site feasibility, results monitoring, KOL research, regulatory compliance tracking. --- ## πŸ“Š What Does the ClinicalTrials.gov Scraper Do? This tool collects every public data point exposed through the official Studies API: - Trial identifiers (NCT ID, acronyms, org study IDs) and shareable URLs - Current status, last-known status, verification dates, and results posting history - Design metadata (study type, phases, allocation, intervention model, masking, primary purpose, observational model, time perspective, target duration) - Eligibility details (sex, age brackets, healthy volunteer toggle, inclusion and exclusion criteria) - Lead sponsor, collaborator list, and funder category - Outcome measures (primary and secondary with descriptions), conditions, interventions, uploaded documents (protocols/SAP/ICF) - Central contacts, site-level contacts, site coordinates, and location status - Historical data including original enrollment info, sponsor changes, and version history Business Value: Replace manual ClinicalTrials.gov browsing with a repeatable pipeline, feed downstream analytics with reliable identifiers, and document compliance sensitive artifacts (posted results, FDAAA violation flags, document uploads) automatically. --- ## πŸ”§ Input To start collecting trials, fill in the input form. Choose either a start URL or the filters below: - startUrl – Paste a ClinicalTrials.gov search page URL (e.g., https://clinicaltrials.gov/search) or an API URL. We'll reuse its filters and paginate automatically. Prefill: https://clinicaltrials.gov/search - Condition/disease – Specific condition or disease to search for (e.g., diabetes, hypertension, cancer) - Other terms – Additional search terms (e.g., symptoms, treatments, keywords) - Intervention/treatment – Intervention or treatment to search for (e.g., drug name, procedure, device) - Location – Search by address, city, state, zip code, or country - Study Status – Filter by "All studies" or "Recruiting and not yet recruiting studies" - Sex – Filter by eligible sex (All, Female, Male) - Age – Use either predefined age groups (Child, Adult, Older adult) via ageGroups OR manually specify custom age values with units (Years, Months, Weeks, Days, Hours, Minutes) via ageRangeFrom/ageRangeTo fields. These options are mutually exclusive - you cannot use both at the same time. - Accepts healthy volunteers – Toggle to only show trials that accept healthy volunteers - Study Phase – Filter by Early Phase 1, Phase 1, Phase 2, Phase 3, Phase 4, or Not applicable - Study Type – Filter by Interventional, Observational, Patient registries, Expanded access, and variations - Study Results – Filter by "Any", "With results", or "Without results" - Study Documents – Require protocol, SAP, or informed consent form uploads - Funder Type – Filter by NIH, Other U.S. federal agency, Industry, or All others - Date Ranges – Filter by date ranges using separate "From" and "To" fields for: study start date, primary completion date, study completion date, first posted date, results first posted date, or last update posted date (e.g., studyStartDateFrom, studyStartDateTo) - More Ways to Search – Search by title/acronym, outcome measure, sponsor/collaborator, lead sponsor, study IDs, or facility name - FDAAA violation flag – Toggle to only return studies flagged for FDAAA results submission violations - Sort studies by – Sort by "Relevance" or "Newest First" - maxItems – Free users: Limited to 100. Paid users: Optional, max 1,000,000. Prefill: 10 Example JSON using predefined age groups: json { "condition": "diabetes", "otherTerms": "type 2", "intervention": "metformin", "location": "United States", "studyStatusFilter": "recruitingAndNotYetRecruiting", "ageGroups": ["adult", "older"], "phases": ["phase2", "phase3"], "documentTypes": ["protocol"], "maxItems": 10 } Example JSON using manual age range: json { "condition": "diabetes", "ageRangeFrom": 18, "ageRangeFromUnit": "years", "ageRangeTo": 65, "ageRangeToUnit": "years", "maxItems": 10 } Note: You cannot use both ageGroups and ageRangeFrom/ageRangeTo at the same time. The system automatically detects which method you're using based on which fields are present. --- ## πŸ“₯ Output When the actor finishes, your dataset contains one flat JSON per trial. Download as JSON, CSV, Excel, or feed directly into the Apify API/webhooks. Sample record (trimmed): json { "nctId": "NCT07246980", "briefTitle": "Continuous Observational Monitoring Program for Allayβ„’ Hydrogel Cap Safety and Success Study", "studyUrl": "https://clinicaltrials.gov/study/NCT07246980", "officialTitle": "Continuous Observational Monitoring Program for Allayβ„’ Hydrogel Cap Safety and Success (COMPASS) Study...", "acronym": "COMPASS", "orgStudyId": "CLP-0002", "secondaryIds": null, "overallStatus": "NOT_YET_RECRUITING", "hasResults": false, "studyType": "OBSERVATIONAL", "phase": null, "enrollmentCount": 1000, "enrollmentType": "ESTIMATED", "targetDuration": "2 Years", "conditions": ["Peripheral Nerve Injuries"], "interventions": [{ "type": "DEVICE", "name": "Nerve Cap", "description": "Patients who have or will receive allay Hydrogel Cap", "armGroupLabels": null, "otherNames": null }], "leadSponsorName": "Tulavi Therapeutics,Inc.", "leadSponsorClass": "INDUSTRY", "collaborators": null, "locations": null, "centralContacts": null, "briefSummary": "This is a multi-center, open-label, observational registry study...", "gender": "ALL", "standardAges": ["CHILD", "ADULT", "OLDER_ADULT"], "inclusionCriteria": ["Patients who have or will receive allayβ„’ Hydrogel Cap"], "exclusionCriteria": ["Patients who have not received allayβ„’ Hydrogel Cap"], "primaryOutcomeMeasures": [{ "measure": "Assessment of change in Pain Scores from baseline using VAS", "timeFrame": "Baseline (Pre Surgery ), 2 Weeks, 1 month, 4 month, 12 month and 24 month (Post Surgery)", "description": "Pain scores will be collected using either of the following scales..." }], "secondaryOutcomeMeasures": [{ "measure": "Incidence of Device Failures surgical & medical complications...", "timeFrame": "Upto 2 years Post Surgery", "description": "Secondary endpoints of the study include safety assessment..." }], "scrapedAt": "2025-11-25T00:07:52.271Z" } What You Get: Clean identifiers, design metadata, contacts, eligibility criteria, outcome measures, and compliance evidence in one row. Download Options: CSV, Excel, JSON, XML, or read directly through the Apify API. --- ## ⚑ Why Choose the ClinicalTrials.gov Scraper? - All filters, zero browsers – We hit the Studies API directly, so there's nothing to break when the UI changes - Comprehensive data extraction – Capture all available fields from both listing and detailed API responses, with intelligent deduplication - Compliance-ready evidence – Capture results posting dates, FDAAA violation flags, and document download URLs automatically - Location intelligence built-in – Every site includes status, city/state/country, ZIP, and coordinates for quick mapping - Perfectly flat schema – No nested surprises. Contacts, documents, outcomes, and interventions are tidy arrays that BI tools understand - Clean data formatting – Empty strings and arrays are normalized to null for cleaner datasets - Huge exports supported – Request up to one million records per run without worrying about throttled browsers ⏰ Time Savings: Minutes instead of days copying and pasting from web pages. πŸ’° Cost Efficiency: Automate repeatable monitoring instead of paying staff/consultants for manual screenshotting. --- ## 🎯 How to Use 1. Sign Up: Create a free account w/ $5 credit – two minutes 2. Open the Actor: Visit the ClinicalTrials.gov Scraper on Apify 3. Pick Your Approach: Paste a start URL or configure the filters (condition, status, location, date ranges, etc.) 4. Run It: Click "Start". Progress logs will show every API page fetched 5. Download Data: Grab the dataset as CSV/Excel/JSON or connect via webhook/API ⏰ Total Time: ~3 minutes for the first run, seconds for repeats. βœ… No Technical Skills Required: Everything is point and click. --- ## πŸ’Ό Business Use Cases Clinical Operations - Monitor recruiting competition in target indications - Validate site availability and contact info before outreach - Track protocol/SAP/ICF availability for submission packs Market Intelligence & BD - Size therapeutic landscapes by phase, sponsor class, and geography - Identify partners/co-developers by filtering collaborators or funder class - Flag terminated or withdrawn programs for acquisition scouting Medical Affairs / HEOR - Build custom registries of outcome measures for meta-analyses - Follow expanded access activity for patient advocacy - Keep real-time lists of results posted vs. overdue Compliance / Regulatory - Audit FDAAA violation flags and results submission states - Archive document download links for internal records - Automate evidence packages for periodic reporting --- ## ❓ Frequently Asked Questions Q: Do I need a browser or proxy? A: No. We call the official Studies API, so there is no headless browser, no proxy juggling, and no HTML parsing. Q: How current is the data? A: Exactly as current as ClinicalTrials.gov. Each run reads directly from their backend API. Q: What if I exceed maxItems? A: Free users are automatically limited to 100 items (with a warning if they specify more). Paid users can set maxItems anywhere up to 1,000,000. The actor paginates until the limit or the dataset is exhausted. Q: Can I reuse a ClinicalTrials.gov URL? A: Yes. Paste the URL you see in your browser after applying filters. We convert it into API parameters automatically. When using a startUrl, all other filters are ignored. Q: What happens if the API returns an error? A: The actor retries three times with exponential backoff. If it still fails, we log the friendly error, push it to the dataset, and exit gracefully. Q: Are empty fields included in the output? A: Empty strings and arrays are automatically converted to null for cleaner datasets and easier analysis. --- ## πŸ”Œ Integrate the ClinicalTrials.gov Scraper with any app This actor plugs into the rest of your stack through Apify integrations: - Make - Zapier - Slack - Airbyte - GitHub - Google Drive - or trigger your own webhooks whenever a run finishes --- ## πŸ”— Recommended Actors Looking for more data collection tools? Check out these related actors: | Actor | Description | Link | |-------|-------------|------| | FINRA BrokerCheck Scraper | Collect broker/advisor registrations and disclosures from FINRA | https://apify.com/parseforge/finra-brokercheck-scraper | | FAA Aircraft Registry Scraper | Monitor US aircraft ownership and certification data | https://apify.com/parseforge/faa-aircraft-registry-scraper | | HubSpot Marketplace Scraper | Track go-to-market tools listed on HubSpot's app store | https://apify.com/parseforge/hubspot-marketplace-scraper | | Lead Formatter Tool | Clean and enrich contact records with AI | https://apify.com/parseforge/lead-formatter | | GSA eLibrary Scraper | Extract government services and solutions data from GSA eLibrary | https://apify.com/parseforge/gsa-elibrary-scraper | Pro Tip: πŸ’‘ Browse our complete collection of data collection actors to find the perfect tool for your business needs. --- πŸ’¬ Need Help? Our team is ready to help you connect this actor to your workflow. --- > ⚠️ Disclaimer: This Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by ClinicalTrials.gov or the U.S. National Library of Medicine. All trademarks are the property of their respective owners.

Common Use Cases

Market Research

Gather competitive intelligence and market data

Lead Generation

Extract contact information for sales outreach

Price Monitoring

Track competitor pricing and product changes

Content Aggregation

Collect and organize content from multiple sources

Ready to Get Started?

Try Clinicaltrials Scraper now on Apify. Free tier available with no credit card required.

Start Free Trial

Actor Information

Developer
parseforge
Pricing
Paid
Total Runs
38
Active Users
5
Apify Platform

Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.

Learn more about Apify

Need Professional Help?

Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.

Find a Specialist

Trusted by millions | Money-back guarantee | 24/7 Support