DRG Phantom Core - Genesis Pilot

DRG Phantom Core - Genesis Pilot

by tuguidragos

A stealth-grade autonomous lead intelligence engine that discovers, enriches, analyzes, and qualifies B2B prospects using multi-source scraping and AI...

55 runs
6 users
Try This Actor

Opens on Apify.com

About DRG Phantom Core - Genesis Pilot

A stealth-grade autonomous lead intelligence engine that discovers, enriches, analyzes, and qualifies B2B prospects using multi-source scraping and AI scoring. This pilot release showcases the system’s core capabilities and foundational architecture.

What does this actor do?

DRG Phantom Core - Genesis Pilot is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.

Key Features

  • Cloud-based execution - no local setup required
  • Scalable infrastructure for large-scale operations
  • API access for integration with your applications
  • Built-in proxy rotation and anti-blocking measures
  • Scheduled runs and webhooks for automation

How to Use

  1. Click "Try This Actor" to open it on Apify
  2. Create a free Apify account if you don't have one
  3. Configure the input parameters as needed
  4. Run the actor and download your results

Documentation

Apify Actor Node.js TypeScript Playwright Cheerio OpenAI Anthropic Claude SerpAPI Clearbit People Data Labs License: Proprietary

Discover. Enrich. Score. Convert.

--- ## What It Does in 15 Seconds - Finds companies via search/keywords - Enriches them with full B2B data - Finds contacts and verifies emails - Detects buying signals - Scores leads with AI - Pushes to CRM or email platform ## Ideal For - Cold email agencies - Scale your outreach with quality leads - SDR teams - Focus on selling, not researching - B2B SaaS companies - Find your next 1,000 customers - Lead generation freelancers - Deliver results faster - Growth marketers - Automate top-of-funnel acquisition - Automation builders - Integrate with n8n, Make, Zapier ## Example Inputs No technical knowledge required. Just describe what you're looking for: - "Find SaaS companies hiring SDRs in Germany" - "Discover HVAC businesses in Texas" - "Find e-commerce brands using Shopify and Facebook Ads" - "Enrich my CSV of 200 company domains" - "Get all Series A funded fintech startups in UK" - "Find marketing agencies with 10-50 employees" - "Discover B2B software companies expanding to Europe" ## Performance & Limits | Operation | Speed | Notes | |-----------|-------|-------| | Company Discovery | 100–500/minute | Depends on search complexity | | Company Enrichment | 10–50/minute | Multi-source data fusion | | AI Lead Scoring | 50–200/minute | GPT-4 or Claude powered | | Email Validation | 500+/minute | MX + SMTP verification | | Contact Discovery | 25–100/minute | Decision-maker identification | Scale: Automatically scales with Apify infrastructure Max per run: 1,000 companies (pilot limit) Concurrency: Adaptive based on data sources ## Why This Actor vs Clay/Apollo/etc. | Feature | DRG Phantom Core | Traditional Tools | |---------|------------------|-------------------| | Pricing | Pay-per-use only | $500-2000/month subscriptions | | Data Ownership | 100% yours | Platform-locked | | API Architecture | Full API access | Limited API quotas | | Custom Scoring | Build your own AI models | Fixed scoring algorithms | | Data Sources | 10+ enrichment sources | 2-3 sources maximum | | Automation | Native webhook/API support | Requires Zapier/integrations | | Flexibility | Run anywhere, anytime | Browser-only access | | No Lock-in | Export everything | Credits expire monthly | ## Pilot Release Notice (Genesis Build) Version 0.2.0 - Genesis Build This is the inaugural pilot release of DRG Phantom Core, designated as the "Genesis Build". ### Release Status - Pilot Build: Version 0.2.0, first public pilot - Source Code: NOT publicly visible on Apify platform - Access: Limited pilot release with controlled distribution - Feedback: Actively collecting user feedback for v0.3 ## Quick Start Guide ### 1. Simple Mode (Keywords Only) Just add keywords and run: json { "mode": "discovery_and_enrichment", "keywords": ["B2B software companies"], "maxCompanies": 50 } ### 2. Targeted Mode (With Filters) Add industry and location filters: json { "mode": "discovery_and_enrichment", "keywords": ["SaaS companies"], "industries": ["software"], "countries": ["US", "GB"], "minEmployees": 10, "maxEmployees": 500, "maxCompanies": 100 } ### 3. AI Scoring Mode (With ICP) Enable AI-powered lead qualification: json { "mode": "discovery_and_enrichment", "keywords": ["marketing agencies"], "aiEnabled": true, "aiProvider": "openai", "icpDescription": "Marketing agencies with 10-50 employees focused on B2B clients, using modern martech stack, showing growth signals", "maxCompanies": 100 } ### 4. Enrichment Only Mode Have a list already? Just enrich it: json { "mode": "enrichment_only", "companyDomains": ["example.com", "acme.com", "company.io"], "maxContactsPerCompany": 5, "emailValidationLevel": "mx" } ## Test Your Config (Dry Run) Before spending compute credits, test your configuration with a dry run. This validates your input and estimates costs without making any API calls or consuming credits. json { "dryRun": true, "mode": "discovery_and_enrichment", "keywords": ["B2B SaaS companies"], "industries": ["software"], "maxCompanies": 100, "aiProvider": "openai" } ### What You Get The dry run returns a comprehensive report: | Section | Information | |---------|-------------| | Configuration Summary | Mode, keywords, industries, limits | | Enabled Features | Which features are active based on your API keys | | Integration Status | CRM, Mailer, Webhook configurations | | Cost Estimation | Estimated API cost range (min-max) | | Run Time Estimate | Expected duration in minutes | | Helpful Tips | Suggestions to optimize your configuration | ### Example Dry Run Output json { "_type": "DRY_RUN_REPORT", "status": "dry_run_complete", "estimation": { "costRange": { "min": 0.50, "max": 1.25, "currency": "USD" }, "expectedCompanies": 100, "expectedContacts": 500, "estimatedRunTimeMinutes": 12 }, "validation": { "inputValid": true, "tips": [ "Enable AI scoring for better lead qualification", "Consider using 'mx' email validation for higher deliverability" ] } } > 💡 Tip: Always run a dry run first when testing new configurations or large runs. ## Modern Crawling Layer (2025-grade) The Genesis Pilot now ships with a production-ready crawling orchestrator: - Adaptive crawler selection – starts with ultra-fast Cheerio fetches and automatically escalates to Playwright when a domain shows JS challenges, CAPTCHAs or Cloudflare/Akamai fingerprints. - Domain strategy cache – once a domain is identified as JS-heavy it is routed straight to Playwright for the rest of the run. - Proxy orchestration & throttling – sticky Apify residential sessions per domain, adaptive retry/backoff for 4xx/5xx, per-domain concurrency limits and jittered delays. - Smart timeout / retry logic – 5xx => exponential backoff, 429 => wait + rotate proxy, 403 => rotate + escalate to Playwright, with automatic domain quarantine when blocks persist. - Realistic browser fingerprinting – rotating desktop/mobile user agents, viewport pools, timezone alignment with proxies, stealth scripts and human-like scrolling/waiting. - Content-based block detection – CAPTCHA / "are you a robot" / Cloudflare challenge pages trigger proxy/session rotation and, if needed, domain-level fallback to Playwright. ### Crawl Configuration json { "crawlMode": "balanced", "crawlConcurrency": 6, "maxRequestsPerDomain": 2, "minDelayMs": 200, "maxDelayMs": 800, "networkTimeoutMs": 30000 } ## Key Features ### Discovery & Intelligence Gathering - SERP Discovery: Advanced Google scraping with custom keyword targeting - API Discovery: SerpAPI integration for enhanced search capabilities - Industry Targeting: Precise filtering by industry verticals - Geographic Precision: Country and region-specific targeting - Company Sizing: Employee count and revenue-based filtering ### Enrichment & Data Fusion - Company Intelligence: Multi-source enrichment (Clearbit, Crunchbase) - Contact Discovery: Decision-maker identification via PDL and Proxycurl - Tech Stack Analysis: Technology detection with Wappalyzer/BuiltWith - Email Validation: Multi-tier validation (regex, MX, SMTP) - LinkedIn Enrichment: Professional profile data extraction ### Signal & Intent Detection - Hiring Signals: Active job posting detection - Funding Intelligence: Recent investment round tracking - Growth Indicators: Traffic and expansion signals - Buying Intent: Behavioral signal analysis ### AI Qualification Engine - Advanced Scoring: Multi-model AI evaluation (GPT-4, Claude) - ICP Matching: Ideal Customer Profile alignment scoring - Approach Generation: Personalized outreach strategies - Message Crafting: AI-generated outreach snippets ### Integration & Export - CRM Systems: HubSpot, Pipedrive, GoHighLevel, Close.io - Email Platforms: Instantly, Smartlead, Lemlist - Data Warehouses: BigQuery, Redshift export - Webhook Support: Custom integrations and alerts - Notifications: Slack and Teams integration ## Input Parameters #### Required Parameters | Parameter | Type | Description | Example | |-----------|------|-------------|---------| | mode | string | Operating mode | "discovery_and_enrichment" | | keywords | string[] | Search keywords for discovery | ["B2B software", "SaaS"] | #### Target Configuration | Parameter | Type | Description | Default | |-----------|------|-------------|---------| | industries | string[] | Target industries | ["software", "technology"] | | countries | string[] | Country codes (ISO) | ["US"] | | minEmployees | number | Minimum company size | 10 | | maxEmployees | number | Maximum company size | 1000 | | excludeIndustries | string[] | Industries to exclude | [] | #### Discovery Settings | Parameter | Type | Description | Default | |-----------|------|-------------|---------| | maxCompanies | number | Maximum companies to process | 50 | | serpMaxPagesPerKeyword | number | Search pages per keyword | 3 | | serpSource | string | Search provider | "apify" | #### AI Configuration | Parameter | Type | Description | Default | |-----------|------|-------------|---------| | aiEnabled | boolean | Enable AI scoring | true | | aiProvider | string | AI provider | "openai" | | aiModel | string | Model to use | "gpt-4o-mini" | | icpDescription | string | Ideal Customer Profile | Required for AI mode | #### Integration Settings | Parameter | Type | Description | |-----------|------|-------------| | crmPlatform | string | CRM platform (hubspot, pipedrive, gohighlevel, close) | | crmApiKey | string | CRM API key or token | | mailerPlatform | string | Email platform (instantly, smartlead, lemlist) | | mailerApiKey | string | Email platform API key | | webhookUrl | string | Webhook endpoint | ## Cost Estimation Understanding the costs before running helps you budget effectively. ### Apify Compute Costs | Run Size | Memory | Estimated Duration | Compute Cost | |----------|--------|-------------------|--------------| | 50 companies | 4GB | 5-10 min | ~$0.10-0.20 | | 100 companies | 4GB | 10-20 min | ~$0.20-0.40 | | 500 companies | 8GB | 45-90 min | ~$0.80-1.50 | | 1000 companies | 8GB | 90-180 min | ~$1.50-3.00 | ### External API Costs (Pay-per-use) | Provider | Cost Per Call | Typical Usage | Estimated Cost/100 companies | |----------|--------------|---------------|------------------------------| | OpenAI (GPT-4o-mini) | ~$0.0015/1K tokens | 1 call/company | ~$0.15 | | Clearbit | Free tier: 50/mo | 1 call/company | Free or ~$0.10 | | PDL | ~$0.03/match | 5 contacts/company | ~$15.00 | | SerpAPI | ~$0.01/search | 1 search/keyword | ~$0.05 | ### Total Cost Example For 100 companies with full enrichment + AI scoring: - Apify compute: ~$0.30 - OpenAI scoring: ~$0.15 - PDL contacts: ~$15.00 (optional) - Total: ~$0.45 - $15.45 depending on contact discovery > 💡 Cost Saving Tips: > - Use dryRun: true to estimate before running > - Start with emailValidationLevel: "mx" (free) instead of full SMTP > - Limit maxContactsPerCompany to reduce PDL costs > - Use "serpSource": "apify" (included) instead of SerpAPI ## Integrations ### CRM Platforms - HubSpot - Direct contact and company creation - Pipedrive - Lead and organization sync - GoHighLevel - Full contact management - Close.io - Lead import with custom fields ### Email Platforms - Instantly - Campaign-ready lead import - Smartlead - Sequence enrollment - Lemlist - Personalized campaign creation ### Data Warehouses - BigQuery - Structured data export - Redshift - Analytics-ready datasets ### Automation - Webhooks - Real-time data streaming - Zapier/Make/n8n - Workflow automation - Slack/Teams - Instant notifications ## Output Format ### Company Data json { "companyName": "Acme Corp", "companyDomain": "acme.com", "companyIndustry": "Software", "companyEmployees": 250, "companyCountry": "US", "companyDescription": "Leading B2B software", "companyFunding": "Series B", "companyRevenue": "$10-50M", "companyTechStack": ["React", "Node.js", "AWS"], "companyLinkedIn": "linkedin.com/company/acme" } ### Contact Information json { "contactFullName": "John Smith", "contactTitle": "VP of Engineering", "contactEmail": "john.smith@acme.com", "contactEmailStatus": "valid", "contactDepartment": "Engineering", "contactSeniority": "VP", "contactLinkedIn": "linkedin.com/in/johnsmith" } ### AI Intelligence json { "leadScore": 85, "leadFitLevel": "high", "leadReason": "Strong ICP match, recent funding", "leadAngle": "Focus on scaling engineering", "leadSnippet": "Hi John, noticed Acme's Series B...", "buyingSignals": ["hiring", "funding", "expansion"] } ## API Provider Setup ### Required for Full Functionality | Provider | Purpose | Free Tier | Why Needed | |----------|---------|-----------|------------| | OpenAI/Anthropic | AI scoring | Pay-per-use | Lead qualification | | Clearbit | Company data | 50/month | Company enrichment | | PDL | Contacts | Trial available | Find decision-makers | ### Optional Enhancements | Provider | Purpose | Impact | |----------|---------|--------| | SerpAPI | Better search | 10x more results | | Proxycurl | LinkedIn data | Professional profiles | | Wappalyzer | Tech detection | Accurate tech stacks | | BuiltWith | Tech analysis | Historical tech data | ## Error Handling The actor handles errors gracefully and continues processing when possible. ### Common Scenarios | Scenario | Behavior | |----------|----------| | Missing API Key | Feature disabled with warning, run continues | | Invalid API Key | Error logged, feature skipped, run continues | | Rate Limited (429) | Automatic retry with exponential backoff | | API Timeout | 3 retries, then skips to next company | | Invalid Input | Run fails fast with clear error message | | Network Error | Automatic retry with proxy rotation | ### Error Messages in Output Failed enrichments include an error field: json { "companyDomain": "example.com", "enrichmentStatus": "partial", "error": "Clearbit: API key invalid, PDL: Rate limited" } ### Monitoring Errors - Check Apify Console logs for detailed error traces - Enable Slack/Teams webhook for real-time error alerts - Use webhookUrl for custom error handling ## Rate Limits Be aware of external API rate limits when planning large runs. ### Provider Limits | Provider | Rate Limit | Recommended Concurrency | |----------|------------|------------------------| | Clearbit | 600/min | 10 req/sec | | PDL | 100/min | 1-2 req/sec | | OpenAI | 3500 RPM (GPT-4o-mini) | 50 req/sec | | SerpAPI | 100/min | 1-2 req/sec | | HubSpot | 100/10sec | 10 req/sec | | Pipedrive | 100/10sec | 10 req/sec | ### Built-in Protections - Automatic throttling per domain and per API - Queue-based processing to respect limits - Retry with backoff on 429 responses - Domain quarantine for persistent blocks > 💡 Tip: For runs > 500 companies, use incrementalMode: true to spread processing across multiple runs. ## Security & Compliance ### Data Protection - GDPR Compliant - EU data standards - CCPA Compliant - California privacy - SOC 2 Ready - Enterprise security - Encrypted Storage - All data encrypted - API Key Protection - Secure credential handling ### Ethical Standards - Respects robots.txt - Rate limiting enforced - No unauthorized PII - Opt-out support - Terms of Service compliance ## Support & Resources ### Getting Help - Documentation: This README + inline input descriptions - Issues: Report through Apify Console - Feature Requests: Contact via Apify messaging ### Pilot Program Benefits As a pilot participant: - Priority support response - Regular version updates - Feature request priority - Performance optimization assistance ### Feedback Welcome We actively collect feedback to improve: - Performance issues - Feature suggestions - Integration requests - Use case examples ---

Built with 🩶 for the Apify community 🫡

Common Use Cases

Market Research

Gather competitive intelligence and market data

Lead Generation

Extract contact information for sales outreach

Price Monitoring

Track competitor pricing and product changes

Content Aggregation

Collect and organize content from multiple sources

Ready to Get Started?

Try DRG Phantom Core - Genesis Pilot now on Apify. Free tier available with no credit card required.

Start Free Trial

Actor Information

Developer
tuguidragos
Pricing
Paid
Total Runs
55
Active Users
6
Apify Platform

Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.

Learn more about Apify

Need Professional Help?

Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.

Find a Specialist

Trusted by millions | Money-back guarantee | 24/7 Support