Summary XL

Summary XL

by flamboyant_leaf

Summary XL analyzes data from APIs, websites, or files to create focused summaries. It integrates with Apify to quickly process datasets like social media for specific topics.

305 runs
6 users
Try This Actor

Opens on Apify.com

About Summary XL

Ever feel like you're drowning in data? Summary XL is the Apify actor I use to quickly make sense of it all. It's built to handle information from practically anywhere—pull in data from an API, scrape it from a website, or upload your own files. You tell it what you're looking for, and it goes to work, distilling everything down to clear, concise summaries. For me, its real strength is how it connects directly with other Apify actors. I often use it to process large datasets of social media posts or forum discussions, asking it to focus only on mentions of a specific product or topic. Instead of sifting through thousands of entries manually, I get a digestible report in minutes. It saves a huge amount of time on research, market analysis, or just getting up to speed on a new subject. If you need to turn overwhelming data into actionable insights without the manual grind, this is the tool to set up.

What does this actor do?

Summary XL is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.

Key Features

  • Cloud-based execution - no local setup required
  • Scalable infrastructure for large-scale operations
  • API access for integration with your applications
  • Built-in proxy rotation and anti-blocking measures
  • Scheduled runs and webhooks for automation

How to Use

  1. Click "Try This Actor" to open it on Apify
  2. Create a free Apify account if you don't have one
  3. Configure the input parameters as needed
  4. Run the actor and download your results

Documentation

SummaryXL

SummaryXL is an Apify actor that automatically generates comprehensive AI summaries from any dataset. It processes your data as plain text and uses GPT-4o-mini to produce a detailed summary, helping you quickly understand large volumes of information without manual analysis.

For a complete tutorial on using SummaryXL to analyse Reddit comments in bulk, click here.

Key Features

  • AI-Powered Summaries: Uses GPT-4o-mini to generate coherent, long-form summaries.
  • Customizable Length: Set your summary length between 100 and 2000 words.
  • Topic Focus: Optionally provide a focusTopic to steer the summary towards specific themes.
  • Cost Tracking: Output includes detailed token usage and estimated API costs.
  • Flexible Output: Receive summaries as a dataset record, a formatted PDF, or via optional email delivery.
  • Seamless Integration: Works as an integration with any other Apify actor that outputs a dataset.

How to Use

As an Integration (Recommended)

  1. Add the SummaryXL integration to your primary actor (e.g., a Reddit or Twitter scraper).
  2. In the integration's datasetId field, enter {{resource.defaultDatasetId}} to automatically pass the dataset.
  3. Configure your openaiApiKey and any other parameters.
  4. Run your main actor. When it finishes, SummaryXL will process the dataset and output the summary.

For Direct Testing

You can run SummaryXL directly by providing a specific datasetId and your openaiApiKey in the actor input.

Input Parameters

Parameter Type Required Default Description
datasetId string No - The dataset ID to summarize. As an integration, use {{resource.defaultDatasetId}}.
openaiApiKey string Yes - Your OpenAI API key.
summaryLength integer No 1000 Target summary length in words (100-2000).
focusTopic string No - Optional topic to focus the summary on.
targetEmail string No - Email for summary delivery (requires EMAIL_SUPPORT=true).

Output Formats

The actor provides the summary in multiple formats:

  1. Dataset Record: A JSON object containing the summary text, token usage, and cost breakdown.
    json { "summary": "The AI-generated summary text...", "tokenUsage": { "promptTokens": 1234, "completionTokens": 567, "totalTokens": 1801 }, "costs": { "promptCost": 0.001234, "completionCost": 0.000567, "totalCost": 0.001801 } }

  2. PDF File: A formatted PDF is saved to the key-value store under the key OUTPUT, accessible via the Apify console.

  3. Email (Optional): If EMAIL_SUPPORT is enabled and a targetEmail is provided, the summary is sent in HTML and plain text formats.

Configuration & Environment

Control additional behavior via these environment variables:
* EMAIL_SUPPORT: Enable/disable email functionality (default: true).
* MAX_PARALLEL_REQUESTS: Control parallel API requests (default: 10).
* PROMPT_TOKEN_COST: Cost per 1M prompt tokens for tracking (default: 0.150).
* COMPLETION_TOKEN_COST: Cost per 1M completion tokens for tracking (default: 0.075).

Example Input Configurations

Basic Integration:

{
  "openaiApiKey": "your-openai-api-key",
  "summaryLength": 1500,
  "focusTopic": "pricing trends"
}

Direct Test Run:

{
  "datasetId": "your-dataset-id",
  "openaiApiKey": "your-openai-api-key",
  "summaryLength": 1000
}

With Email Delivery:

{
  "openaiApiKey": "your-openai-api-key",
  "targetEmail": "user@example.com",
  "focusTopic": "customer feedback"
}

Common Use Cases

Market Research

Gather competitive intelligence and market data

Lead Generation

Extract contact information for sales outreach

Price Monitoring

Track competitor pricing and product changes

Content Aggregation

Collect and organize content from multiple sources

Ready to Get Started?

Try Summary XL now on Apify. Free tier available with no credit card required.

Start Free Trial

Actor Information

Developer
flamboyant_leaf
Pricing
Paid
Total Runs
305
Active Users
6
Apify Platform

Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.

Learn more about Apify

Need Professional Help?

Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.

Find a Specialist

Trusted by millions | Money-back guarantee | 24/7 Support