Text to speech generator
by akash9078
Generate natural, human-like speech from text with 50+ voices in 9 languages. Perfect for voiceovers, audiobooks, podcasts, and accessible content.
Opens on Apify.com
About Text to speech generator
Need to turn written content into spoken audio without hiring a voice actor? This text-to-speech generator is my go-to for exactly that. It uses some genuinely impressive AI models to produce voices that don't sound robotic—they have natural cadence and inflection. I've used it to quickly create voiceovers for video tutorials and to prototype audio for app features. You get access to over 50 high-quality voices spanning nine different languages, which is a lifesaver for projects that need multilingual support or just a specific vocal tone. It’s incredibly handy for content creators looking to repurpose blog posts into podcast snippets, for developers building accessibility features into their applications, or for anyone producing audiobooks or social media content on a timeline. The setup is straightforward; you feed it text, select your voice and language, and it delivers a clean audio file. It saves me hours of manual recording and editing, and the output quality consistently sounds professional.
What does this actor do?
Text to speech generator is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.
Key Features
- Cloud-based execution - no local setup required
- Scalable infrastructure for large-scale operations
- API access for integration with your applications
- Built-in proxy rotation and anti-blocking measures
- Scheduled runs and webhooks for automation
How to Use
- Click "Try This Actor" to open it on Apify
- Create a free Apify account if you don't have one
- Configure the input parameters as needed
- Run the actor and download your results
Documentation
Text-to-Speech Generator Actor
An Apify actor that converts text into high-quality, natural-sounding speech using AI. It supports multiple languages and voices, is optimized for performance, and outputs professional-grade WAV audio files.
Overview
This actor is a text-to-speech (TTS) generator powered by advanced AI models. You provide text, and it returns a downloadable audio file. It's built for reliability and speed, with pre-loaded models and GPU acceleration support. Common use cases include creating voiceovers, audiobook narration, podcast content, accessibility tools, and multilingual audio applications.
Key Features
- Multi-language & Voices: Synthesizes speech in 9 languages (English, Spanish, French, Hindi, Italian, Portuguese, Japanese, Chinese) using over 50 distinct premium voices.
- Performance Optimized: Starts up quickly (~2 seconds) using pre-loaded models and automatically uses CUDA for GPU acceleration when available.
- Flexible Controls: Adjust speech speed from 0.1x to 3.0x the normal rate.
- Robust Processing: Handles long texts efficiently via automatic chunking and includes voice validation to match voices with the correct language.
- Professional Output: Generates broadcast-quality, uncompressed 24kHz WAV audio files.
- Detailed Metrics: Returns audio duration, file size, and processing performance data.
How to Use
Configure the actor's input via the Apify platform or API. The core required parameter is the text you want to convert.
Basic Input Configuration
| Parameter | Type | Description | Default |
|---|---|---|---|
text |
string | Required. The text to convert to speech. | - |
voice |
string | The premium voice identifier for synthesis. | af_heart |
lang |
string | Language code for the input text (e.g., en-us, fr-fr). |
en-us |
speed |
string | Speech speed multiplier. Accepts values from 0.1 to 3.0. |
1.0 |
Example Inputs
Basic English TTS:
{
"text": "Welcome to our advanced text-to-speech system.",
"voice": "af_heart",
"lang": "en-us",
"speed": "1.0"
}
Multilingual Synthesis:
{
"text": "Bonjour et bienvenue.",
"voice": "ff_siwis",
"lang": "fr-fr",
"speed": "1.2"
}
Voice Selection
Select a voice parameter from the lists below. The actor validates that the voice matches the specified language.
- English (US/GB):
af_heart(F),am_adam(M),bf_alice(F),bm_daniel(M), and many more. - Spanish:
ef_dora(F),em_alex(M),pf_dora(F). - French:
ff_siwis(F). - Hindi:
hf_alpha(F),hm_omega(M). - Italian:
if_sara(F),im_nicola(M). - Portuguese:
pf_dora(F),pm_alex(M). - Japanese:
jf_alpha(F),jm_kumo(M). - Chinese:
zf_xiaoxiao(F),zm_yunxi(M).
Input/Output
Input
Provide input as a JSON object containing the parameters defined in the How to Use section.
Output
The actor's dataset contains a detailed JSON object for each run. The primary output includes:
- Audio File: A high-quality WAV format audio file available for download.
- Public URL: A direct, publicly accessible URL to the generated audio file.
- Audio Metadata: Duration (seconds), file size (bytes), and sample rate (24000 Hz).
- Configuration Used: The voice, language, and speed settings applied.
- Performance Metrics: Processing time and pipeline initialization time.
A typical output item structure:
{
"audioUrl": "https://api.apify.com/v2/actor-runs/RUN_ID/datasets/DATASET_ID/items/ITEM_ID?attachment=audio.wav",
"duration": 4.75,
"fileSize": 571308,
"format": "wav",
"sampleRate": 24000,
"voice": "af_heart",
"language": "en-us",
"speed": "1.0",
"processingTime": 1.24
}
Categories
Common Use Cases
Market Research
Gather competitive intelligence and market data
Lead Generation
Extract contact information for sales outreach
Price Monitoring
Track competitor pricing and product changes
Content Aggregation
Collect and organize content from multiple sources
Ready to Get Started?
Try Text to speech generator now on Apify. Free tier available with no credit card required.
Start Free TrialActor Information
- Developer
- akash9078
- Pricing
- Paid
- Total Runs
- 153
- Active Users
- 13
Related Actors
Google Search Results Scraper
by apify
Website Content Crawler
by apify
🔥 Leads Generator - $3/1k 50k leads like Apollo
by microworlds
Video Transcript Scraper: Youtube, X, Facebook, Tiktok, etc.
by invideoiq
Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.
Learn more about ApifyNeed Professional Help?
Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.
Trusted by millions | Money-back guarantee | 24/7 Support