API Reference
Welcome to the Voice Cloning API service documentation. Our API provides powerful voice cloning capabilities that allow you to generate high-quality synthetic speech based on audio samples.
API Overview
Our API offers two main voice cloning approaches:
🚀 Synchronous Voice Cloning
- Use Case: Short text (less than 1000 characters)
- Response: Immediate results
- Processing Time: Completed within seconds
- Best For: Real-time applications, short phrase generation, quick testing
⏳ Asynchronous Voice Cloning
- Use Case: Long text (over 1000 characters)
- Response: Task queue processing
- Processing Time: 2-20 minutes depending on length
- Best For: Long-form content, batch processing, high-quality generation
Getting Started
- Get API Key: Generate your API key in the user center
- Prepare Audio Sample: Upload a 5-30 second clear audio file
- Choose Endpoint: Select sync or async based on text length
- Send Request: Call the API using your preferred programming language
- Get Results: Download the generated voice file
Supported Audio Formats
- WAV: Recommended format, best quality
- MP3: Common format, good compatibility
- M4A: Apple device common format
Technical Specifications
Parameter | Specification |
---|---|
Max File Size | 4.5MB |
Audio Duration | 5-30 seconds |
Supported Languages | Chinese, English, and more |
Output Format | WAV (22kHz, 16bit, Mono) |
Authentication
All API requests require your API key in the request header:
Authorization: Bearer YOUR_API_KEY
Pricing Information
⚠️ Important Notice: API interface services are only available for Professional and Unlimited version users.
Error Handling
The API uses standard HTTP status codes and JSON-formatted error responses. All errors include a detail
field providing detailed error descriptions.
Getting Help
If you encounter issues during usage:
- Review the detailed API documentation
- Check your API key permissions
- Ensure audio samples meet requirements
- Contact technical support for assistance
Last updated on