State-of-the-art voice cloning based on LongCat-AudioDiT by the Meituan LongCat Team. Give it a reference audio, type your text, get the result.
Research & Testing Only. This tool is provided strictly for research, educational, and personal experimentation purposes. It is not intended for generating deceptive, misleading, or harmful content. Do not use it to impersonate real individuals without their explicit consent, to create non-consensual deepfakes, or for any activity that violates applicable laws or regulations. By using this tool you accept full responsibility for ensuring your use complies with all relevant laws in your jurisdiction.
Memory Mode
Device
Reference Voice
Saved Voices
Whisper Model for Auto-Transcribe
Language (auto=detect)
Save this voice to library
Text to Synthesise
AudioDiT Model
Guidance
464
110
Synthesise speech without a reference voice. The model picks a random voice — useful for testing or when you just need audio.
Model
Guidance
464
110
Transcribe any audio file with Whisper — output is plain text.
Whisper Model
Language
Download models before using them. Select what you need, hit Download, watch the live log. Already-downloaded models are skipped automatically.