Flashlabs releases the world’s first open-source voice cloning model

3 pointsposted 11 hours ago
by sangwen

2 Comments

kuandriy

10 hours ago

The end-to-end speech-to-speech claim is interesting, especially avoiding the ASR→LLM→TTS pipeline, which is where most latency and error compounding happens.