Flashlabs releases the world’s first open-source voice cloning model

3 points posted 16 days ago
by sangwen

2 Comments

kuandriy

16 days ago

The end-to-end speech-to-speech claim is interesting, especially since it avoids the ASR→LLM→TTS pipeline, which is where most of the latency and error compounding happens.
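
To make the compounding point concrete, here is a rough back-of-the-envelope sketch. The per-stage latencies and accuracies are made-up placeholders, not measurements of any real system, and the model assumes the stages run strictly one after another (no streaming overlap) with independent errors.

```python
from dataclasses import dataclass


@dataclass
class Stage:
    name: str
    latency_ms: float  # hypothetical per-stage latency
    accuracy: float    # hypothetical chance the stage output is usable


def cascade(stages):
    """Return (total latency, end-to-end success probability) for a serial pipeline."""
    # Latencies add up because each stage waits for the previous one.
    total_latency = sum(s.latency_ms for s in stages)
    # Errors compound multiplicatively if stage failures are independent.
    success = 1.0
    for s in stages:
        success *= s.accuracy
    return total_latency, success


# Illustrative numbers only.
PIPELINE = [
    Stage("ASR", 300.0, 0.95),
    Stage("LLM", 500.0, 0.95),
    Stage("TTS", 200.0, 0.95),
]

latency_ms, success = cascade(PIPELINE)
print(f"latency: {latency_ms:.0f} ms, end-to-end success: {success:.3f}")
```

With three stages at 95% each, the chain as a whole lands near 86%, and the latency is the sum of all three. Whether a single end-to-end model actually does better depends on its own latency and error rate, which this sketch does not model.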