CorentinJ: Real-Time Voice Cloning (2021)

94 points, posted 5 months ago
by redbell

24 Comments

hleszek

5 months ago

For some reason, the VibeVoice model from Microsoft (which can also clone voices and is also very good) was deleted from GitHub 10 days ago, even though it was released under an MIT license. But this post (from 2021) shows that the cat has been out of the bag for some time, and we have to live with this technology.

qwertox

5 months ago

The reason is known: "we discovered instances where the tool was used in ways inconsistent with the stated intent"

askl

5 months ago

The stated intent would be scamming people, I guess? What would be the other ways inconsistent with that?

numpad0

5 months ago

NSFW? That seems to be a bigger deal on the Internet today than scams, somehow.

frank_nitti

5 months ago

Honest question - is NSFW just a code word for pornography now?

I had thought it meant anything that isn’t safe to open at work, including things with extreme profanity or gore, etc.

numpad0

5 months ago

To me, the word feels like it's almost a synonym for anime, pornographic or not, with a hint of negativity.

whimsicalism

5 months ago

there are many easy extant ways to do voice cloning. many models are released without a “voice embedding” model, but those are easy to recreate by passing the gradients through the soft prompt

nickthegreek

5 months ago

any links still up?

hleszek

5 months ago

cchance

5 months ago

And that's why people need to clone these repos from big companies when they're first released.

anonymousiam

5 months ago

Cloning the repo isn't enough, because Microsoft/GitHub still control the platform and can delete all copies they have control over.

dceddia

5 months ago

Cloning the repo (running git clone on your computer) is enough, because it makes a local copy. Forking, though, merely makes a copy under your account on GitHub, which is not going to survive if they go on a deleting spree.

anonymousiam

5 months ago

Yes, you are correct. I used the word "clone" when I should have used the word "fork" instead.

ivape

5 months ago

Does anyone know for sure if Voice ID has security measures to protect against AI voice cloning?

user

5 months ago

[deleted]

blurbleblurble

5 months ago

Atomic bomb level technological shifts happening, open sourced online. What a time to be alive!

avereveard

5 months ago

(2021)

niek_pas

5 months ago

In fact, the YouTube video the GitHub repo links to is from 2019.

jasonjmcghee

5 months ago

Papers are all from 2017-2018

blurbleblurble

5 months ago

Doesn't change the fact that it's really epic

jasonjmcghee

5 months ago

The way you wrote your comment and the chosen example give the vibe of "this is super dangerous" and that it's actively happening, so everyone is pointing out this stuff is from a while ago.

p2detar

5 months ago

It looks rather complete, but indeed, the project has had only 3 commits in the past 3 years.