hackernews client

ChrisArchitect

7 hours ago

Related / discussion:

OpenAI to remove non-profit control and give Sam Altman equity

https://news.ycombinator.com/item?id=41651548

OpenAI to Become For-Profit Company

https://news.ycombinator.com/item?id=41655954

nullsmack

6 hours ago

Thankfully there are many options for offline LLMs now

codeful

7 hours ago

Remember that Sam and/or the whole thing is dependent of Microsoft. It may explain shift and consolidation of power within organization.

pmarreck

7 hours ago

With the amount of debt they have, they don't really have a choice.

That said, thanks to the efforts of Meta and others, open-source AI running on your desktop is moving along at quite a pace. I can generate images superior to what is available via DALL-E/Meta/ChatGPT, and more or less completely uncensored, thanks to running FLUX.1[dev] locally (albeit slower) on the https://drawthings.ai/ app, and I can do language model work locally that gets very close to approaching, in some cases surpassing, GPT4o (albeit slower), on my M1 Macbook Pro (also uncensored, if that's what you want), and now thanks to Llama 3.2 I can also process images locally.

The only remaining things left are a good substitute for ElevenLabs' still-amazing ability to create realistic voice models of people based on a sample, voice input, multimodal interactive voice chat (i.e. Advanced Voice), more easily accessible function-calling running locally (regarding web requests, you might be able to block OpenAI, but can't block curl running from my house!), and o1-style chain-of-thought reasoning, but I think we have enough clues about how the latter works that we should see something any day now to compete with it.

(going on a tangent for a minute...)

I really want a whole-house computer that runs locally and is in charge of everything, responds like an LLM to voice commands in any voice I want (recognizing who is speaking as well), knows a bunch of things about me, has a personality I can customize like OpenAI's "custom instructions", and executes whatever functions I give it access to (searching the web, running code it's written, etc.), plus can stick to schedules. I'd be happy to pay a small licensing fee for the use of someone's voice.

I have a nightly job that coaxes me to bed at 11pm in Raphael's voice (from Baldur's Gate 3) using a dynamically-generated script from Claude. It's absolutely amazing and Andrew Wincott should seriously reach out to me to try to make a product out of it because I seem to have hit the jackpot with his voice model...

Here are 2 examples of its output: https://vocaroo.com/1kotE1UgYCoy https://vocaroo.com/1lUyZbGIHPIH (I do not know how long this will stay up on this free service, but perhaps that's for the best...)

d13

6 hours ago

ElevenLabs just uses tortoise with its own high quality recorded voice data. You could definitely do the same:

https://github.com/neonbjb/tortoise-tts

jarbus

6 hours ago

Thank god for Meta and Mistral

7e

3 hours ago

We’re so worried about AI alignment but we neglect human alignment: in this case, a sociopath heading a AGI company. Is OpenAI is a threat to humanity?

OpenAI Takes Its Mask Off To Reveal What It Really Is

9 Comments

ChrisArchitect

yababa_y

nullsmack

codeful

pmarreck

d13

jarbus

7e

user