How Do Large Language Models Generate Text?

2 pointsposted 9 months ago
by k3ntaki

6 Comments

k3ntaki

9 months ago

Large Language Models (LLMs) have transformed the way artificial intelligence interacts with human language. While these systems are immensely powerful, their inner workings can feel like a mystery to most. This article aims to simplify the complexity behind LLMs, breaking down advanced concepts such as neural networks, and transformers in a way that's easy to grasp.

cratermoon

9 months ago

Step 1. Indiscriminately hoover up any text you can find in the name of training data.

Step 2. ???

Step 3. Profit.

k3ntaki

9 months ago

That sounds about right! but Step 3 seems working well for them...

cratermoon

9 months ago

I'm not so sure about that. Companies are burning through huge amounts of money to train and run these models, but the revenue isn't keeping up. Billions of investment, but where's the revenue? https://www.axios.com/2024/07/12/ai-bubble-revenue-missing

k3ntaki

9 months ago

Yeah, Nvidia's pocketing the revenue, but it seems we’re in an era where revenue and profit do not matter, money goes where the hype is...

user

9 months ago

[deleted]