Show HN: How are Markov chains so different from tiny LLMs?

12 pointsposted 10 hours ago
by JPLeRouzic

Item id: 45958004

1 Comments

MarkusQ

5 hours ago

LLMs include mechanisms (notably, attention) that allow longer-distance correlations than you could get with a similarly-sized Markov chain. If you squint hard enough though, they are Markov chains with this "one weird trick" that makes them much more effective for their size.