Grokking at the Edge of Linear Separability

23 points, posted an hour ago
by marojejian

2 Comments

delichon

8 minutes ago

I think this means that when training a cat detector it's better to have more bobcats and lynx and fewer dogs.

diwank

17 minutes ago

Grokking is so cool. What does it even mean that grokking exhibits similarities to criticality? As in, what are the philosophical ramifications of this?