Training large language models on narrow tasks can lead to broad misalignment

5 points, posted 23 days ago
by thebeardisred

1 Comment