![](https://www.youtube.com/watch?v=HT-UZkiOLv8)

From [Andrej Karpathy](https://twitter.com/karpathy/status/1884336943321997800/?rw_tt_thread=True):

> it's when an AI, trained via the trial-and-error process of reinforcement learning, discovers actions that are new, surprising, and secretly brilliant even to expert humans. It is a magical, just slightly unnerving, emergent phenomenon only achievable by large-scale reinforcement learning. You can't get there by expert imitation. It's when AlphaGo played move 37 in Game 2 against Lee Sedol, a weird move that was estimated to only have 1 in 10,000 chance to be played by a human, but one that was creative and brilliant in retrospect, leading to a win in that game

[[Move 37 Is the Word-of-Day - It's When an AI...]]

#published 2025-03-01