Deconstructing Autoresearch: A Complete Technical Deep Dive into Autonomous ML Research
Inside Andrej Karpathy's `autoresearch` repo: a comprehensive mathematical teardown of the SSSL attention pattern, BPB scaling, Newton-Schulz iterations in the Muon optimizer, and zero-waste BPE tokenization pipelines. What happens when the ML researcher is a machine?