ML 6 Interpretability project - nanoGPT trained on a WhatsApp group chat May 13, 2024 Transformers like a physicist Apr 29, 2024 Introducing PAdam - Adam optimzer for any p-norm Nov 7, 2023 Generalization Toy Models - II Nov 2, 2023 Generalization Toy Models - I Oct 30, 2023 Thoughts about Lasso Regression Jul 26, 2023