Published inData Science CollectiveWhat does the future of AI look like if we hit the LLM scaling wall?Why small models, scaled inference, and AI agents maybe be itAug 16A response icon1Aug 16A response icon1
Published inData Science CollectiveUnderstanding reinforcement learning for model training from scratchAn intuitive treatment of RLHF, TRPO, PPO, GRPO, DPO and RLAIF. This article follows my paper here: https://arxiv.org/abs/2509.04501Aug 10A response icon2Aug 10A response icon2
Published inData Science CollectiveAn intuitive treatment of Negative log-likelihood, Cross entropy, KL divergence, and Importance…For the past few months, I have been working on a follow-up to my earlier article “Understanding LLMs from Scratch Using Middle School…Jun 15A response icon2Jun 15A response icon2
Published inTDS ArchiveUnderstanding LLMs from Scratch Using Middle School MathIn this article, we talk about how LLMs work, from scratch — assuming only that you know how to add and multiply two numbers. The article…Oct 19, 2024A response icon102Oct 19, 2024A response icon102
Published inQuickAI.appHow to do the Price-Volume-Mix waterfall rightBreaking down revenue changes accuratelyAug 6, 2024A response icon2Aug 6, 2024A response icon2