Loss Spikes in Gradient Descent
Loss spikes aren’t noise. They’re gradient descent briefly exceeding the edge of stability and snapping back. Here’s why.
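The instability itself is easy to see in one dimension. On a quadratic loss f(x) = (λ/2)x², the gradient step maps x to (1 − ηλ)x, which contracts only when η < 2/λ; the moment η crosses that threshold, each step overshoots the minimum by more than it corrects. A minimal sketch of that threshold (the curvature λ = 10 and the two step sizes are illustrative choices, not taken from the post):

```python
# Gradient descent on f(x) = (lam / 2) * x^2. The update is
# x <- (1 - eta * lam) * x, which contracts iff |1 - eta * lam| < 1,
# i.e. iff eta < 2 / lam. Crossing that threshold makes the loss grow.
lam = 10.0
for eta in (0.19, 0.21):  # just below / just above 2 / lam = 0.2
    x = 1.0
    for _ in range(50):
        x -= eta * lam * x
    print(f"eta = {eta}: loss after 50 steps = {0.5 * lam * x * x:.3e}")
```

On a fixed quadratic the second run diverges forever. The snap-back in real training comes from the loss being non-quadratic: the overshoot carries the parameters into a lower-curvature region, which restores η < 2/λ and lets the loss fall again.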