Recent posts

Loss Spikes in Gradient Descent

11 minute read

Loss spikes aren’t noise. They’re gradient descent briefly exceeding the edge of stability and snapping back. Here’s why.

Robust Regression Without Gradients

10 minute read

L1 regression is more robust to outliers than least squares, but harder to solve. We walk through four algorithms, each addressing a limitation of the previo...

Golden Section Search for Robust Regression

9 minute read

Golden section search reuses objective evaluations to efficiently minimize 1D functions. Learn how this classical algorithm connects to the golden ratio and ...

Newton-Gregory Interpolation

7 minute read

Interpolate equally-spaced data efficiently and discover its connection to Taylor series.

BKM

7 minute read

Compute logarithms and exponentials without a floating point unit.

CORDIC

12 minute read

Compute sine, cosine, and exponentials using only addition, subtraction, and bit shifts.

The AAA Algorithm

9 minute read

Fit rational functions to data with poles and discontinuities where polynomials fail.

Tanhsinh Quadrature

8 minute read

Tackle tricky integrals with endpoint singularities using a clever variable transformation.