Loss Spikes in Gradient Descent
Loss spikes aren’t noise. They’re gradient descent briefly exceeding the edge of stability and snapping back. Here’s why.
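The mechanism is easiest to see in one dimension. On the quadratic f(x) = (λ/2)x², a gradient step maps x to (1 − ηλ)x, so the error shrinks exactly when the learning rate satisfies η < 2/λ; past that threshold every step amplifies it. Below is a minimal sketch of that stability boundary (plain Python, with made-up values for the curvature λ and the step size η):

```python
def gd_quadratic(lam, eta, x0=1.0, steps=10):
    """Run gradient descent on f(x) = (lam / 2) * x**2 and record the loss."""
    x, losses = x0, []
    for _ in range(steps):
        losses.append(0.5 * lam * x * x)
        x -= eta * lam * x          # update: x <- (1 - eta * lam) * x
    return losses

lam = 10.0                          # curvature; stability threshold is 2 / lam = 0.2
for eta in (0.19, 0.21):            # just below and just above the threshold
    print(f"eta = {eta}:", [f"{loss:.3g}" for loss in gd_quadratic(lam, eta)])
# eta = 0.19: the loss decays (the iterate alternates sign but shrinks).
# eta = 0.21: every step multiplies the iterate by |1 - eta * lam| = 1.1,
#             so the loss grows geometrically -- the start of a spike.
```

In a real training run the curvature is not fixed: the runaway oscillation pushes the iterate into a flatter region of the loss surface, where the local threshold rises back above η and the loss falls again. That round trip is the spike.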