This is a linkpost for the original article hosted on my old blog: Stochastic Gradient Descent, Part III, Fitting linear, quadratic and sinusoidal data using a neural network and SGD.