optimization 2 Stochastic Gradient Descent: Why Randomness Works May 13, 2026 Why Gradient Descent Works: A Small Mathematical Story May 12, 2026