Linear Regression Solution from Projections onto Subspaces April 18, 2026 Math Deriving Linear Regression Normal Form & Showing its Properties Read more →
Properties of L2 Loss April 16, 2026 Math Informative Properties of the Ubiquitous L2 Squared Loss Read more →
Bias Variance Tradeoff March 30, 2026 Generalization Bias Variance Decomposition of Squared Loss Read more →
VC Dimension via Pascal's Triangle March 27, 2026 Generalization What's the Maximum Number of Points a Model can Classify? Understanding VC Dimension via Pascal’s Triangle Read more →
The Euler–Lagrange Equation, Derived March 25, 2026 Math From Shortest Paths to a General Optimization Principle for Functionals Read more →
Derivatives of Functions vs. Derivatives of Functionals March 24, 2026 Math How do you take a derivative of a functional? $dF[f]$ Read more →
Fun with Functionals March 23, 2026 Math Introduction to Functionals: $F: f \rightarrow F[f] \in \mathbb{R}$ Read more →
Why is Learning Possible? March 21, 2026 Generalization Feasibility of Learning via Hoeffding's Inequality Read more →
Maximum Likelihood Estimation, Cross-Entropy, and Softmax March 18, 2026 Information Theory For Probabilistic Classifiers, MLE Reduces to Cross-Entropy Loss Read more →
KL-Divergence and Cross Entropy March 15, 2026 Information Theory $\nabla D_{KL}(P||Q) = \nabla H(P,Q)$, $D_{KL}(P||Q)$ vs $D_{KL}(Q||P)$, & Intuitive Visualizations Read more →