Machine Learning

The geometry of trainable models with piecewise linear structures

A ReLU neural network is an alternating composition of an affine linear function and the ReLU activation \(max(0,x)\), where this function is applied to a vector coordinate-wise. Any such network is thus a piecewise-linear function, and, in fact, also the converse holds true: Any piecewise linear function can be expressed as a ReLU neural network. Tropial geometry may be thought of as the geomtry of piecewise-linear functions, and allows a new perspective to study such network representing piecewise-linear functions. Inherently, this draws close connections to polyhedral geometry and oriented matroids.

 

References
  1. How to learn a star: Binary classification with starshaped polyhedral sets with Katharina Jochemko. Accepted at NeurIPS (The Thirty-Ninth Annual Conference on Neural Information Processing Systems), December 2025. [abstract] [arXiv]
  2. Decomposition Polyhedra of Piecewise Linear Functions with Moritz Grillo, and Christoph Hertrich. International Conference on Learning Representations (ICLR), February 2025. [abstract] [url]
  3. The Real Tropical Geometry of Neural Networks for Binary Classification with Georg Loho, and Guido Montúfar. Transactions on Machine Learning Research, September 2024. [abstract] [url] [bibtex]