3 papers across 3 sessions
A novel generative model for hierarchies based on Bayesian Flow Networks.
We prove that the gradient descent map is non-singular for any neural network using piecewise analytic activation functions.