Self-Tuning Networks: Amortizing the Hypergradient Computation for Hyperparameter Optimization
Microsoft Research shares this amazing talk on the optimization of many deep learning hyperparameters can be formulated as a bilevel optimization problem. While most black-box...
Details