Efficient Hessian-based DNN Optimization via Chain-Rule ApproximationTemperoni, Alessandro ; Dalle Lucca Tosi, Mauro ; Theobald, Martin ![]() in Proceedings of the 6th Joint International Conference on Data Science Management of Data (10th ACM IKDD CODS and 28th COMAD) (2023) Learning non use-case specific models has been shown to be a challenging task in Deep Learning (DL). Hyperparameter tuning requires long training sessions that have to be restarted any time the network or ... [more ▼] Learning non use-case specific models has been shown to be a challenging task in Deep Learning (DL). Hyperparameter tuning requires long training sessions that have to be restarted any time the network or the dataset changes and are not affordable by most stakeholders in industry and research. Many attempts have been made to justify and understand the source of the use-case specificity that distinguishes DL problems. To this date, second-order optimization methods have been partially shown to be effective in some cases but have not been sufficiently investigated in the context of learning and optimization. In this work, we present a chain rule for the efficient approximation of the Hessian matrix (i.e., the second-order derivatives) of the weights across the layers of a Deep Neural Network (DNN). We show the application of our approach for weight optimization during DNN training, as we believe that this is a step that particularly suffers from the enormous variety of the optimizers provided by state-of-the-art libraries such as Keras and PyTorch. We demonstrate—both theoretically and empirically—the improved accuracy of our approximation technique and that the Hessian is a useful diagnostic tool which helps to more rigorously optimize training. Our preliminary experiments prove the efficiency as well as the improved convergence of our approach which both are crucial aspects for DNN training. [less ▲] Detailed reference viewed: 263 (14 UL) "The Origins Of Chess" A Digital 3D Chess Artwork with Physics & AI; Temperoni, Alessandro ; Franck, Christian ![]() Scientific Conference (2022, November 30) Chess and Artificial Intelligence (AI) have always been connected together as the game naturally challenges the ability of a computer to think. In this work, we present a novel chess game using AI and 3D ... [more ▼] Chess and Artificial Intelligence (AI) have always been connected together as the game naturally challenges the ability of a computer to think. In this work, we present a novel chess game using AI and 3D technology for the implementation of the engine as well as for the physical installation of the game. For the engine, the Minimax algorithm is utilized to calculate the best possible move. The game is installed at the ”AI and Art” exhibition in the Computational Creativity Hub (CCH) of the University of Luxembourg. [less ▲] Detailed reference viewed: 95 (6 UL) Robust and Provable Guarantees for Sparse Random Embeddings; Temperoni, Alessandro ; Theobald, Martin ![]() in Advances in Knowledge Discovery and Data Mining - 26th Pacific-Asia Conference, PAKDD 2022, Chengdu, China, May 16-19, 2022, Proceedings, Part II. (2022, May 18) Detailed reference viewed: 66 (1 UL) Revisiting Weight Initialization of Deep Neural Networks; Temperoni, Alessandro ; Theobald, Martin ![]() in Proceedings of Machine Learning Research (2021, November 17) Detailed reference viewed: 70 (0 UL) |
||