Second Order Machine Learning - Michael Mahoney

From Katie Gentilello September 29th, 2017

31 plays 0 comments

A major challenge for large-scale machine learning, and one that will only increase in importance as we develop models that are more and more domain-informed, involves going beyond high-variance first-order optimization methods to more robust second order methods. Here, we consider the problem of minimizing the sum of a large number of functions over a convex constraint set, a problem that arises in many data analysis, machine learning, and more traditional scientific computing applications, as well as non-convex variants of these basic methods. While this is of interest in many situations, it has received attention recently due to challenges associated with training so-called deep neural networks. We establish improved bounds for algorithms that incorporate sub-sampling as a way to improve computational efficiency, while maintaining the original convergence properties of these algorithms. These methods exploit recent results from Randomized Linear Algebra on approximate matrix multiplication. Within the context of second order optimization methods, they provide quantitative convergence results for variants of Newton's methods, where the Hessian and/or the gradient is uniformly or non-uniformly sub-sampled, under much weaker assumptions than prior work.

https://mediaspace.gatech.edu/media/mahoney/1_9d6482iu

Tags: ideas seminar seriesgeorgia techrecorded by gtlibraryfall 2017institute for data engineering and science

name: Michael Mahoney
Date: September 22nd, 2017
Appears In: Open Scholarship

Comments
Related Media

Add a comment

Second Order Machine Learning - Michael Mahoney

Related Media