Michael Cohen (MIT)

Monday, May 1, 2017

Klaus 1116 East - 11:00 am

Title: New Algorithms for Matrix Scaling Problems via Second-order Methods and Generalized Laplacian System Solvers 

In this paper, we study matrix scaling and balancing, two fundamental problems in scientific computing with a long line of work dating back to the 1960s. We provide algorithms for both problems that, ignoring logarithmic factors involving the dimension of the input matrix and the size of its entries, run in time m log(k) log^2(1/eps), where eps is the amount of error we are willing to tolerate and k is the ratio between the largest and the smallest entries of the optimal scalings. This implies that our algorithms run in nearly-linear time whenever k is quasi-polynomial, which includes, in particular, the case of strictly positive matrices.
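For readers unfamiliar with the problem, the sketch below illustrates matrix scaling in its doubly-stochastic form using the classical Sinkhorn-Knopp alternating normalization. This is a textbook baseline, not the algorithm from the talk. The function name `sinkhorn_scale`, the choice of the L1 row-sum deviation as the error measure, and the way `kappa` is computed from the returned scalings are all assumptions made for illustration.

```python
import numpy as np


def sinkhorn_scale(A, eps=1e-9, max_iters=10_000):
    """Classical Sinkhorn-Knopp iteration (a baseline, NOT the talk's method).

    Looks for positive vectors x, y such that S = diag(x) @ A @ diag(y)
    has all row sums and column sums approximately equal to 1.
    Assumes A is square with strictly positive entries.
    """
    A = np.asarray(A, dtype=float)
    n, m = A.shape
    x = np.ones(n)
    y = np.ones(m)
    S = A.copy()
    for _ in range(max_iters):
        x = 1.0 / (A @ y)      # make every row of diag(x) A diag(y) sum to 1
        y = 1.0 / (A.T @ x)    # then make every column sum to 1
        S = x[:, None] * A * y[None, :]
        # Columns were just normalized, so only the rows can be off.
        # Using the L1 deviation as "eps" is an illustrative assumption.
        row_error = np.abs(S.sum(axis=1) - 1.0).sum()
        if row_error <= eps:
            break
    return x, y, S


A = np.array([[1.0, 2.0],
              [3.0, 4.0]])     # strictly positive, so a scaling exists
x, y, S = sinkhorn_scale(A)

# Ratio between the largest and smallest scaling entries, in the spirit
# of the parameter k from the abstract (computed here on the approximate
# scalings, which is an assumption of this sketch).
kappa = max(x.max() / x.min(), y.max() / y.min())
```

Each Sinkhorn pass costs time proportional to the number of entries touched, but the number of passes it needs can grow quickly as eps shrinks or as the matrix becomes poorly conditioned, which is the kind of dependence the running time quoted above is meant to improve.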

In order to establish these results, we develop a new second-order optimization framework that enables us to treat both problems in a unified and principled manner. This framework identifies a certain generalization of linear system solving which we can use to efficiently minimize a broad class of functions, which we call second-order robust. We then show that in the context of the specific functions capturing matrix scaling and balancing, we can leverage and generalize the work on Laplacian system solving to make the algorithms obtained via this framework very efficient.
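To make the link to Laplacian solvers concrete, here is one standard convex formulation of doubly-stochastic scaling for a nonnegative n x n matrix A. The abstract does not state which functions are used, so the specific potential f below and the symbols x, y, M are assumptions chosen for illustration.

```latex
% Potential whose minimizers give a doubly-stochastic scaling of A
f(x, y) = \sum_{i,j} A_{ij}\, e^{x_i - y_j} - \sum_i x_i + \sum_j y_j ,
\qquad M_{ij} := A_{ij}\, e^{x_i - y_j} .

% Gradient: deviations of the row and column sums of M from 1
\frac{\partial f}{\partial x_i} = \sum_j M_{ij} - 1 ,
\qquad
\frac{\partial f}{\partial y_j} = 1 - \sum_i M_{ij} .

% Hessian: Laplacian of the bipartite graph with edge weights M_{ij}
\nabla^2 f(x, y) =
\begin{pmatrix}
\operatorname{diag}(M \mathbf{1}) & -M \\
-M^{\top} & \operatorname{diag}(M^{\top} \mathbf{1})
\end{pmatrix} .
```

At a stationary point every row and column of M = diag(e^x) A diag(e^{-y}) sums to 1, so M is the desired scaling. Because the Hessian is a weighted bipartite graph Laplacian, each Newton-type step on this potential reduces to solving a Laplacian linear system, which suggests why fast Laplacian solvers are relevant here.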

We also discuss an interior point method that runs in time m^{3/2} log(1/eps), up to logarithmic factors, for the case of matrix balancing and the doubly-stochastic variant of matrix scaling (with an additional log(log(k)) bound in a more general setting).

Joint work with Aleksander Madry, Dimitris Tsipras, and Adrian Vladu.

ArXiv posting:

A similar approach (but not using the generalization of Laplacian solvers and hence obtaining somewhat different results) was developed independently by Zeyuan Allen-Zhu, Yuanzhi Li, Rafael Oliveira, and Avi Wigderson:


