Processing math: 100%
Skip to yearly menu bar Skip to main content


Virtual presentation / poster accept

Almost Linear Constant-Factor Sketching for 1 and Logistic Regression

Alexander Munteanu · Simon Omlor · David Woodruff

Keywords: [ logistic regression ] [ regression ] [ data streams ] [ sketching ] [ Theory ]


Abstract: We improve upon previous oblivious sketching and turnstile streaming results for 1 and logistic regression, giving a much smaller sketching dimension achieving O(1)-approximation and yielding an efficient optimization problem in the sketch space. Namely, we achieve for any constant c>0 a sketching dimension of ˜O(d1+c) for 1 regression and ˜O(μd1+c) for logistic regression, where μ is a standard measure that captures the complexity of compressing the data. For 1-regression our sketching dimension is near-linear and improves previous work which either required Ω(logd)-approximation with this sketching dimension, or required a larger poly(d) number of rows. Similarly, for logistic regression previous work had worse poly(μd) factors in its sketching dimension. We also give a tradeoff that yields a 1+ε approximation in input sparsity time by increasing the total size to (dlog(n)/ε)O(1/ε) for 1 and to (μdlog(n)/ε)O(1/ε) for logistic regression. Finally, we show that our sketch can be extended to approximate a regularized version of logistic regression where the data-dependent regularizer corresponds to the variance of the individual logistic losses.

Chat is not available.