Michał Dereziński

Email: mderezin at berkeley edu

I am a postdoc at the Foundations of Data Analysis (FODA) Institute at UC Berkeley.

Previously, I was a research fellow at the Simons Institute for the Theory of Computing (Fall 2018, Foundations of Data Science program). I obtained my Ph.D. in Computer Science at the University of California, Santa Cruz, advised by Professor Manfred Warmuth. In my research, I develop efficient data sampling techniques with applications to learning theory and optimization. Prior to UCSC, I completed Master's degrees in mathematics and computer science at the University of Warsaw. I also interned at several Silicon Valley research labs (e.g., Microsoft, Yahoo, eBay), working on projects ranging from online learning to large-scale distributed optimization.

Short overview of my thesis work:

Unbiased estimates for linear regression via volume sampling   (video link)
Simons Institute, September 2018. Randomized Numerical Linear Algebra and Applications workshop.


Education

Ph.D. in Computer Science, University of California, Santa Cruz, 2018.
Thesis: Volume Sampling for Linear Regression  PDF

M.S. in Mathematics, University of Warsaw, 2013.
Thesis: Isomorphic Properties of Function Space BV on Simply Connected Planar Sets  PDF

M.S. in Computer Science, University of Warsaw, 2012.
Thesis: On Generating Concept Hierarchies with Fuzzy Data  PDF


Publications

2020

Determinantal Point Processes in Randomized Numerical Linear Algebra
M. Dereziński, M. W. Mahoney
arXiv

Isotropy and Log-Concave Polynomials: Accelerated Sampling and High-Precision Counting of Matroid Bases
N. Anari, M. Dereziński
arXiv

Improved guarantees and a multiple-descent curve for the Column Subset Selection Problem and the Nyström method
M. Dereziński, R. Khanna, M. W. Mahoney
arXiv

Convergence Analysis of Block Coordinate Algorithms with Determinantal Sampling
M. Mutný, M. Dereziński, A. Krause
To appear at AISTATS 2020  arXiv

Bayesian experimental design using regularized determinantal point processes
M. Dereziński, F. Liang, M. W. Mahoney
To appear at AISTATS 2020  arXiv

2019

Exact expressions for double descent and implicit regularization via surrogate random design
M. Dereziński, F. Liang, M. W. Mahoney
arXiv

Unbiased estimators for random design regression
M. Dereziński, M. K. Warmuth, D. Hsu
arXiv

Exact sampling of determinantal point processes with sublinear time preprocessing
M. Dereziński, D. Calandriello, M. Valko
NeurIPS 2019  arXiv

Distributed estimation of the inverse Hessian by determinantal averaging
M. Dereziński, M. W. Mahoney
NeurIPS 2019  arXiv

Minimax experimental design: Bridging the gap between statistical and worst-case approaches to least squares regression
M. Dereziński, K. L. Clarkson, M. W. Mahoney, M. K. Warmuth
COLT 2019  arXiv

Fast determinantal point processes via distortion-free intermediate sampling
M. Dereziński
COLT 2019  arXiv

Correcting the bias in least squares regression with volume-rescaled sampling
M. Dereziński, M. K. Warmuth, D. Hsu
AISTATS 2019  arXiv

2018

Leveraged volume sampling for linear regression
M. Dereziński, M. K. Warmuth, D. Hsu
NeurIPS 2018  arXiv

Reverse iterative volume sampling for linear regression
M. Dereziński, M. K. Warmuth
JMLR 2018  arXiv

Subsampling for Ridge Regression via Regularized Volume Sampling
M. Dereziński, M. K. Warmuth
AISTATS 2018  arXiv  Poster  Talk

Batch-Expansion Training: An Efficient Optimization Framework
M. Dereziński, D. Mahajan, S. S. Keerthi, S.V.N. Vishwanathan, M. Weimer
AISTATS 2018  arXiv  Poster  Talk

Discovering Surprising Documents with Context-Aware Word Representations
M. Dereziński, K. Rohanimanesh, A. Hydrie
IUI 2018  PDF  Talk

2017 and older

Unbiased estimates for linear regression via volume sampling
M. Dereziński, M. K. Warmuth
NIPS 2017  arXiv  Poster  Spotlight

Anticipating Concept Drift in Online Learning
M. Dereziński, B. N. Bhaskar
LFED Workshop at NIPS 2015  PDF

The limits of squared Euclidean distance regularization
M. Dereziński, M. K. Warmuth
NIPS 2014  PDF  Spotlight

Active Semi-Supervised Concept Hierarchy Refinement
M. Dereziński
LAWS 2012 workshop  PDF


Internships

Research Intern, Microsoft Research, Sunnyvale, CA
Developed a new optimization framework combining the benefits of stochastic and batch methods.

Intern Research Scientist, Yahoo Inc., Sunnyvale, CA
Developed optimization algorithms for recommendation systems that dynamically adapt to a changing environment.

Intern Research Scientist, eBay Inc., San Jose, CA
Developed and implemented an unsupervised recommendation system, based on topic modeling, for finding interesting products.

Software Engineer Intern, Nvidia, Santa Clara, CA

Intern, Interdisciplinary Centre for Mathematical and Computational Modelling, Warsaw, Poland
Researched algorithms related to concurrent programming.