The Guedon-Vershynin Semi-definite Programming Approach to Low Dimensional Embedding for Unsupervised Clustering
This paper proposes a method for estimating the cluster matrix in the Gaussian mixture framework via Semi-Definite Programming. Theoretical error bounds are provided and a (non linear) low dimensional embedding of the data is deduced from the cluster matrix estimate. The method and its analysis is inspired by the work by Guédon and Vershynin on community detection in the stochastic block model. The adaptation is non trivial since the model is different and new Gaussian concentration arguments are needed. Our second contribution is a new Bregman-ADMM type algorithm for solving the semi-definite program and computing the embedding. This results in an efficient and scalable algorithm taking only the pairwise distances as input. The performance of the method is illustrated via Monte Carlo experiments and comparisons with other embeddings from the literature.
CITE THIS COLLECTION
SHARE
Usage metrics
Read the peer-reviewed publication
AUTHORS (3)
CATEGORIES
- Statistics
- Mathematical Physics not elsewhere classified
- Ordinary Differential Equations, Difference Equations and Dynamical Systems
- Computation Theory and Mathematics
- Financial Mathematics
- Applied Mathematics not elsewhere classified
- Optimisation
- Numerical and Computational Mathematics not elsewhere classified
- Applied Statistics
- Numerical Computation
- Computation Theory and Mathematics not elsewhere classified