Self-tuning Spectral Clustering

The choice of the kernel size $\sigma$ in (3) has a high impact on the clustering quality. It is a measure of when two points are considered similar, and should be of the same order of the distance between similar points. Some rules of thumb have been proposed to set a value for $\sigma$ , whereas in other cases this value is set manually.

When the data contains clusters with different local statistics, there may not be a single value of $\sigma$ that works well for all the data. In [12] a ``local'' scaling parameter $\sigma_i$ is proposed instead of this global parameter. It allows self-tuning of the point-to-point distances by studying the local statistics of the neightboring points of every point $\textbf{x}_i$ . This leads to the following extension of (3):

$\displaystyle \tilde{\kappa}(\textbf{x}[i],\textbf{x}[j]) = \exp\left(-\frac{d^2(\textbf{x}[i],\textbf{x}[j])}{\sigma_i \sigma_j}\right).$

(4)

If $\sigma_i$ is chosen as

$\displaystyle \sigma_i = d(\textbf{x}_i,\textbf{x}_L)$

(5)

where $\textbf{x}_L$ is the L'th nearest neighbor of point $\textbf{x}_i$ , then spectral clustering will group together all points with their closest neighbors that have similar $\sigma_i$ . The selection of

only depends on the data dimension of the embedding space.

Steven Van Vaerenbergh
Last modified: 2007-10-17