Central samples are removed because they correspond to inactive
sources and are almost unaffected by the nonlinearity. If
, the probability of having no active
sources at all according to the sparse source model
(1) is
, so the
samples
closest to the origin can be removed. In addition, ``non-sparse''
samples, which are the result of multiple sources active at the
same time, are also removed. They can be estimated as the
samples with highest local
scale.
If the sources have different -values,
and
can easily be calculated according to the previous description. In
practice rough (over-) estimates can be used for
and
. Especially when the
are unknown,
and
should be chosen so that after preprocessing the remaining
samples can be clustered into non-overlapping clusters.