Many pattern recognition, clustering, and machine learning algorithms rely on a chosen distance function or, more generally, a chosen similarity measure. This motivates studying similarity measures to understand what kind of bias they impose on the algorithms built on them.

This project was started in the summer of 2000 by Ph.D. students Ville Kyrki and Joni Kamarainen and research assistant Jarmo Ilonen. The primary goal of the project was to deepen the authors' knowledge of similarity measures. As a result, the authors introduced a new linear transform, called the neighbor-bank transform, which reduces the dimensionality of the data and simultaneously increases the information content of the data vectors. The neighbor-bank transform is a useful pre-processing technique for ordered histogram data, e.g., gray-level histograms and color spectra. See the documentation for more information.
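To illustrate why a smoothing linear projection can help with ordered histograms, here is a minimal sketch. It is not the authors' neighbor-bank transform and not a port of newneighborbank.m. It assumes a bank of overlapping triangular windows, and the names `triangular_bank`, `project`, and `delta` are hypothetical. It compares two single-bin histograms whose peaks are adjacent with two whose peaks are far apart, before and after projection.

```python
import math

def triangular_bank(n_bins, n_out):
    """Build an n_out x n_bins matrix of overlapping triangular weights.

    Assumption for illustration only; the actual neighbor-bank
    construction is defined in the authors' papers and source code.
    Requires n_out >= 2.
    """
    width = (n_bins - 1) / (n_out - 1)          # spacing between window centers
    centers = [i * width for i in range(n_out)]
    return [[max(0.0, 1.0 - abs(j - c) / width) for j in range(n_bins)]
            for c in centers]

def project(bank, hist):
    """Apply the linear projection: one weighted sum per output dimension."""
    return [sum(w * h for w, h in zip(row, hist)) for row in bank]

def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def delta(n_bins, k):
    """Histogram with all mass in bin k."""
    h = [0.0] * n_bins
    h[k] = 1.0
    return h

n_bins, n_out = 16, 6
bank = triangular_bank(n_bins, n_out)
a, near, far = delta(n_bins, 4), delta(n_bins, 5), delta(n_bins, 12)

# Bin-wise Euclidean distance ignores bin order: both pairs look equally far apart.
raw_near, raw_far = euclidean(a, near), euclidean(a, far)

# After the smoothing projection, neighboring peaks are closer than distant ones.
proj_near = euclidean(project(bank, a), project(bank, near))
proj_far = euclidean(project(bank, a), project(bank, far))
```

In this sketch the 16-bin histograms are reduced to 6 dimensions, and the projected distance reflects how far apart the peaks are, which the raw bin-wise distance does not.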


The project is finished. All goals were met.


  1. Kamarainen, J.-K., Kyrki, V., Ilonen, J., Kälviäinen, H., Similarity Measures for Ordered Histograms, In Proceedings of the 12th Scandinavian Conference on Image Analysis (Bergen, Norway, 2001), pp. 699-705.

  2. Kamarainen, J.-K., Kyrki, V., Ilonen, J., Kälviäinen, H., Improving Similarity Measures of Histograms Using Smoothing Projections, Pattern Recognition Letters 24, 12 (2003), 2009-2019.


Matlab(tm) source code (for scientific and non-profit use only):
Function for neighbor-bank generation: newneighborbank.m
Script demonstrating properties of the neighbor-bank projection: demo01.m

Data sets used in our experiments:
  1. Trees data set (download page)
  2. Lumber data set (download page)
  3. Bubbling data set (download page)

