Archives of Acoustics, 16, 2, pp. 237-247, 1991

Image similarity functions in non-parametric algorithms of voice identification

Cz. BASZTURA
Institute of Telecommunication and Acoustics of the Wrocław Technical University
Poland

J. ZUK
Institute of Telecommunication and Acoustics of the Wrocław Technical University
Poland

This paper addresses the choice of a similarity function between images in non-parametric algorithms of voice recognition. The usefulness of 10 similarity functions (8 distances and 2 nearness measures) in three non-parametric identification algorithms – NN (nearest neighbour), k-NN (k-nearest neighbours) and NM (nearest mean) – was investigated for three sets of parameters (1 natural and 2 normalized). Results obtained for a closed set of M = 20 speakers (after 10 repetitions of the learning and test sequences) showed that the Canberra distance function performs best for all types of parameters and algorithms. The other functions provide a discriminating power that depends strongly on the algorithm and on the form of the parameters. The squared Mahalanobis distance proved to be of limited usefulness compared with the other similarity functions, and the NM algorithm gave generally worse results.
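As an illustration only, the three decision rules named in the abstract can be sketched with the Canberra distance as the similarity function. This is a minimal sketch under stated assumptions, not the authors' implementation: the function names, the toy three-element "images", the choice k = 3 and the rule of skipping 0/0 terms are all hypothetical, and the paper's actual parameters and normalizations are not reproduced here.

```python
from collections import Counter


def canberra(x, y):
    """Canberra distance: sum over i of |x_i - y_i| / (|x_i| + |y_i|).

    Terms where both components are zero are skipped (an assumed convention).
    """
    total = 0.0
    for a, b in zip(x, y):
        denom = abs(a) + abs(b)
        if denom > 0:
            total += abs(a - b) / denom
    return total


def nn_classify(sample, train, dist=canberra):
    """NN rule: return the label of the single closest training image."""
    return min(train, key=lambda item: dist(sample, item[0]))[1]


def knn_classify(sample, train, k=3, dist=canberra):
    """k-NN rule: majority label among the k closest training images."""
    nearest = sorted(train, key=lambda item: dist(sample, item[0]))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


def nm_classify(sample, train, dist=canberra):
    """NM rule: return the label of the class whose mean image is closest."""
    groups = {}
    for vec, label in train:
        groups.setdefault(label, []).append(vec)
    # Component-wise mean of every class.
    means = {
        label: [sum(col) / len(vecs) for col in zip(*vecs)]
        for label, vecs in groups.items()
    }
    return min(means, key=lambda label: dist(sample, means[label]))


# Hypothetical toy data: two speakers, two training images each.
train = [
    ([1.0, 2.0, 3.0], "A"),
    ([1.2, 1.8, 3.1], "A"),
    ([5.0, 6.0, 7.0], "B"),
    ([5.5, 5.9, 7.2], "B"),
]
sample = [1.1, 2.1, 2.9]

print(nn_classify(sample, train))
print(knn_classify(sample, train, k=3))
print(nm_classify(sample, train))
```

Passing the distance as the `dist` argument keeps the decision rule separate from the similarity function, so any of the other distances could be substituted for comparison without changing the classifiers.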
Copyright © Polish Academy of Sciences & Institute of Fundamental Technological Research (IPPT PAN).
