Improved method for correcting sample Mahalanobis distance without estimating population eigenvalues or eigenvectors of covariance matrix

Research output: Contribution to journalArticlepeer-review

6 Scopus citations

Abstract

The recognition performance of the sample Mahalanobis distance (SMD) deteriorates as the number of learning samples decreases. Therefore, it is important to correct the SMD for a population Mahalanobis distance (PMD) such that it becomes equivalent to the case of infinite learning samples. In order to reduce the computation time and cost for this main purpose, this paper presents a correction method that does not require the estimation of the population eigenvalues or eigenvectors of the covariance matrix. In short, this method only requires the sample eigenvalues of the covariance matrix, number of learning samples, and dimensionality to correct the SMD for the PMD. This method involves the summation of the SMD’s principal components (each of which is divided by its expectation obtained using the delta method), Lawley’s bias estimation, and the variances of the sample eigenvectors. A numerical experiment demonstrates that this method works well for various cases of learning sample number, dimensionality, population eigenvalues sequence, and non-centrality. The application of this method also shows improved performance of estimating a Gaussian mixture model using the expectation–maximization algorithm.

Original languageEnglish
Pages (from-to)121-134
Number of pages14
JournalInternational Journal of Data Science and Analytics
Volume10
Issue number2
DOIs
StatePublished - 1 Aug 2020

Keywords

  • Correction method
  • Delta method
  • Gaussian mixture model
  • Lawley’s bias estimation
  • Sample eigenvalues and eigenvectors
  • Sample Mahalanobis distance

Fingerprint

Dive into the research topics of 'Improved method for correcting sample Mahalanobis distance without estimating population eigenvalues or eigenvectors of covariance matrix'. Together they form a unique fingerprint.

Cite this