Probabilistic Principal Components and Mixtures, How This Works
Abstract
Classical principal component analysis (PCA) is a widely used method for dimensionality reduction and data visualization. It is, however, a purely algebraic method: it solves an optimization problem fitted exactly to the gathered data vectors with all their particularities, and no statistical significance tests are possible. An alternative is probabilistic principal component analysis (PPCA), which is formulated on probabilistic grounds. To apply it, one has to know the probability distribution of the analyzed data; usually the multivariate Gaussian (MVG) distribution is assumed. But what if the analyzed data are decidedly not MVG? We encountered this problem when analyzing multivariate gearbox data derived from a heavy-duty machine, and we show here how we dealt with it. In our analysis, we assumed that the considered data are a mixture of two MVG groups; specifically, each subgroup follows a probabilistic principal component (PPC) distribution with an MVG error term. By applying Bayesian inference, we were then able to calculate for each data vector x its a posteriori probability of belonging to the data generated by the assumed model. After estimating the parameters of the assumed model, we obtained means, resting on a sound statistical basis, for constructing confidence boundaries of the data and for finding outliers.
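The approach described above can be sketched in code. The following is a minimal illustration, not the authors' implementation: it fits the closed-form PPCA model of Tipping and Bishop to each of two subgroups of synthetic data (the group labels, dimensions, and the example point `x` are all hypothetical choices made here for illustration), and then uses Bayes' rule with equal priors to compute the a posteriori probability that a new vector belongs to the first group.

```python
import numpy as np

def fit_ppca(X, q):
    """Closed-form ML estimates of the PPCA model (Tipping & Bishop):
    x ~ N(mu, W W^T + sigma2 * I), with q latent dimensions."""
    mu = X.mean(axis=0)
    S = np.cov(X, rowvar=False)
    vals, vecs = np.linalg.eigh(S)           # eigenvalues in ascending order
    vals, vecs = vals[::-1], vecs[:, ::-1]   # reorder to descending
    d = X.shape[1]
    sigma2 = vals[q:].mean()                 # noise variance = mean of discarded eigenvalues
    W = vecs[:, :q] * np.sqrt(np.maximum(vals[:q] - sigma2, 0.0))
    C = W @ W.T + sigma2 * np.eye(d)         # implied MVG model covariance
    return mu, C

def log_gauss(X, mu, C):
    """Log-density of rows of X under N(mu, C)."""
    d = X.shape[1]
    diff = X - mu
    Cinv = np.linalg.inv(C)
    _, logdet = np.linalg.slogdet(C)
    quad = np.einsum('ij,jk,ik->i', diff, Cinv, diff)
    return -0.5 * (d * np.log(2.0 * np.pi) + logdet + quad)

# Synthetic stand-in for the two MVG subgroups (hypothetical data)
rng = np.random.default_rng(0)
A = rng.normal([0.0, 0.0, 0.0], 0.5, size=(200, 3))
B = rng.normal([4.0, 4.0, 4.0], 0.5, size=(200, 3))

muA, CA = fit_ppca(A, q=1)
muB, CB = fit_ppca(B, q=1)

# Posterior probability that x belongs to group A (equal priors assumed)
x = np.array([[0.2, -0.1, 0.3]])
la, lb = log_gauss(x, muA, CA), log_gauss(x, muB, CB)
post_A = 1.0 / (1.0 + np.exp(lb - la))
```

A point with low density under both fitted components (i.e. small `la` and `lb`) would be a candidate outlier; the confidence boundary corresponds to a threshold on this model density, as in the statistical construction the abstract refers to.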