Comparing partitions of two sets of units based on the same variables

G Youness, G Saporta - Advances in data analysis and classification, 2010 - Springer
Advances in data analysis and classification, 2010Springer
We propose a procedure based on a latent variable model for the comparison of two
partitions of different units described by the same set of variables. The null hypothesis here
is that the two partitions come from the same underlying mixture model. We define a method
of “projecting” partitions using a supervised classification method: once one partition is taken
as a reference; the individuals of the second data set are allocated to the clusters of the
reference partition; it gives two partitions of the same units of the second data set: the …
Abstract
We propose a procedure based on a latent variable model for the comparison of two partitions of different units described by the same set of variables. The null hypothesis here is that the two partitions come from the same underlying mixture model. We define a method of “projecting” partitions using a supervised classification method: once one partition is taken as a reference; the individuals of the second data set are allocated to the clusters of the reference partition; it gives two partitions of the same units of the second data set: the original and the projected one and we evaluate their difference by usual measures of association. The empirical distributions of the association measures are derived by simulation.
Springer
以上显示的是最相近的搜索结果。 查看全部搜索结果