Abstract | ||
---|---|---|
The paper introduces base clusterings distributed cluster ensemble which can handle the problems of privacy preservation, distributed computing and knowledge reuse. First, the latent variables in latent Dirichlet location model for cluster ensemble (LDA-CE) are defined and some terminologies are defined. Second, Variational approximation inference for LDA-CE is stated in detail. Third, base on the variational approximation inference, we design a distributed and paralleled EM algorithm for cluster ensemble (DPEM). Finally, some datasets from UCI are chosen for experiment, Compared with cluster-based similarity partitioning algorithm (CSPA), hyper-graph partitioning algorithm(HGPA) and meta-clustering algorithm(MCLA), the results show DPEM algorithm does work better and DPEM can work distributed and paralleled, so DPEM can protect privacy information more and can save time. |
Year | DOI | Venue |
---|---|---|
2008 | 10.1109/PACIIA.2008.346 | PACIIA |
Keywords | Field | DocType |
null | Algorithm design,Computer science,Parallel algorithm,Expectation–maximization algorithm,Inference,Theoretical computer science,Latent variable,Artificial intelligence,Dirichlet distribution,Information privacy,Cluster analysis,Machine learning | Conference |
Volume | Issue | Citations |
2 | null | 1 |
PageRank | References | Authors |
0.34 | 5 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Hongjun Wang | 1 | 661 | 51.68 |
Zhishu Li | 2 | 43 | 12.49 |
Yang Cheng | 3 | 1 | 0.34 |