mixdir: Cluster High Dimensional Categorical Datasets

Scalable Bayesian clustering of categorical datasets. The package implements a hierarchical Dirichlet (Process) mixture of multinomial distributions. It is thus a probabilistic latent class model (LCM) and can be used to reduce the dimensionality of hierarchical data and cluster individuals into latent classes. It can automatically infer an appropriate number of latent classes or find k classes, as defined by the user. The model is based on a paper by Dunson and Xing (2009) <doi:10.1198/jasa.2009.tm08439>, but implements a scalable variational inference algorithm so that it is applicable to large datasets. It is described and tested in the accompanying paper by Ahlmann-Eltze and Yau (2018) <doi:10.1109/DSAA.2018.00068>.

Version: 0.3.0
Depends: R (≥ 2.10)
Imports: extraDistr, Rcpp
LinkingTo: Rcpp
Suggests: testthat, tibble, purrr, dplyr, rmutil, pheatmap, mcclust, ggplot2, tidyr, utils
Published: 2019-09-20
DOI: 10.32614/CRAN.package.mixdir
Author: Constantin Ahlmann-Eltze ORCID iD [aut, cre], Christopher Yau ORCID iD [ths]
Maintainer: Constantin Ahlmann-Eltze <artjom31415 at googlemail.com>
License: GPL-3
URL: https://github.com/const-ae/mixdir
NeedsCompilation: yes
Citation: mixdir citation info
Materials: README NEWS
CRAN checks: mixdir results


Reference manual: mixdir.pdf


Package source: mixdir_0.3.0.tar.gz
Windows binaries: r-devel: mixdir_0.3.0.zip, r-release: mixdir_0.3.0.zip, r-oldrel: mixdir_0.3.0.zip
macOS binaries: r-release (arm64): mixdir_0.3.0.tgz, r-oldrel (arm64): mixdir_0.3.0.tgz, r-release (x86_64): mixdir_0.3.0.tgz, r-oldrel (x86_64): mixdir_0.3.0.tgz
Old sources: mixdir archive


Please use the canonical form https://CRAN.R-project.org/package=mixdir to link to this page.