cluster analysis - k-means clustering in R on very large, sparse matrix? -


I am trying to do some k-mean clustering on a very large matrix.

The matrix is ​​approximately 500,000 rows x 4000 columns yet very rare (only a couple of "1" values ​​per line).

The whole thing is not fit in memory, so I converted it into a rare ARFF file. But R can not read clearly the rare ARFF file format. I also have data as a plain CSV file.

Is there any package available in R to efficiently load such rare matrix? I use regular algorithm from the cluster package to proceed.

Thank you very much

packages (or the families of packages now - see them) There is a running example of expanded analytics on data. Specially look in the sub-package, which includes the k-mean function.


Comments

Popular posts from this blog

windows - Heroku throws SQLITE3 Read only exception -

lex - Building a lexical Analyzer in Java -

python - rename keys in a dictionary -