Displaying 3 results from an estimated 3 matches for "lishuliu".
2012 Feb 20
1
bigmemory not really parallel
Hi, all,
I have a really big matrix that I want to run k-means on.
I tried:
>data <-
read.big.memory('mydata.csv',type='double',backingfile='mydata.bin',descriptorfile='mydata.desc')
I'm using doMC to register multicore.
>library(doMC)
>registerDoMC(cores=8)
>ans<-bigkmeans(data,k)
In system monitor, it seems only one thread running R. Is
2012 Jan 18
1
kmeans clustering on large but sparse matrix
Hi,
I have a 60k*600k matrix, which exceed the vector length limit of 2^32-1.
But it's rather sparse, only 0.02% has value. So I save is as MarketMatrix
(mm) file, it's about 300M in size. I use readMM in Matrix package to read
it in. If do so, the data type becomes dgTMatrix in 'Matrix' package
instead of the common matrix type.
The problem is, if I run k-means only on part of
2012 Feb 02
0
bigkmeans not parallel
I'm using bigkmeans in 'biganalytics' to cluster my 60,000 by 600,000 matrix.
I'm using a 8 core Linux VM.
I have register parallel backend with
>registerDoMC()
And I checked how many cores registered with
>getDoParWorkers()
It returns 8, which is the number of cores I have on my machine.
And I run the test below, whose results shows improved speed due to
parallel.
check