Displaying 20 results from an estimated 1000 matches similar to: "parLapplyLB: Load balancing?"
2018 Feb 19
0
[parallel] fixes load balancing of parLapplyLB
Dear R-Devel List,
I have installed R 3.4.3 with the patch applied on our cluster and ran a *real-world* job of one of our users to confirm that the patch works to my satisfaction. Here are the results.
The original was a series of jobs, all essentially doing the same stuff using bootstrapped data, so for the original there is more data and I show the arithmetic mean with standard deviation. The
2018 Mar 01
0
[parallel] fixes load balancing of parLapplyLB
Dear Tomas,
Thanks for your commitment to fix this issue and also to add the chunk size as an argument. If you want our input, let us know ;)
Best Regards
On 02/26/2018 04:01 PM, Tomas Kalibera wrote:
> Dear Christian and Henrik,
>
> thank you for spotting the problem and suggestions for a fix. We'll probably add a chunk.size argument to parLapplyLB and parLapply to follow OpenMP
2018 Feb 20
0
[parallel] fixes load balancing of parLapplyLB
Dear Henrik,
The rationale is just that it is within these extremes and that it is really simple to calculate, without making any assumptions and knowing that it won't be perfect.
The extremes A and B you are mentioning are special cases based on assumptions. Case A is based on the assumption that the function has a long runtime or varying runtime, then you are likely to get the best load
2018 Feb 26
2
[parallel] fixes load balancing of parLapplyLB
Dear Christian and Henrik,
thank you for spotting the problem and suggestions for a fix. We'll
probably add a chunk.size argument to parLapplyLB and parLapply to
follow OpenMP terminology, which has already been an inspiration for the
present code (parLapply already implements static scheduling via
internal function staticClusterApply, yet with a fixed chunk size;
parLapplyLB already
2012 Oct 23
0
Typos/omissions/inconsistencies in man page for clusterApply
Hi,
Here are the issues I found:
Typos
-----
(a) Found: It a parallel version of ?evalq?,
"is" missing.
(b) Found: 'parLapplyLB', 'parSapplyLB' are load-balancing versions,
intended for use when applying ?FUN? to
'parLapplyLB' has no 'FUN' arg (more on this below).
(c) Found: 'clusterApply' calls 'fun' on the first
2018 Feb 19
2
[parallel] fixes load balancing of parLapplyLB
Hi, I'm trying to understand the rationale for your proposed amount of
splitting and more precisely why that one is THE one.
If I put labels on your example numbers in one of your previous post:
nbrOfElements <- 97
nbrOfWorkers <- 5
With these, there are two extremes in how you can split up the
processing in chunks such that all workers are utilized:
(A) Each worker, called
2018 Feb 12
2
[parallel] fixes load balancing of parLapplyLB
Dear R-Devel List,
**TL;DR:** The function **parLapplyLB** of the parallel package has [reportedly][1] (see also attached RRD output) not
been doing its job, i.e. not actually balancing the load. My colleague Dirk Sarpe and I found the cause of the problem
and we also have a patch to fix it (attached). A similar fix has also been provided [here][2].
[1]:
2014 Dec 06
1
does parLapplyLB do load-balancing?
Looking at parLapplyLB, one sees that it takes in X and then passes
splitList(X, length(cl)) to clusterApplyLB, which then calls
dynamicClusterApply. Thus while dynamicClusterApply does handle tasks
in a load-balancing fashion, sending out individual tasks as previous
tasks complete, parLapplyLB preempts that by splitting up the tasks in
advance into as many groups of tasks as there are cluster
2018 Mar 15
0
clusterApply arguments
On Thu, Mar 15, 2018 at 3:39 AM, <FlorianSchwendinger at gmx.at> wrote:
> Thank you for your answer!
> I agree with you except for the 3 (Error) example and
> I realize now I should have started with that in the explanation.
>
> From my point of view
> parLapply(cl = clu, X = 1:2, fun = fun, c = 1)
> shouldn't give an error.
>
> This could be easily avoided by
2018 Mar 15
1
clusterApply arguments
On 03/15/2018 05:25 PM, Henrik Bengtsson wrote:
> On Thu, Mar 15, 2018 at 3:39 AM, <FlorianSchwendinger at gmx.at> wrote:
>> Thank you for your answer!
>> I agree with you except for the 3 (Error) example and
>> I realize now I should have started with that in the explanation.
>>
>> From my point of view
>> parLapply(cl = clu, X = 1:2, fun = fun, c =
2018 Mar 15
2
clusterApply arguments
Thank you for your answer!
I agree with you except for the 3 (Error) example and
I realize now I should have started with that in the explanation.
>From my point of view
parLapply(cl = clu, X = 1:2, fun = fun, c = 1)
shouldn't give an error.
This could be easily avoided by using all the argument
names in the custerApply call of parLapply which means changing,
parLapply <-
2018 Mar 14
0
clusterApply arguments
This is nothing specific to parallel::clusterApply() per se. It is the
default behavior of R where it allows for partial argument names. I
don't think there's much that can be done here except always using
fully named arguments to the "apply" function itself as you show.
You can "alert" yourself when there's a mistake by using:
options(warnPartialMatchArgs =
2018 Mar 14
2
clusterApply arguments
Hi!
I recognized that the argument matching of clusterApply (and therefore parLapply) goes wrong when one of the arguments of the function is called "c". In this case, the argument "c" is used as cluster and the functions give the following error message "Error in checkCluster(cl) : not a valid cluster".
Of course, "c" is for many reasons an unfortunate
2009 Aug 13
0
Efficiently Extracting Meta Data from TM Corpora
I'm using text miner (the "tm" package) to process large numbers of blog and message board postings (about 245,000). Does anyone have any advice for how to efficiently extract the meta data from a corpus of this size?
TM does a great job of using MPI for many functions (e.g. tmMap) which greatly speed up the processing. However, the "meta" function that I need does not
2019 Apr 13
0
SUGGESTION: Settings to disable forked processing in R, e.g. parallel::mclapply()
I think it's worth saying that mclapply() works as documented: it
relies on forking, and so doesn't work well in environments where it's
unsafe to fork. This is spelled out explicitly in the documentation of
?mclapply:
It is strongly discouraged to use these functions in GUI or embedded
environments, because it leads to several processes sharing the same
GUI which will likely cause
2010 Dec 02
1
parLapply - Error in do.call("fun", lapply(args, enquote)) : could not find function "fun"
Hello everybody,
I've got a bit of a problem with parLapply that's left me scratching my head
today. I've tried this in R 2.11 and the 23 bit Revolution R Enterprise and
gotten the same result, OS in question is Windows XP, the package involved
is the snow package.
I've got a list of 20 rain/no rain (1/0) situations for these two stations i
and j, all the items in this list look
2006 Dec 11
0
double boostrap with clusterApplyLB
Dear R-Users,
we are using a linux-cluster with RMPI and the snow package.
We would like to do a double boostrap.
We have a general function that implements the first boostrap (the outer) and
we are wondering if we can include another bootstrap (the inner) in the
same general function including another clusterApplyLB.
For example:
general function = function(...) {
clusterApplyLB(cl,
2009 Nov 17
2
SVM Param Tuning with using SNOW package
Hello,
Is the first time I am using SNOW package and I am trying to tune the cost
parameter for a linear SVM, where the cost (variable cost1) takes 10 values
between 0.5 and 30.
I have a large dataset and a pc which is not very powerful, so I need to
tune the parameters using both CPUs of the pc.
Somehow I cannot manage to do it. It seems that both CPUs are fitting the
model for the same values
2013 Jun 26
2
Error on executing functions from installed package
Hi,
I am currently building an R package and I am facing a peculiar problem
where some of the functions does not work within the package. However, if I
source the script the function works.
For example, in a method for parallelization of analysis on each chromosome
simultaneously I am receiving error at the following position of the code:
# this profile the information chromosome wise and
2013 Feb 07
1
R intermittently crashes across cluster
Greetings,
I am having an interesting problem and I wonder if anyone else has
seen this behavior.
I am running R 2.11.1 with SNOW 0.3-3 on a Dell cluster running CentOS 5.5.
I create my cluster using:
cluster<- makeCluster(nodes,type="SOCK",port=10191) # nodes is a
vector of compute nodes
I then wrap a loop around clusterApplyLB to evaluate my function
multiple times, with