similar to: HPC question: torques replacement

2020 Apr 17
0
HPC question: torques replacement
Hey Valeri - IIRC, midway (and maybe midway2?) use slurm for job scheduling. I don't know how many of your faculty use both your nodes and midway, but maybe consolidating onto a single scheduler would be easier for them? (Also, it's been a while ... hi!) Richard -----Original Message----- From: CentOS <centos-bounces at centos.org> On Behalf Of Valeri Galtsev Sent: Friday,
2008 Apr 26
1
Xen and Torque
Dear Xen users. Has anyone tried to integrate Xen with the Torque resource management system? Could you please give me some advice on a system I'm developing that relies on Torque? Let me describe the system first. The part of the system that talks to Torque should request a certain number of nodes of a cluster and launch a virtual machine instance on each (one VM instance per host).
2007 Dec 29
2
OpenMPI not compiled with Torque support
The OpenMPI package that ships with CentOS 5.1 does not seem to be compiled with torque support. It does, however, seem to be compiled with gridengine and slurm support. Would it be possible to get this changed?
2008 Sep 30
1
Broken pipe, x86_64 CentOS 5.2
Hi All, I have a problem with torque (openPBS) on x86_64 CentOS 5.2. Just to add, there's no problem on 32-bit CentOS 5.2 or 64-bit Ubuntu 8.04. The problem is that pbs_mom's child quits without giving any error logs. [root@frodo9 torque-2.3.3]# strace -f pbs_mom . . . bind(6, {sa_family=AF_INET, sin_port=htons(15002), sin_addr=inet_addr("0.0.0.0")}, 16) = 0 time(NULL)
2015 May 27
1
serious problem with torque
On Wed, May 27, 2015 10:55 am, Zachary Giles wrote: > Mark, You might really want to compile torque from source (into an RPM > if you'd like) and redistribute that. Every version is a little wonky > and those of us that use(d) it often will poke around until we find a > version / patch-set that makes us happy and stick with that for a bit. > It's not an exact science and
2015 May 27
5
serious problem with torque
Hi, folks, The other admin updated torque without testing it on one machine, and we had Issues. The first I knew was when a user reported qstat returning socket_connect_unix failed: 15137 socket_connect_unix failed: 15137 socket_connect_unix failed: 15137 qstat: cannot connect to server (null) (errno=15137) could not connect to trqauthd Attempting to restart the pbs_server did the same.
2015 Feb 19
0
Anyone using torque/pbs/munge?
CentOS 6.6 I've got two servers, server1 and hbs (honkin' big server). Both are running munge, and torque... *separately*. My problem is that I've got users who want to be able to submit from server1 to hbs. I see that munged can be pointed to an alternate keyfile... but is there any way to tell qsub what to use? (And yes, I got on the torque users' list, and I'm trying
2015 May 27
2
serious problem with torque
Johnny Hughes wrote: > On 05/27/2015 09:07 AM, m.roth at 5-cent.us wrote: >> Hi, folks, >> >> The other admin updated torque without testing it on one machine, and >> we had Issues. The first I knew was when a user reported qstat >> returning >> socket_connect_unix failed: 15137 >> socket_connect_unix failed: 15137 >> socket_connect_unix
2012 Dec 04
2
SUGGESTION: Add get/setCores() to 'parallel' (and command line option --max-cores)
In the 'parallel' package there is detectCores(), which tries its best to infer the number of cores on the current machine. This is useful if you wish to utilize the *maximum* number of cores on the machine. Several are using this to set the number of cores when parallelizing, sometimes also hardcoded within 3rd-party scripts/package code, but there are several settings where you wish to
2008 Oct 22
1
torque/psb & snow library
Hello all; I'm trying to execute parallel jobs through the snow library on a cluster built with torque/PBS. I'm successfully obtaining the cluster with: > system("cat $PBS_NODEFILE > cluster.txt") > mycluster <- scan(file="cluster.txt",what="character") > cl <- makeSOCKcluster(mycluster) The only problem, at the moment, is that if I use
2015 May 14
1
Condor cluster setup advice (pointers) needed
Dear Experts, Could someone recommend a "quick and dirty" HOWTO on setting up a Condor-based cluster? I have done a bunch of cluster setups, and I always used PBS (or torque lately). But my professor wants his to be Condor. I started reading the Condor documentation, and it is vast. A quick route, like: architecture chart, and parameters I need to configure on master/submit, and compute nodes - somehow
2019 Mar 07
2
Dynamically allow users with OpenSSH?
Peter and Jason, thanks for your replies on this. I was able to accomplish this with a combination of Peter's solution and setting "AuthorizedKeysFile none" as suggested in the Stack Overflow question. On Wed, Mar 6, 2019 at 2:30 PM Peter Moody <mindrot at hda3.com> wrote: > > why aren't the authorized keys/principals commands sufficient? > > $ getent group
2009 Jan 16
1
postfix relay and mail host for HPC cluster
I've so far been unable to make the following work: I have a small cluster with a master node (called bayes.bc.edu on the public network, and master.cl.bc.edu on the internal 10.0.0.0 network) and a number of nodes which are purely on the private network. I want the master to receive mail and deliver locally (or use .forward and alias rules) to messages sent from the nodes
2017 Jul 12
2
submitting R scripts with command_line_arguments to PBS HPC clusters
Dear all, please could you advise me on the following: I've written an R script that reads 3 arguments from the command line, i.e.: " args <- commandArgs(TRUE) TUMOR <- args[1] GERMLINE <- args[2] CHR <- args[3] ". When I submit the R script to a PBS HPC scheduler, I do the following (below), but ... I am getting an error message. (I am not posting the error message,
2017 Jul 12
0
submitting R scripts with command_line_arguments to PBS HPC clusters
This sounds like an operating system specific question, in that "submit the R script to a PBS HPC scheduler" would be the kind of action that would run R with very different environment variables and possibly different access credentials than your usual interactive terminal. A thorough reading of the "Installation and Administration Guide" and some study of your HPC
2002 Jul 25
1
password authentication failing for winbind
I originally posted this issue with the heading "winbind: challenge/response password authentication failed". I was using the redhat 7.3 samba 2.2.3 rpm then. I've upgraded to 2.2.5, but all that's changed is the return message. wbinfo -a VENUS0+tassadar%torque used to get me: plaintext password authentication succeeded challenge/response password authentication failed
2017 Jul 12
1
submitting R scripts with command_line_arguments to PBS HPC clusters
Hi, The problem is most likely that you need to call R CMD BATCH with your arguments and the R script inside a shell script that you submit with qsub. Unfortunately we don't use qsub anymore so can't test it, but it should be as follows: R script, e.g. test.R: > ##First read in the arguments listed at the command line > args=(commandArgs(TRUE)) > > ##args is now a list of
2009 Jul 27
2
Simple resource manager?
I need to serialize computing job requests for two different multicore machines, and in some near future, for a cluster. I have worked with SGE but it requires NFS and other administrative steps, plus it seems a bit overkill for my needs. I guess some simpler queue managing engine may have been developed, possibly over SSH. Any pointers? TIA. -- Eduardo Grosclaude Universidad Nacional del
2009 Mar 02
1
xyplot color question
Hi, I am plotting scatterplots of horsepower by torque, conditional on brand (I'm just making up the variables for this example), and the goal is to see both the scatterplot points as well as the smoothed line. When I do the following code, I get the same color for the points and line, and would like the colors to be different, such as black points and a red smoothed line. How do I do that?
2015 May 27
0
serious problem with torque
Mark, You might really want to compile torque from source (into an RPM if you'd like) and redistribute that. Every version is a little wonky and those of us that use(d) it often will poke around until we find a version / patch-set that makes us happy and stick with that for a bit. It's not an exact science and newer / higher versions are not always better. As for the downgrade comment: