thr3ads.net - similar to: "HPC question: torques replacement"

Displaying 20 results from an estimated 1000 matches similar to: "HPC question: torques replacement"

2020 Apr 17

HPC question: torques replacement

Hey Valeri - IIRC, midway (and maybe midway2?) use slurm for job scheduling. I don't know how many of your faculty use both your nodes and midway, but maybe consolidating onto a single scheduler would be easier for them? (also, it's been a while ... hi!) Richard -----Original Message----- From: CentOS <centos-bounces at centos.org> On Behalf Of Valeri Galtsev Sent: Friday,

2008 Apr 26

Xen and Torque

Dear Xen users. Has anyone tried to integrate Xen with the Torque resource management system? Could you please give me some advice on a system I'm developing that relies on Torque? Let me describe the system first. The part of the system that talks to Torque should request a certain number of nodes in a cluster and launch a virtual machine instance on each of them (one VM instance per host).

2007 Dec 29

OpenMPI not compiled with Torque support

The OpenMPI package that ships with CentOS 5.1 does not seem to be compiled with torque support. It does, however, seem to be compiled with gridengine and slurm support. Would it be possible to get this changed?

2008 Sep 30

Broken pipe, x86_64 CentOS 5.2

Hi All, I have a problem with torque (openPBS) on x86_64 CentOS 5.2. Just to add there's no problem on a 32bit CentOS 5.2 or 64bit Ubuntu 8.04. The problem is that pbs_mom's child quits without giving any error logs. [root at frodo9 torque-2.3.3]# strace -f pbs_mom . . . bind(6, {sa_family=AF_INET, sin_port=htons(15002), sin_addr=inet_addr("0.0.0.0")}, 16) = 0 time(NULL)

2015 May 27

serious problem with torque

On Wed, May 27, 2015 10:55 am, Zachary Giles wrote:
> Mark, You might really want to compile torque from source (into an RPM
> if you'd like) and redistribute that. Every version is a little wonky
> and those of us that use(d) it often will poke around until we find a
> version / patch-set that makes us happy and stick with that for a bit.
> It's not an exact science and

2015 May 27

serious problem with torque

Hi, folks, The other admin updated torque without testing it on one machine, and we had Issues. The first I knew was when a user reported qstat returning
    socket_connect_unix failed: 15137
    socket_connect_unix failed: 15137
    socket_connect_unix failed: 15137
    qstat: cannot connect to server (null) (errno=15137) could not connect to trqauthd
Attempting to restart the pbs_server did the same.

2015 Feb 19

Anyone using torque/pbs/munge?

CentOS 6.6 I've got two servers, server1 and hbs (honkin' big server). Both are running munge, and torque... *separately*. My problem is that I've got users who want to be able to submit from server1 to hbs. I see that munged can be pointed to an alternate keyfile... but is there any way to tell qsub what to use? (And yes, I got on the torque users' list, and I'm trying

2015 May 27

serious problem with torque

Johnny Hughes wrote:
> On 05/27/2015 09:07 AM, m.roth at 5-cent.us wrote:
>> Hi, folks,
>>
>> The other admin updated torque without testing it on one machine, and
>> we had Issues. The first I knew was when a user reported qstat
>> returning
>> socket_connect_unix failed: 15137
>> socket_connect_unix failed: 15137
>> socket_connect_unix

2012 Dec 04

SUGGESTION: Add get/setCores() to 'parallel' (and command line option --max-cores)

In the 'parallel' package there is detectCores(), which tries its best to infer the number of cores on the current machine. This is useful if you wish to utilize the *maximum* number of cores on the machine. Several are using this to set the number of cores when parallelizing, sometimes also hardcoded within 3rd-party scripts/package code, but there are several settings where you wish to
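The idea of capping the detected core count can be illustrated with a minimal Python sketch. This is an analogue only, not the R 'parallel' API: the function name `get_cores` and the environment variable `MAX_CORES` are hypothetical, chosen for illustration.

```python
import os


def get_cores(env_var="MAX_CORES"):
    """Return the number of cores to use, honoring an optional cap.

    `env_var` names a hypothetical environment variable holding the cap.
    """
    detected = os.cpu_count() or 1  # cpu_count() may return None
    cap = os.environ.get(env_var)
    if cap is None:
        return detected
    try:
        limit = int(cap)
    except ValueError:
        # Ignore a cap that is not an integer and fall back to detection.
        return detected
    # Never report fewer than one core or more than the machine has.
    return max(1, min(detected, limit))
```

Code that asks `get_cores()` instead of the raw detected count can then be limited from outside, for example by a scheduler or an administrator, without editing third-party scripts.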

2008 Oct 22

torque/psb & snow library

Hello all; I'm trying to execute parallel jobs through the snow library on a cluster built with torque/PBS. I'm successfully obtaining the cluster with:
> system("cat $PBS_NODEFILE > cluster.txt")
> mycluster <- scan(file="cluster.txt", what="character")
> cl <- makeSOCKcluster(mycluster)
The only problem, at the moment, is that if I use
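For readers unfamiliar with the node file, here is a minimal Python sketch (not the poster's R code) of the same first step: reading the file named by `$PBS_NODEFILE` into a host list. It assumes the usual Torque/PBS convention of one hostname per line, repeated once per allocated slot. The function names `read_nodefile` and `slots_per_host` are hypothetical.

```python
import os
from collections import Counter


def read_nodefile(path):
    """Return the hostnames listed in a PBS node file, one entry per slot."""
    with open(path) as handle:
        # Skip blank lines; each remaining line is one allocated slot.
        return [line.strip() for line in handle if line.strip()]


def slots_per_host(hosts):
    """Count how many slots were allocated on each host, in first-seen order."""
    return dict(Counter(hosts))


if __name__ == "__main__":
    # PBS_NODEFILE is set by the scheduler inside a running job.
    nodefile = os.environ.get("PBS_NODEFILE")
    if nodefile:
        print(slots_per_host(read_nodefile(nodefile)))
    else:
        print("PBS_NODEFILE is not set; run this inside a Torque/PBS job.")
```

Counting slots per host matters because a host that appears several times in the list will receive several workers when the list is handed to a socket-cluster constructor unchanged.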

2015 May 14

Condor cluster setup advice (pointers) needed

Dear Experts, Could someone recommend a "quick and dirty" HOWTO on a Condor-based cluster? I did a bunch of cluster setups, and I always used PBS (or torque lately). But my professor wants his to be Condor. I started reading the Condor documentation, and it is vast. And a quick route, like: architecture chart, and parameters I need to configure on master/submit, and compute nodes - somehow

2019 Mar 07

Dynamically allow users with OpenSSH?

Peter and Jason, thanks for your replies on this. I was able to accomplish this with a combination of Peter's solution and setting "AuthorizedKeysFile none" as suggested in the Stack Overflow question.
On Wed, Mar 6, 2019 at 2:30 PM Peter Moody <mindrot at hda3.com> wrote:
>
> why aren't the authorized keys/principals commands sufficient?
>
> $ getent group

2009 Jan 16

postfix relay and mail host for HPC cluster

I've so far been unable to make the following work: I have a small cluster with a master node (called bayes.bc.edu on the public network, and master.cl.bc.edu on the internal 10.0.0.0 network) and a number of nodes which are purely on the private network. I want the master to receive mail and deliver locally (or use .forward and alias rules) to messages sent from the nodes

2017 Jul 12

submitting R scripts with command_line_arguments to PBS HPC clusters

Dear all, please could you advise me on the following: I've written an R script that reads 3 arguments from the command line, i.e.:
args <- commandArgs(TRUE)
TUMOR <- args[1]
GERMLINE <- args[2]
CHR <- args[3]
When I submit the R script to a PBS HPC scheduler, I do the following (below), but ... I am getting an error message. (I am not posting the error message,

2017 Jul 12

submitting R scripts with command_line_arguments to PBS HPC clusters

This sounds like an operating system specific question, in that "submit the R script to a PBS HPC scheduler" would be the kind of action that would run R with very different environment variables and possibly different access credentials than your usual interactive terminal. A thorough reading of the "Installation and Administration Guide" and some study of your HPC

2002 Jul 25

password authentication failing for winbind

I originally posted this issue with the heading "winbind: challenge/response password authentication failed". I was using the redhat 7.3 samba 2.2.3 rpm then. I've upgraded to 2.2.5, but all that's changed is the return message. wbinfo -a VENUS0+tassadar%torque used to get me: plaintext password authentication succeeded challenge/response password authentication failed

2017 Jul 12

submitting R scripts with command_line_arguments to PBS HPC clusters

Hi, Most likely the problem is that you need to call R CMD BATCH, with your arguments and the R script, inside a shell script that you submit with qsub. Unfortunately we don't use qsub anymore so I can't test it, but it should be as follows: R script, e.g. test.R:
> ##First read in the arguments listed at the command line
> args=(commandArgs(TRUE))
>
> ##args is now a list of
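The wrapper-script approach described in this reply can be sketched in Python as a small generator for the shell script that qsub would receive. This is a sketch under stated assumptions, not the poster's method: it swaps in `Rscript` (which passes trailing arguments through to `commandArgs(TRUE)`) in place of `R CMD BATCH`, and the function name `build_pbs_wrapper`, the job name, and the example file names are hypothetical.

```python
import shlex


def build_pbs_wrapper(r_script, args, job_name="r_job"):
    """Return the text of a shell script that runs an R script with arguments."""
    # Quote every argument so spaces or shell metacharacters survive intact.
    quoted_args = " ".join(shlex.quote(str(a)) for a in args)
    lines = [
        "#!/bin/sh",
        # "#PBS -N" sets the job name shown by qstat.
        f"#PBS -N {job_name}",
        # Jobs start in the home directory; return to the submit directory.
        'cd "$PBS_O_WORKDIR"',
        # Rscript hands the trailing arguments to commandArgs(TRUE).
        f"Rscript {shlex.quote(r_script)} {quoted_args}",
    ]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical inputs mirroring the three arguments in the question.
    print(build_pbs_wrapper("test.R", ["tumor.bam", "germline.bam", "chr1"]))
```

The generated text would be saved to a file and handed to qsub. Keeping the arguments inside the wrapper avoids relying on the scheduler to forward them to R.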

2009 Jul 27

Simple resource manager?

I need to serialize computing job requests for two different multicore machines, and in some near future, for a cluster. I have worked with SGE but it requires NFS and other administrative steps, plus it seems a bit overkill for my needs. I guess some simpler queue managing engine may have been developed, possibly over SSH. Any pointers? TIA. -- Eduardo Grosclaude Universidad Nacional del

2009 Mar 02

xyplot color question

Hi, I am plotting scatterplots of horsepower by torque, conditional on brand (I'm just making up the variables for this example), and the goal is to see both the scatterplot points as well as the smoothed line. When I do the following code, I get the same color for the points and line, and would like the colors to be different, such as black points and a red smoothed line. How do I do that?

2015 May 27

serious problem with torque

Mark, You might really want to compile torque from source (into an RPM if you'd like) and redistribute that. Every version is a little wonky and those of us that use(d) it often will poke around until we find a version / patch-set that makes us happy and stick with that for a bit. It's not an exact science and newer / higher versions are not always better. As for the downgrade comment:
