Displaying 20 results from an estimated 1000 matches similar to: "serious problem with torque"
2015 May 27
2
serious problem with torque
Johnny Hughes wrote:
> On 05/27/2015 09:07 AM, m.roth at 5-cent.us wrote:
>> Hi, folks,
>>
>> The other admin updated torque without testing it on one machine, and
>> we had Issues. The first I knew was when a user reported qstat
>> returning
>> socket_connect_unix failed: 15137
>> socket_connect_unix failed: 15137
>> socket_connect_unix
2015 May 27
1
serious problem with torque
On Wed, May 27, 2015 10:55 am, Zachary Giles wrote:
> Mark, You might really want to compile torque from source (into an RPM
> if you'd like) and redistribute that. Every version is a little wonky
> and those of us that use(d) it often will poke around until we find a
> version / patch-set that makes us happy and stick with that for a bit.
> It's not an exact science and
2015 May 27
0
serious problem with torque
Mark, You might really want to compile torque from source (into an RPM
if you'd like) and redistribute that. Every version is a little wonky
and those of us that use(d) it often will poke around until we find a
version / patch-set that makes us happy and stick with that for a bit.
It's not an exact science and newer / higher versions are not always better.
As for the downgrade comment:
2015 May 27
0
serious problem with torque
On 05/27/2015 09:07 AM, m.roth at 5-cent.us wrote:
> Hi, folks,
>
> The other admin updated torque without testing it on one machine, and
> we had Issues. The first I knew was when a user reported qstat
> returning
> socket_connect_unix failed: 15137
> socket_connect_unix failed: 15137
> socket_connect_unix failed: 15137
> qstat: cannot connect to server (null)
2015 May 27
0
serious problem with torque
On Wed, May 27, 2015 9:46 am, m.roth at 5-cent.us wrote:
> Johnny Hughes wrote:
>> On 05/27/2015 09:07 AM, m.roth at 5-cent.us wrote:
>>> Hi, folks,
>>>
>>> The other admin updated torque without testing it on one machine,
>>> and
>>> we had Issues. The first I knew was when a user reported qstat
>>> returning
>>>
2008 Apr 26
1
Xen and Torque
Dear Xen users.
Have anyone tried to integrate Xen with Torque resource management system?
Could you please help me with an advice for a system I''m developing that
relies on torque.
Let me describe the system first.
The part of the system that talks with torque should request a certain
amount on nodes of a cluster and launch there a virtual machine instance
(one vm instance per host).
2008 Oct 22
1
torque/psb & snow library
Hello all;
I'm trying to execute parallel jobs trough library snow on a cluster built
through torque/PSB. I'm succesfully obtaining the cluster with:
>system("cat $PBS_NODEFILE > cluster.txt")
>mycluster <- scan(file="cluster.txt",what="character")
>cl <- makeSOCKcluster(mycluster)
The only problem, at the moment, is that if I use
2007 Dec 29
2
OpenMPI not compiled with Torque support
The OpenMPI package that ships with CentOS 5.1 does not seem to be
compiled with torque support. It does, however, seem to be compiled
with gridengine and slurm support. Would it be possible to get this
changed?
2015 May 27
1
was, Re: serious problem with torque, is firefox
Valeri Galtsev wrote:
> On Wed, May 27, 2015 9:46 am, m.roth at 5-cent.us wrote:
>> Johnny Hughes wrote:
>>> On 05/27/2015 09:07 AM, m.roth at 5-cent.us wrote:
<snip>
>>
>> Thanks, Johnny. I *just* posted an apology, that I realized it was an
>> EPEL issue.... Talk about an "upgrade disaster"! I think the other admin -
>> he's been here
2008 Jul 07
1
SIGPIPE in assorted apps after "yum update"
Hello,
I have several systems which I recently updated with
yum -y update
to all the latest packages. These systems use yum-priorities and use
the CentOS (priority 1) EPEL (priority 5) and rpmforge (priority 10)
repositories. After the updates, dhcpd stopped working with a SIGPIPE
error which occurs shortly after it attempts to fork into the
background. I worked around that problem by building
2020 Apr 17
4
HPC question: torques replacement
Dear Experts,
I know there are many HPC (high performance computing) experts on this
list. I'd like to ask your advise.
Almost two decades ago I chose to go with OpenPBS (turned down condor
and other alternatives for whatever reason) for clusters and number
crunchers I support for the Department at the university. It turned out
to be not bad, long lived choice. At some point I smoothly
2003 Jun 06
2
R help: Correlograms
Hello,
I have time series and need to draw simple and partial correlograms with associated Q-statistics (the same as in EViews). Can I do it in R? Thanks
---------------------------------
[[alternate HTML version deleted]]
2015 Feb 19
0
Anyone using torque/pbs/munge?
CentOS 6.6
I've got two servers, server1 and hbs (honkin' big server). Both are
running munge, and torque... *separately*. My problem is that I've got
users who want to be able to submit from server1 to hbs. I see that munged
can be pointed to an alternate keyfile... but is there any way to tell
qsub what to use?
(And yes, I got on the torque users' list, and I'm trying
2015 Oct 16
0
Semi-OT: torque, pbs_mom, cpuset, loglevel
We're running the current version of torque. On our small supercomputer
(an SGI), no updates to torque since July, but just recently - someone may
be trying something new - /var/log/messages is on-and-off being spammed
with Oct 15 18:02:04 servername pbs_mom: LOG_INFO::create_job_cpuset,
creating cpuset for job 1971[656].york.cit.nih.gov: 1 cpus (12), 1 mems
(1)
and I mean thousands of lines.
2008 Sep 30
1
Broken pipe, x86_64 CentOS 5.2
Hi All,
I have a problem with torque (openPBS) on x86_64 CentOS 5.2. Just to add there's
no problem on a 32bit CentOS 5.2 or 64bit Ubuntu 8.04.
The problem is that pbs_mom's child quits without giving any error logs.
[root at frodo9 torque-2.3.3]# strace -f pbs_mom
.
.
.
bind(6, {sa_family=AF_INET, sin_port=htons(15002),
sin_addr=inet_addr("0.0.0.0")}, 16) = 0
time(NULL)
2002 Jul 25
1
password authentication failing for winbind
I originally posted this issue with the heading "winbind: challenge/response
password authentication failed". I was using the redhat 7.3 samba 2.2.3 rpm
then. I've upgraded to 2.2.5, but all that's changed is the return message.
wbinfo -a VENUS0+tassadar%torque
used to get me:
plaintext password authentication succeeded
challenge/response password authentication failed
2009 Jul 27
2
Simple resource manager?
I need to serialize computing job requests for two different multicore
machines, and in some near future, for a cluster. I have worked with
SGE but it requires NFS and other administrative steps, plus it seems
a bit overkill for my needs. I guess some simpler queue managing
engine may have been developed, possibly over SSH. Any pointers? TIA.
--
Eduardo Grosclaude
Universidad Nacional del
2009 Mar 02
1
xyplot color question
Hi,
I am plotting scatterplots of horsepower by torque, conditional on brand
(I'm just making up the variables for this example), and the goal is to see
both the scatterplot points as well as the smoothed line. When I do the
following code, I get the same color for the points and line, and would like
the colors to be different, such as black points and a red smoothed line.
How do I do that?
2010 Apr 07
6
Consecutive Jobs
Anyone know how to submit jobs to at or anything else that allows jobs
submitted to a queue to be executed consecutively?
I have a series of servers that submits a job via an ssh background
job but I can only have one execute at any given time.
Possibly some clever bash work?
Thanks!
jlc
2011 Nov 17
1
set random numbers seed for different cpu's
Hi
I'm running the same R script (throuth linux shell) of several cpu's. This
R program uses random numbers and the result should be different every time.
But if put jobs (through Torque) for several cpu's I get the same result. As
a resealt my program saves numbers in file with randomly generated names.
works like a charm on one cpu, but I get the same result from different