Displaying 20 results from an estimated 8000 matches similar to: "Anyone using torque/pbs/munge?"
2017 Jul 12
1
submitting R scripts with command_line_arguments to PBS HPC clusters
Hi,
The problem is most likely that you need to call R CMD BATCH, with your arguments and the R script, inside a shell script that you submit with qsub.
Unfortunately we don't use qsub anymore, so I can't test it, but it should be as follows:
R script, e.g. test.R:
> ##First read in the arguments listed at the command line
> args=(commandArgs(TRUE))
>
> ##args is now a list of
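A minimal sketch of the wrapper approach described above, not the poster's actual script (the excerpt is cut off before it). It only prints a PBS job script that calls R CMD BATCH on test.R. The job name and the three argument values are hypothetical placeholders, and nothing is submitted.

```shell
#!/bin/sh
# Sketch: generate a PBS job script that hands arguments to an R script.
# Assumptions (not from the thread): job name r_args_job and the
# argument values tumor.bam, germline.bam, chr1 are placeholders.

make_job_script() {
    # $1 = R script; remaining arguments end up in commandArgs(TRUE)
    rscript=$1
    shift
    printf '#!/bin/sh\n'
    printf '#PBS -N r_args_job\n'
    # PBS_O_WORKDIR is set by PBS/Torque to the directory qsub was run from
    printf 'cd "$PBS_O_WORKDIR"\n'
    # R CMD BATCH takes the script arguments as one quoted '--args ...' option
    printf "R CMD BATCH --no-save '--args %s' %s %s.out\n" "$*" "$rscript" "$rscript"
}

job=$(make_job_script test.R tumor.bam germline.bam chr1)
printf '%s\n' "$job"
```

Redirecting the printed text to a file (for example job.sh) and running qsub on that file would submit it; whether further #PBS directives are needed depends on the site configuration.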
2015 May 27
0
serious problem with torque
Mark, You might really want to compile torque from source (into an RPM
if you'd like) and redistribute that. Every version is a little wonky
and those of us that use(d) it often will poke around until we find a
version / patch-set that makes us happy and stick with that for a bit.
It's not an exact science and newer / higher versions are not always better.
As for the downgrade comment:
2015 May 27
0
serious problem with torque
On 05/27/2015 09:07 AM, m.roth at 5-cent.us wrote:
> Hi, folks,
>
> The other admin updated torque without testing it on one machine, and
> we had Issues. The first I knew was when a user reported qstat
> returning
> socket_connect_unix failed: 15137
> socket_connect_unix failed: 15137
> socket_connect_unix failed: 15137
> qstat: cannot connect to server (null)
2015 May 27
0
serious problem with torque
On Wed, May 27, 2015 9:46 am, m.roth at 5-cent.us wrote:
> Johnny Hughes wrote:
>> On 05/27/2015 09:07 AM, m.roth at 5-cent.us wrote:
>>> Hi, folks,
>>>
>>> The other admin updated torque without testing it on one machine,
>>> and
>>> we had Issues. The first I knew was when a user reported qstat
>>> returning
>>>
2015 May 27
1
serious problem with torque
On Wed, May 27, 2015 10:55 am, Zachary Giles wrote:
> Mark, You might really want to compile torque from source (into an RPM
> if you'd like) and redistribute that. Every version is a little wonky
> and those of us that use(d) it often will poke around until we find a
> version / patch-set that makes us happy and stick with that for a bit.
> It's not an exact science and
2015 May 27
2
serious problem with torque
Johnny Hughes wrote:
> On 05/27/2015 09:07 AM, m.roth at 5-cent.us wrote:
>> Hi, folks,
>>
>> The other admin updated torque without testing it on one machine, and
>> we had Issues. The first I knew was when a user reported qstat
>> returning
>> socket_connect_unix failed: 15137
>> socket_connect_unix failed: 15137
>> socket_connect_unix
2008 Apr 26
1
Xen and Torque
Dear Xen users.
Has anyone tried to integrate Xen with the Torque resource management system?
Could you please give me some advice on a system I'm developing that
relies on torque.
Let me describe the system first.
The part of the system that talks with torque should request a certain
number of nodes from a cluster and launch a virtual machine instance on
each of them (one vm instance per host).
2017 Jul 12
0
submitting R scripts with command_line_arguments to PBS HPC clusters
This sounds like an operating system specific question, in that "submit the R script to a PBS HPC scheduler" would be the kind of action that would run R with very different environment variables and possibly different access credentials than your usual interactive terminal. A thorough reading of the "Installation and Administration Guide" and some study of your HPC
2015 May 27
5
serious problem with torque
Hi, folks,
The other admin updated torque without testing it on one machine, and
we had Issues. The first I knew was when a user reported qstat
returning
socket_connect_unix failed: 15137
socket_connect_unix failed: 15137
socket_connect_unix failed: 15137
qstat: cannot connect to server (null) (errno=15137) could not connect to
trqauthd
Attempting to restart the pbs_server did the same.
2017 Jul 12
2
submitting R scripts with command_line_arguments to PBS HPC clusters
Dear all,
please could you advise me on the following: I've written an R script that
reads 3 arguments from the command line, i.e. :
" args <- commandArgs(TRUE)
TUMOR <- args[1]
GERMLINE <- args[2]
CHR <- args[3] ".
when I submit the R script to a PBS HPC scheduler, I do the following
(below), but ... I am getting an error message.
(I am not posting the error message,
2017 Jun 20
2
CentOS 6 and luksOpen
Upgraded a RAID. Copied everything from backup.
And then my manager said I had to encrypt the drive.
I've done that, and made the filesystem, but I can't mount it.
CentOS 6.
I have the entry in /etc/crypttab, and a key in /etc/crypt.pw, and the
luks UUID in /etc/fstab. I cannot find the command that tells it to create
the device in /dev/mapper from the info in /etc/crypttab.
Clues for
2014 Mar 25
3
NVidia, again
Got a HBS (y'know, Honkin' Big Server, one o' them technical terms), a
Dell 720 with two Tesla GPUs. I updated the o/s, 6.5, and I cannot get the
GPUs recognized. As a last resort, I d/l NVidia's proprietary
driver/installer, 325, and it builds fine... I've yum removed the
kmod-nvidia I had on the system, nouveau is blacklisted, and when I
reboot, lsmod shows me nvidia loaded,
2020 Apr 17
4
HPC question: torques replacement
Dear Experts,
I know there are many HPC (high performance computing) experts on this
list. I'd like to ask your advice.
Almost two decades ago I chose to go with OpenPBS (turned down condor
and other alternatives for whatever reason) for clusters and number
crunchers I support for the Department at the university. It turned out
to be a not bad, long-lived choice. At some point I smoothly
2020 Apr 17
0
HPC question: torques replacement
Hey Valeri -
IIRC, midway (and maybe midway2?) use slurm for job scheduling. I don't know how many of your faculty use both your nodes and midway, but maybe consolidating on to a single scheduler would be easier for them?
(also, it's been a while ... hi!)
Richard
-----Original Message-----
From: CentOS <centos-bounces at centos.org> On Behalf Of Valeri Galtsev
Sent: Friday,
2008 May 10
1
DO NOT REPLY [Bug 5457] New: Add a client-side --munge-symlinks option
https://bugzilla.samba.org/show_bug.cgi?id=5457
Summary: Add a client-side --munge-symlinks option
Product: rsync
Version: 3.0.3
Platform: Other
OS/Version: Linux
Status: NEW
Severity: enhancement
Priority: P3
Component: core
AssignedTo: wayned@samba.org
ReportedBy: matt@mattmccutchen.net
2017 Jun 20
2
CentOS 6 and luksOpen
Leon Fauster wrote:
>> Am 20.06.2017 um 16:53 schrieb m.roth at 5-cent.us:
>>
>> Upgraded a RAID. Copied everything from backup.
>>
>> And then my manager said I had to encrypt the drive.
>>
>> I've done that, and made the filesystem, but I can't mount it.
>>
>> CentOS 6.
>> I have the entry in /etc/crypttab, and a key in
2017 Jun 20
0
CentOS 6 and luksOpen
> Am 20.06.2017 um 16:53 schrieb m.roth at 5-cent.us:
>
> Upgraded a RAID. Copied everything from backup.
>
> And then my manager said I had to encrypt the drive.
>
> I've done that, and made the filesystem, but I can't mount it.
>
> CentOS 6.
> I have the entry in /etc/crypttab, and a key in /etc/crypt.pw, and the
> luks UUID in /etc/fstab. I cannot
2017 Jun 20
0
CentOS 6 and luksOpen
> Am 20.06.2017 um 17:12 schrieb m.roth at 5-cent.us:
>
> Leon Fauster wrote:
>>> Am 20.06.2017 um 16:53 schrieb m.roth at 5-cent.us:
>>>
>>> Upgraded a RAID. Copied everything from backup.
>>>
>>> And then my manager said I had to encrypt the drive.
>>>
>>> I've done that, and made the filesystem, but I can't
2015 Oct 16
0
Semi-OT: torque, pbs_mom, cpuset, loglevel
We're running the current version of torque. On our small supercomputer
(an SGI), no updates to torque since July, but just recently - someone may
be trying something new - /var/log/messages is on-and-off being spammed
with Oct 15 18:02:04 servername pbs_mom: LOG_INFO::create_job_cpuset,
creating cpuset for job 1971[656].york.cit.nih.gov: 1 cpus (12), 1 mems
(1)
and I mean thousands of lines.
2008 Oct 22
1
torque/psb & snow library
Hello all;
I'm trying to execute parallel jobs through the snow library on a cluster built
with torque/PBS. I'm successfully obtaining the cluster with:
>system("cat $PBS_NODEFILE > cluster.txt")
>mycluster <- scan(file="cluster.txt",what="character")
>cl <- makeSOCKcluster(mycluster)
The only problem, at the moment, is that if I use