Displaying 5 results from an estimated 5 matches for "pbs_mom".

2015 Oct 16
0
Semi-OT: torque, pbs_mom, cpuset, loglevel
We're running the current version of torque. On our small supercomputer (an SGI) there have been no updates to torque since July, but just recently (someone may be trying something new) /var/log/messages is on-and-off being spammed with: Oct 15 18:02:04 servername pbs_mom: LOG_INFO::create_job_cpuset, creating cpuset for job 1971[656].york.cit.nih.gov: 1 cpus (12), 1 mems (1) and I mean thousands of lines. I tried to adjust the loglevel of pbs_mom, but it appeared to make *no* change, and their "documentation" and "manpage" simply do not descri...
2008 Sep 30
1
Broken pipe, x86_64 CentOS 5.2
Hi All, I have a problem with torque (openPBS) on x86_64 CentOS 5.2. Just to add, there's no problem on 32-bit CentOS 5.2 or 64-bit Ubuntu 8.04. The problem is that pbs_mom's child quits without giving any error logs. [root@frodo9 torque-2.3.3]# strace -f pbs_mom . . . bind(6, {sa_family=AF_INET, sin_port=htons(15002), sin_addr=inet_addr("0.0.0.0")}, 16) = 0 time(NULL) = 1222785330 listen(6, 512)...
2008 Jul 07
1
SIGPIPE in assorted apps after "yum update"
...fter it attempts to fork into the background. I worked around that problem by building a new server with no additional repos, only CentOS, and dhcpd works fine on that system. Since then I have found the same problem, or similar problems, with a few more applications. Here is the tail of an strace of pbs_mom as it attempts to fork into the background: listen(5, 512) = 0 socket(PF_INET, SOCK_STREAM, IPPROTO_IP) = 6 setsockopt(6, SOL_SOCKET, SO_REUSEADDR, [1], 4) = 0 bind(6, {sa_family=AF_INET, sin_port=htons(15003), sin_addr=inet_addr("0.0.0.0")}, 16) = 0 listen(6, 51...
2012 Jun 26
0
abrtd problems
...ogram's running. Sometimes it won't do it for hours, other times it's literally every 10 min. I've run iostat, netstat, have top running, tail -f /var/log/dmesg, *nada*. Nothing out of the ordinary. One thing that's constant: as the system's coming back up, we see a segv of pbs_mom (we're using torque for clustering), and every time it saves the core dump, then a second or so later, Jun 26 14:29:58 <servername> abrtd: Package 'torque-mom' isn't signed with proper key Jun 26 14:29:58 <servername> abrtd: Corrupted or bad dump /var/spool/abrt/ccpp-201...
2009 Mar 23
1
question about top output
...84m 1.6g 4188 R 23 10.5 109:07.52 xhpl
8571 jgreen 25 0 2067m 1.6g 4188 R 23 10.4 109:08.51 xhpl
8569 jgreen 25 0 2072m 1.6g 4196 R 22 10.4 109:07.77 xhpl
8573 jgreen 25 0 2062m 1.6g 4204 R 22 10.4 109:08.23 xhpl
4457 root 15 0 12056 1424 992 S 0 0.0 8:01.62 pbs_mom
1 root 15 0 10316 792 660 S 0 0.0 0:02.74 init
2 root RT -5 0 0 0 S 0 0.0 0:00.01 migration/0
3 root 34 19 0 0 0 S 0 0.0 0:00.01 ksoftirqd/0
4 root RT -5 0 0 0 S 0 0.0 0:00.0...