Displaying 5 results from an estimated 5 matches for "pbs_mom".
2015 Oct 16
0
Semi-OT: torque, pbs_mom, cpuset, loglevel
We're running the current version of torque. On our small supercomputer
(an SGI), no updates to torque since July, but just recently - someone may
be trying something new - /var/log/messages is on-and-off being spammed
with
Oct 15 18:02:04 servername pbs_mom: LOG_INFO::create_job_cpuset, creating cpuset for job 1971[656].york.cit.nih.gov: 1 cpus (12), 1 mems (1)
and I mean thousands of lines. I tried to adjust the loglevel of pbs_mom,
but it appeared to make *no* change, and their "documentation" and
"manpage" simply does not descri...
2008 Sep 30
1
Broken pipe, x86_64 CentOS 5.2
Hi All,
I have a problem with torque (openPBS) on x86_64 CentOS 5.2. Just to add, there's
no problem on 32-bit CentOS 5.2 or 64-bit Ubuntu 8.04.
The problem is that pbs_mom's child quits without giving any error logs.
[root@frodo9 torque-2.3.3]# strace -f pbs_mom
.
.
.
bind(6, {sa_family=AF_INET, sin_port=htons(15002), sin_addr=inet_addr("0.0.0.0")}, 16) = 0
time(NULL) = 1222785330
listen(6, 512)...
2008 Jul 07
1
SIGPIPE in assorted apps after "yum update"
...fter it attempts to fork into the
background. I worked around that problem by building a new server with
no additional repos, only CentOS, and dhcpd works fine on that system.
Since then I have found the problem, or similar problems with a few
more applications. Here is the tail of an strace of pbs_mom as it
attempts to fork into the background:
listen(5, 512) = 0
socket(PF_INET, SOCK_STREAM, IPPROTO_IP) = 6
setsockopt(6, SOL_SOCKET, SO_REUSEADDR, [1], 4) = 0
bind(6, {sa_family=AF_INET, sin_port=htons(15003), sin_addr=inet_addr("0.0.0.0")}, 16) = 0
listen(6, 51...
2012 Jun 26
0
abrtd problems
...ogram's running. Sometimes it won't do
it for hours, other times it's literally every 10 min. I've run iostat,
netstat, have top running, tail -f /var/log/dmesg, *nada*. Nothing out of
the ordinary.
One thing that's constant: as the system's coming back up, we see a segv
of pbs_mom (we're using torque for clustering), and every time it saves
the core dump, then a second or so later,
Jun 26 14:29:58 <servername> abrtd: Package 'torque-mom' isn't signed with proper key
Jun 26 14:29:58 <servername> abrtd: Corrupted or bad dump /var/spool/abrt/ccpp-201...
2009 Mar 23
1
question about top output
...84m 1.6g 4188 R 23 10.5 109:07.52 xhpl
8571 jgreen 25 0 2067m 1.6g 4188 R 23 10.4 109:08.51 xhpl
8569 jgreen 25 0 2072m 1.6g 4196 R 22 10.4 109:07.77 xhpl
8573 jgreen 25 0 2062m 1.6g 4204 R 22 10.4 109:08.23 xhpl
4457 root 15 0 12056 1424 992 S 0 0.0 8:01.62 pbs_mom
1 root 15 0 10316 792 660 S 0 0.0 0:02.74 init
2 root RT -5 0 0 0 S 0 0.0 0:00.01 migration/0
3 root 34 19 0 0 0 S 0 0.0 0:00.01 ksoftirqd/0
4 root RT -5 0 0 0 S 0 0.0 0:00.0...