Displaying 20 results from an estimated 400 matches similar to: "OSTs inactive on one client (only)"
2008 Mar 06
2
strange lustre errors
Hi,
On a few of the hpc cluster nodes, i am seeing a new lustre
error that is pasted below. The volumes are working fine and there
is nothing on the oss and mds to report.
LustreError: 5080:0:(import.c:607:ptlrpc_connect_interpret())
data3-OST0000_UUID at 192.168.2.98@tcp changed handle from
0xfe51139158c64fae to 0xfe511392a35878b3; copying, but this may
foreshadow disaster
2008 Mar 07
2
Multihomed question: want Lustre over IB andEthernet
Chris,
Perhaps you need to perform some write_conf like command. I''m not sure if this is needed in 1.6 or not.
Shane
----- Original Message -----
From: lustre-discuss-bounces at lists.lustre.org <lustre-discuss-bounces at lists.lustre.org>
To: lustre-discuss <lustre-discuss at lists.lustre.org>
Sent: Fri Mar 07 12:03:17 2008
Subject: Re: [Lustre-discuss] Multihomed
2007 Nov 07
1
ll_cfg_requeue process timeouts
Hi,
Our environment is: 2.6.9-55.0.9.EL_lustre.1.6.3smp
I am getting following errors from two OSS''s
...
Nov 7 10:39:51 storage09.beowulf.cluster kernel: LustreError:
23045:0:(client.c:519:ptlrpc_import_delay_req()) @@@ IMP_INVALID
req at 00000100b410be00 x4190687/t0 o101->MGS at MGC10.143.245.201@tcp_0:26
lens 232/240 ref 1 fl Rpc:/0/0 rc 0/0
Nov 7 10:39:51
2008 Jan 10
4
1.6.4.1 - active client evicted
Hi!
We''ve started to poke and prod at Lustre 1.6.4.1, and it seems to
mostly work (we haven''t had it OOPS on us yet like the earlier
1.6-versions did).
However, we had this weird incident where an active client (it was
copying 4GB files and running ls at the time) got evicted by the MDS
and all OST''s. After a while logs indicate that it did recover the
connection
2007 Oct 25
1
Error message
I''m seeing this error message on one of my OSS''s but not the other
three. Any idea what is causing it?
Oct 25 13:58:56 oss2 kernel: LustreError:
3228:0:(client.c:519:ptlrpc_import_delay_req()) @@@ IMP_INVALID
req at f6b13200 x18040/t0 o101->MGS at MGC192.168.0.200@tcp_0:26 lens 176/184
ref 1 fl Rpc:/0/0 rc 0/0
Oct 25 13:58:56 oss2 kernel: LustreError:
2008 Mar 03
1
Quota setup fails because of OST ordering
Hi all,
after installing a Lustre test file system consisting of 34 OSTs, I
encountered a strange error when trying to set up quotas:
lfs quotacheck gave me an "Input/Output error", while in
/var/log/kern.log I found a Lustre error
LustreError: 20807:0:(quota_check.c:227:lov_quota_check()) lov idx 32
inactive
Indeed, in /proc/fs/lustre/lov/.../target_obd all 34 OSTs were listed
2010 Sep 16
2
Lustre module not getting loaded in MDS
Hello All,
I have installed and configured Lustre 1.8.4 on SuSe 11.0 and everything
works fine if i run modprobe lustre and when the lustre module is getting
loaded. But when the server reboots it is not getting loaded. Kindly help.
Lnet is configured in /etc/modprobe.conf.local as below.
options lnet networks=tcp0(eth0) accept=all
For loading lustre module i tried including lustre module in
2011 Aug 26
1
Add lines to xy plot with temporal x axis
Hi. I'm trying to add some vertical lines to an xyplot in which x axis is a
temporal variable (class=dates) and y axis a factor variable.
This is my code:
xyplot(factor(Abacus$Emisor)~Abacus$Dia,xlab="Date", ylab="Fish",pch=124,
scales=list(x=list(format="%b %Y",tick.number=30))
, panel=function(...){
panel.xyplot(...)
panel.abline(v=08/08/2008)
})
I'd
2010 Jun 21
3
features.conf - parkedcalls - transfer
Hello dear list.
I am having issues on parkedcalls.
I am using a Cisco SPA525G as a test phone, and I have the transfer button there when I am in a call,
But when I want to transfer the current call I am in, I push the transfer button, and onscreen I se "Enter Number", and if I enter ex sip 200, I have to wait
Almost 10 seconds, before the transfer to sip 200 is made, can I reduce
2008 Jan 31
2
lustre+samba
Dear All,
I try to use our cluster though samba share. Everything work fine, but
I think, we should have -o flock at lustre mount time.
Great, it''s work. But when I want to save a file on the share, I get
this on the logs:
Jan 31 10:45:24 opteron-ren-11 kernel: LustreError: 24836:0:(file.c:2309:ll_file_flock()) unknown fcntl lock type: 32
Jan 31 10:45:24 opteron-ren-11 kernel:
2010 Jun 18
1
Error trying to add context: Context 'internal' tries to include nonexistent context 'nighttime|12:30-8:00|mon-fri|*|*'
Hello again dear list.
Could you please help with this?
Thank you for all support, you are great, and i am now at a late stage in the setup and tweaking this server,
So I hope you can help me again.
I Can't make include the context nighttime. Just to demonstrate if it works, I have a playback function there.
But CLI reports:
CLI
[Jun 18 14:20:22] WARNING[2287]: pbx.c:9542
2010 Jun 14
1
Call queues - issues, can't make it work.
Hello there
I have been struggling with queues, because i think this is the right module for our business.
My main goal, is when we receive external calls, the receptionist should be able to transfer the call to us
Technicians, and I am trying to add 2 extensions to a queue name [teknisk]
Extension 301 and 302.
I have a test setup now which I thought should look like this:
When a external call
1997 Dec 05
3
New Program: Abacus Sentry - Port Scan Detector
Hello,
I just made available a beta version of a port scan detector that I''ve
been working on. The program, called Abacus Sentry, is a port scan/probe
detector that offers what I think are a number of unique and useful
features:
- Runs on TCP or UDP sockets. Configurable by the user to bind to
multiples of sockets for increased detection coverage.
- Adjustable scan detection value with
2002 Sep 18
2
Abacus DOS problem Lock Selector
A lawyer in California is trying to change to linux. In CA, there are a
number of programs required by the courts, and so far none are ported to
linux at any price. His reason for wishing to change has to do with the
fact that Microsoft now controls his access to his own files, and he finds
this unacceptable.
I tried one of his apps, an old DOS version of Abacus. Using the WINe with
Redhat
2008 May 30
2
inconsistent output when using variable substitution
I am extremely puzzled by this behavior in R. I have a data frame called
Trials in which I have results from an experiment. I am trying to do a
subjects analysis, but getting weird results. Each row has 1 trial in it,
which includes a column for the subject number I get the list of subject
numbers like so:
> Subj=unique(sort(Trials$Subj))
Then I loop over them. But I get strange results. As
2002 Jul 11
2
Samba + Abacus Law anyone?
I'm attempting to install Abacus Law (v13) under Samba and Linux. Install
completes, and I get as far as a single user successfully logged in, but
the 2nd user is nothing but trouble. Samba logs show nothing unusual to my
semi-trained eye, but locking code or timing is a likely suspect. Technical
support is of no help as they haven't tested (and therefore don't support)
the
2008 Feb 22
0
lustre error
Dear All,
Yesterday evening or cluster has stopped.
Two of our nodes tried to take the resource from each other, they
haven''t seen the other side, if I saw well.
I stopped heartbeat, resources, start it again, and back to online,
worked fine.
This morning I saw this in logs:
Feb 22 03:25:07 node4 kernel: Lustre:
7:0:(linux-debug.c:98:libcfs_run_upcall()) Invoked LNET upcall
2007 Sep 28
0
llog_origin_handle_cancel and other LustreErrors
Hi again!
Same setup as before (Lustre 1.6.2 + 2.6.18 kernel).
This time things suddenly started to be very slow (as in periodically
stalling), and we found a bunch of llog_ LustreErrors on the MDS. Some
time later stuff had automagically recovered and is back to normal
speed.
Any idea on the meaning/cause of these errors?
What are the seriousness of "LustreError" errors in
2008 Apr 15
5
o2ib module prevents shutdown
Hello,
Not sure if this is the right forum: I''m encountering difficulties
with o2ib which prevents an LNET shutdown from proceeding:
Unloading OpenIB kernel modules:NET: Unregistered protocal family 27
Failed to unload rdma_cm
Failed to unload rdma_cm
Failed to unload ib_cm
Failed to unload ib_sa
LustreError: 131-3: Received notification of device removal
Please shutdown LNET
2010 Aug 11
0
OSS: IMP_CLOSED errors
Hello.
OS CentOS 5.4
uname -a
Linux oss0 2.6.18-128.7.1.el5_lustre.1.8.1.1 #1 SMP Tue Oct 6 05:48:57 MDT 2009 x86_64 x86_64 x86_64 GNU/Linux
Lustre 1.8.1.1
OSS server.
A lot of errors in /var/log/messages:
Aug 10 14:46:34 oss0 kernel: LustreError: 2802:0:(client.c:775:ptlrpc_import_delay_req()) Skipped 1 previous similar message
Aug 10 15:07:01 oss0 kernel: LustreError: