Displaying 20 results from an estimated 5000 matches similar to: "upgrade from 1.6.2 -> 1.6.3 questions?"
2008 Jan 15
19
How do you make an MGS/OSS listen on 2 NICs?
I am running on CentOS 5 distribution without adding any updates from CentOS. I am using the lustre 1.6.4.1 kernel and software.
I have two NICs that run though different switches.
I have the lustre options in my modprobe.conf to look like this:
options lnet networks=tcp0(eth1,eth0)
My MGS seems to be only listening on the first interface however.
When I try and ping the 1st interface (eth1)
2007 Oct 23
5
SLES kernel update
Hello CFS,
I was wondering if the new SLES10 kernel will be part of the 1.6.4
release. It appears to be tracked by:
https://bugzilla.lustre.org/show_bug.cgi?id=13874
As this contains a major security fix (privilege escalation), it is a
mandatory upgrade in many places and it would be nice to not have to
roll our own kernel.
Thanks,
Kit
2007 Aug 30
2
Announcing an updated document version (v1.7) of the Lustre 1.6 Operations Manual
Dear Lustre users,
CFS is pleased to announce an updated document version (v1.7) of the
Lustre? 1.6 Operations Manual, available in both PDF and HTML formats at
http://www.manual.lustre.org.
This edition of the Operations Manual includes the following enhancement:
* Addition of mballoc3 content to the Lustre Proc chapter
If you have any questions, suggestions, or recommended edits to the
2007 Aug 30
2
Announcing an updated document version (v1.7) of the Lustre 1.6 Operations Manual
Dear Lustre users,
CFS is pleased to announce an updated document version (v1.7) of the
Lustre? 1.6 Operations Manual, available in both PDF and HTML formats at
http://www.manual.lustre.org.
This edition of the Operations Manual includes the following enhancement:
* Addition of mballoc3 content to the Lustre Proc chapter
If you have any questions, suggestions, or recommended edits to the
2007 Aug 30
2
Announcing an updated document version (v1.7) of the Lustre 1.6 Operations Manual
Dear Lustre users,
CFS is pleased to announce an updated document version (v1.7) of the
Lustre? 1.6 Operations Manual, available in both PDF and HTML formats at
http://www.manual.lustre.org.
This edition of the Operations Manual includes the following enhancement:
* Addition of mballoc3 content to the Lustre Proc chapter
If you have any questions, suggestions, or recommended edits to the
2007 Nov 26
15
bad 1.6.3 striped write performance
Hi,
I''m seeing what can only be described as dismal striped write
performance from lustre 1.6.3 clients :-/
1.6.2 and 1.6.1 clients are fine. 1.6.4rc3 clients (from cvs a couple
of days ago) are also terrible.
the below shows that the OS (centos4.5/5) or fabric (gigE/IB) or lustre
version on the servers doesn''t matter - the problem is with the 1.6.3
and 1.6.4rc3 client kernels
2007 Nov 16
5
Lustre Debug level
Hi,
Lustre manual 1.6 v18 says that that in production lustre debug level
should be set to fairly low. Manual also says that I can verify that
level by running following commands:
# sysctl portals.debug
This gives ne following error
error: ''portals.debug'' is an unknown key
cat /proc/sys/lnet/debug
gives output:
ioctl neterror warning error emerg ha config console
cat
2007 Oct 25
1
Error message
I''m seeing this error message on one of my OSS''s but not the other
three. Any idea what is causing it?
Oct 25 13:58:56 oss2 kernel: LustreError:
3228:0:(client.c:519:ptlrpc_import_delay_req()) @@@ IMP_INVALID
req at f6b13200 x18040/t0 o101->MGS at MGC192.168.0.200@tcp_0:26 lens 176/184
ref 1 fl Rpc:/0/0 rc 0/0
Oct 25 13:58:56 oss2 kernel: LustreError:
2007 Oct 12
1
Update - ksym errors when installing modules in SLES10
I''ve just set up a small lustre installation on mixed SLES10 32-bit and
SLES10 SP1 64-bit. 32-bit would give ksym errors upon installation of
modules rpm, while 64-bit SP1 it was only warnings. I did a little
research on the ksym package and it seemed to me it was only needed if
you were trying to install a module compiled for a different kernel
version. Since CFS is providing a
2007 Nov 29
2
Balancing I/O Load
We are seeing some disturbing (probably due to our ignorance)
behavior from lustre 1.6.3 right now. We have 8 OSSs with 3 OSTs
per OSS (24 physical LUNs). We just created a brand new lustre file
system across this configuration using the default mkfs.lustre
formatting options. We have this file system mounted across 400
clients.
At the moment, we have 63 IOzone threads running
2007 Dec 13
1
MPI-Blast + Lustre
Anyone have any experience with MpiBlast and Lustre. We have
MpiBlast-1.4.0-pio and lustre-1.6.3 and we are seeing some pretty
poor performance with most of the mpiblast threads spending 20% to
50% of their time in disk wait. We have the genbank nt database
split into 24 fragments (one for each of our OSTs, 3 per OSS). The
individual fragments are not striped due to the
2007 Aug 31
2
sles10: installing module-binaries: "error: Failed dependencies: ksym(panic) .."
Hi, After some hard trying to use lustre on our debian servers with an up2date kernel I decided to buy sles10 to make everything more comftable: I installed (rpm -i) the new kernelbinaries from 1.6.2/sles10-i686 to a sles10 test box (AMD64). When i try to install the modules form 1.6.2/sles10-i686 it says : sles2:/usr/src/lustre # rpm -i
2007 Nov 12
8
More failover issues
In 1.6.0, when creating a MDT, you could specify multiple --mgsnode options
and it would failover between them. 1.6.3 only seems to take the last one
and --mgsnode=192.168.1.252 at o2ib:192.168.1.253 at o2ib doesn''t seem to failover
to the other node. Any ideas how to get around this?
Robert
Robert LeBlanc
College of Life Sciences Computer Support
Brigham Young University
leblanc at
2007 Nov 06
4
Checksum Algorithm
Hi,
We have seen a huge performance drop in 1.6.3, due to the checksum being enabled by default. I looked at the algorithm being used, and it is actually a CRC32, which is a very strong algorithm for detecting all sorts of problems, such as single bit errors, swapped bytes, and missing bytes.
I''ve been experimenting with using a simple XOR algorithm. I''ve been able to recover
2007 Nov 19
6
Dedicated MGS?
This may be in the documentation. If so, I missed it.
If a site has multiple Lustre file systems, the documentation implies
that there only needs to be a single MGS for an entire site
(regardless of the number of file systems). However, I also know
it is fairly common to have a combined MGS/MDT. So here are the
questions.
1. If we are going to have several Lustre file systems,
2004 Jan 11
3
Lustre 1.0.2 packages available
Greetings--
Packages for Lustre 1.0.2 are now available in the usual place
http://www.clusterfs.com/download.html
This bug-fix release resolves a number of issues, of which a few are
user-visible:
- the default debug level is now a more reasonable production value
- zero-copy TCP is now enabled by default, if your hardware supports it
- you should encounter fewer allocation failures
2004 Jan 11
3
Lustre 1.0.2 packages available
Greetings--
Packages for Lustre 1.0.2 are now available in the usual place
http://www.clusterfs.com/download.html
This bug-fix release resolves a number of issues, of which a few are
user-visible:
- the default debug level is now a more reasonable production value
- zero-copy TCP is now enabled by default, if your hardware supports it
- you should encounter fewer allocation failures
2007 Mar 20
15
How to bypass failed OST without blocking?
Hi
I want my lustre do such things during OST failed: if some file
has stripe data on th failed OST, any operation on the file will
return IO error without blocking, and also at this moment I can
create and read/write new file or read/write files which have no stripe
data on the failed OST without blocking.
What should I do ? How to configure?
thanks!
swin
-------------- next part
2008 Feb 14
2
kickstart file problem
I have a kickstart file that I am using to install multiple machines. If I install with no %post script, everything runs great. When I add the following %post section, if fails.
I have been working on this for a few days now without luck, Any help would be appreciated.
Here is the error, the script follows.
Traceback (most recent call first):
File
2008 Jan 02
9
lustre quota problems
Hello,
I''ve several problems with quota on our testcluster:
When I set the quota for a person to a given value (e.g. the values which
are provided in the operations manual), I''m able to write exact the amount
which is set with setquota.
But when I delete the files(file) I''m not able to use this space again.
Here is what I''ve done in detail:
lfs checkquota