Displaying 20 results from an estimated 700 matches similar to: "SLES 11 SP1 Client rpms built but not working"
2008 Dec 24
6
Bug when using /dev/cciss/c0d2 as mdt/ost
I am trying to build lustre-1.6.6 against the pre-patched kernel downloaded
from SUN.
But as written in the Operations Manual, it creates rpms for
2.6.18-92.1.10.el5_lustrecustom. Is there a way to ask it not to append
"custom" as the extraversion?
Running kernel is 2.6.18-92.1.10.el5_lustre.1.6.6smp.
--
Regards--
Rishi Pathak
National PARAM Supercomputing Facility
Center for Development of Advanced
2010 Jun 22
7
lnet infiniband config
Hi all,
I'm getting my feet wet in the infiniband lake and of course I run into
some problems.
It would seem I got the compilation part of sles11 kernel 2.6.27 +
Lustre 1.8.3 + ofed 1.4.2 right, because it allows me to see and use the
infiniband fabric, and because ko2iblnd loads without any complaints.
In /etc/modprobe.d/lustre (this is a Debian system, hence this subdir of
2007 Nov 16
5
Lustre Debug level
Hi,
Lustre manual 1.6 v18 says that in production the lustre debug level
should be set fairly low. The manual also says that I can verify that
level by running the following commands:
# sysctl portals.debug
This gives the following error:
error: 'portals.debug' is an unknown key
cat /proc/sys/lnet/debug
gives output:
ioctl neterror warning error emerg ha config console
cat
2008 Jan 15
19
How do you make an MGS/OSS listen on 2 NICs?
I am running on CentOS 5 distribution without adding any updates from CentOS. I am using the lustre 1.6.4.1 kernel and software.
I have two NICs that run through different switches.
I have the lustre options in my modprobe.conf to look like this:
options lnet networks=tcp0(eth1,eth0)
My MGS seems to be only listening on the first interface however.
When I try and ping the 1st interface (eth1)
2013 Dec 17
2
Setting up a lustre zfs dual mgs/mdt over tcp - help requested
Hi all,
Here is the situation:
I have 2 nodes, MDS1 and MDS2 (10.0.0.22, 10.0.0.23), that I wish to use as
failover MGS, active/active MDT with zfs.
I have a jbod shelf with 12 disks, seen by both nodes as das (the
shelf has 2 sas ports, connected to a sas hba on each node), and I
am using lustre 2.4 on centos 6.4 x64
I have created 3 zfs pools:
1. mgs:
# zpool
2008 Apr 15
5
o2ib module prevents shutdown
Hello,
Not sure if this is the right forum: I'm encountering difficulties
with o2ib which prevents an LNET shutdown from proceeding:
Unloading OpenIB kernel modules:NET: Unregistered protocal family 27
Failed to unload rdma_cm
Failed to unload rdma_cm
Failed to unload ib_cm
Failed to unload ib_sa
LustreError: 131-3: Received notification of device removal
Please shutdown LNET
2012 Nov 02
3
lctl ping of Pacemaker IP
Greetings!
I am working with Lustre-2.1.2 on RHEL 6.2. First I configured it
using the standard defaults over TCP/IP. Everything worked very
nicely using a real, static --mgsnode=a.b.c.x value which was the
actual IP of the MGS/MDS system1 node.
I am now trying to integrate it with Pacemaker-1.1.7. I believe I
have most of the set-up completed with a particular exception. The
"lctl
2013 Apr 16
2
UID/GID access control in Lustre
Hello list members,
I started to develop a kernel module which hooks into Lustre 2.3 for
controlling data access based on nid and uid/gid. The background
is the following: Here at GSI we have currently a reserved uid/gid space
which partner institutes are using to access our exported Lustre mounts.
However, we currently have no mechanism to control (guarantee) that the
reserved uid/gid space are
2014 Mar 10
1
gfs2 and quotas - system crash
I have tried sending this before, but it did not appear to get through.
Hello,
When using gfs2 with quotas on a SAN that is providing storage to two
clustered systems running CentOS6.5, one of the systems
can crash. This crash appears to be caused when a user tries
to add something to a SAN disk when they have exceeded their
quota on that disk. Sometimes a stack trace is produced in
2008 Mar 07
2
Multihomed question: want Lustre over IB and Ethernet
Chris,
Perhaps you need to perform some write_conf like command. I'm not sure if this is needed in 1.6 or not.
Shane
----- Original Message -----
From: lustre-discuss-bounces at lists.lustre.org <lustre-discuss-bounces at lists.lustre.org>
To: lustre-discuss <lustre-discuss at lists.lustre.org>
Sent: Fri Mar 07 12:03:17 2008
Subject: Re: [Lustre-discuss] Multihomed
2010 Sep 22
5
Meaning of LND/neterrors ?
Hello
I've noticed that Lustre network errors, especially LND errors, are considered maskable errors.
That means that on a production node, where the debug mask is 0, those specific errors won't be displayed if they happen.
Does that mean that they are harmless?
Do upper-layers resend their RPC/packet if LNDs report an error?
When, in my case, o2iblnd says something like
2010 Sep 16
2
Lustre module not getting loaded in MDS
Hello All,
I have installed and configured Lustre 1.8.4 on SuSE 11.0 and everything
works fine if I run modprobe lustre and the lustre module gets
loaded. But when the server reboots, the module is not loaded. Kindly help.
Lnet is configured in /etc/modprobe.conf.local as below.
options lnet networks=tcp0(eth0) accept=all
For loading lustre module i tried including lustre module in
2007 Oct 15
3
iptables rules for lustre 1.6.x and MGS recovery procedures
Hi,
I would like to know what TCP/UDP ports I should keep open in my
firewall policies on my MGS server so that I can have my MGS server
firewalled. Also, in the event of loss of the MGT, would it be possible
to recreate the MGT without losing data or bringing the filesystem
down (i.e. by using cached information from the MDTs and OSTs)?
Thanks
Anand
2012 Dec 28
6
problem with installing lustre and ofed
Hello,
I am having trouble installing the server modules for lustre 2.1.4 together with Mellanox's OFED distribution so that we may use infiniband. Would you folks look at my procedure and results below and let me know what you think? Thanks very much!
The mellanox ofed installation builds and installs some kernel modules too, so I used this method to ensure OFED compiled against the correct
2010 Aug 11
3
Version mismatch of Lustre client and server
Hello,
I am planning on deploying a few more clients in my lustre environment and
was wondering which client version to install. I know it is okay to run a
newer client version than your lustre server for upgrade purposes. However,
would it be okay to be in this state for a longer period of time (for the
life of this filesystem)? My lustre server is currently running 1.8.1.1 on
RHEL 5.3 and I
2006 Sep 25
4
Re: [openib-general] problems with lustre o2ib module & ofed
It seems that lustre puts its modules in /lib/modules/2.6.16.21-0.8-default
despite the fact that my kernel is 2.6.16.21-0.8-smp !
uname -a
Linux n32 2.6.16.21-0.8-smp #4 SMP Sun Sep 24 08:47:30 BST 2006 i686 i686 i386 GNU/Linux
make[3]: Nothing to be done for `install-exec-am'.
/bin/sh ../../mkinstalldirs /lib/modules/2.6.16.21-0.8-default/kernel/fs/lustre
/usr/bin/install -c -m 644
2008 Feb 07
2
Lustre behaviour when multiple network paths are available?
Hi there,
When Lustre is configured in an environment where there are multiple paths
to the same destination of the same length (i.e. two paths, each one hop
away), which path(s) will be used for sending and receiving data?
I have my cluster configured with two OSTs with two GigE NICs in each. I am
seeing identical performance metrics when I use LACP to aggregate, and when
I use two separate
2012 Oct 19
6
Large Corosync/Pacemaker clusters
Hi,
We're setting up fairly large Lustre 2.1.2 filesystems, each with 18
nodes and 159 resources all in one Corosync/Pacemaker cluster as
suggested by our vendor. We're getting mixed messages from our vendor and
others on how large a Corosync/Pacemaker cluster can be and still work well.
1. Are there Lustre Corosync/Pacemaker clusters out there of this
size or larger?
2.
2008 Feb 04
32
Luster clients getting evicted
on our cluster that has been running lustre for about 1 month. I have
1 MDT/MGS and 1 OSS with 2 OSTs.
Our cluster uses all GigE and has about 608 nodes and 1854 cores.
We have a lot of jobs that die and/or go into high IO wait; strace
shows processes stuck in fstat().
The big problem (I think, and I would like some feedback on it) is that of
these 608 nodes, 209 of them have in dmesg