Displaying 20 results from an estimated 500 matches similar to: "lnet infiniband config"
2008 Mar 07
2
Multihomed question: want Lustre over IB and Ethernet
Chris,
Perhaps you need to perform some write_conf like command. I'm not sure if this is needed in 1.6 or not.
Shane
----- Original Message -----
From: lustre-discuss-bounces at lists.lustre.org <lustre-discuss-bounces at lists.lustre.org>
To: lustre-discuss <lustre-discuss at lists.lustre.org>
Sent: Fri Mar 07 12:03:17 2008
Subject: Re: [Lustre-discuss] Multihomed
2006 Sep 25
4
Re: [openib-general] problems with lustre o2ib module & ofed
It seems that lustre puts its modules in /lib/modules/2.6.16.21-0.8-default
despite the fact that my kernel is 2.6.16.21-0.8-smp !
uname -a
Linux n32 2.6.16.21-0.8-smp #4 SMP Sun Sep 24 08:47:30 BST 2006 i686 i686 i386 GNU/Linux
make[3]: Nothing to be done for `install-exec-am'.
/bin/sh ../../mkinstalldirs /lib/modules/2.6.16.21-0.8-default/kernel/fs/lustre
/usr/bin/install -c -m 644
2008 Apr 15
5
o2ib module prevents shutdown
Hello,
Not sure if this is the right forum: I'm encountering difficulties
with o2ib which prevents an LNET shutdown from proceeding:
Unloading OpenIB kernel modules:NET: Unregistered protocal family 27
Failed to unload rdma_cm
Failed to unload rdma_cm
Failed to unload ib_cm
Failed to unload ib_sa
LustreError: 131-3: Received notification of device removal
Please shutdown LNET
2012 Nov 02
3
lctl ping of Pacemaker IP
Greetings!
I am working with Lustre-2.1.2 on RHEL 6.2. First I configured it
using the standard defaults over TCP/IP. Everything worked very
nicely using a real, static --mgsnode=a.b.c.x value which was the
actual IP of the MGS/MDS system1 node.
I am now trying to integrate it with Pacemaker-1.1.7. I believe I
have most of the set-up completed with a particular exception. The
"lctl
2007 Dec 21
0
FW: faking IB multi-rail with multihomed clients
Guys,
For those of you not party to the original email exchange, this is
about how we can aggregate bandwidth across both rails of a dual-rail
IB cluster using current lustre/LNET (i.e. before we have implemented
transparent LNET support for failover and bandwidth aggregation across
multiple networks).
The following 2 points are fundamental - everything below is a direct
consequence...
1. LNET
2008 Jan 15
19
How do you make an MGS/OSS listen on 2 NICs?
I am running on CentOS 5 distribution without adding any updates from CentOS. I am using the lustre 1.6.4.1 kernel and software.
I have two NICs that run though different switches.
I have the lustre options in my modprobe.conf to look like this:
options lnet networks=tcp0(eth1,eth0)
My MGS seems to be only listening on the first interface however.
When I try and ping the 1st interface (eth1)
2008 Mar 11
2
Problems mounting lustre thru an ib2ip gateway
Hello,
I am trying to mount a lustre filesystem thru an ib2ip gateway.
The MDS's have infiniband connections. The client nodes are tcp/ip
connections. I am able to route between the client nodes and the MDS's.
I have the following in /etc/fstab:
abe-mds1 at o2ib0,abe-mds2 at o2ib0:/home/client /abehome lustre
_netdev,flock 0 0
I get the following when trying
2008 Feb 04
32
Lustre clients getting evicted
on our cluster that has been running lustre for about 1 month. I have
1 MDT/MGS and 1 OSS with 2 OSTs.
Our cluster uses all GigE and has about 608 nodes and 1854 cores.
We have a lot of jobs that die and/or go into high IO wait; strace
shows processes stuck in fstat().
The big problem (I think, and I would like some feedback on it) is that of
these 608 nodes, 209 of them have in dmesg
2013 Dec 17
2
Setting up a lustre zfs dual mgs/mdt over tcp - help requested
Hi all,
Here is the situation:
I have 2 nodes MDS1 , MDS2 (10.0.0.22 , 10.0.0.23) I wish to use as
failover MGS, active/active MDT with zfs.
I have a jbod shelf with 12 disks, seen by both nodes as das (the
shelf has 2 sas ports, connected to a sas hba on each node), and I
am using lustre 2.4 on centos 6.4 x64
I have created 3 zfs pools:
1. mgs:
# zpool
2008 Jan 02
9
lustre quota problems
Hello,
I've several problems with quota on our testcluster:
When I set the quota for a person to a given value (e.g. the values which
are provided in the operations manual), I'm able to write exactly the amount
which is set with setquota.
But when I delete the file(s), I'm not able to use this space again.
Here is what I've done in detail:
lfs checkquota
2013 Apr 16
2
UID/GID access control in Lustre
Hello list members,
I started to develop a kernel module which hooks into Lustre 2.3 for
controlling data access based on nid and uid/gid. The background
is the following: Here at GSI we have currently a reserved uid/gid space
which partner institutes are using to access our exported Lustre mounts.
However, we currently have no mechanism to control (guarantee) that the
reserved uid/gid space are
2012 Dec 28
6
problem with installing lustre and ofed
Hello,
I am having trouble installing the server modules for Lustre 2.1.4 with Mellanox's OFED distribution so that we can use InfiniBand. Would you folks look at my procedure and results below and let me know what you think? Thanks very much!
The mellanox ofed installation builds and installs some kernel modules too, so I used this method to ensure OFED compiled against the correct
2013 Mar 26
1
Lustre 2.2 with centos 6.3 gives problem while loading o2ib module for infiniband
Dear All,
we are facing problem while connecting o2ib module.
Lustre 2.2 with centos 6.3 gives problem while loading o2ib module for
infiniband.
Thanks in advance
Regards,
Faheem Patel
2013 Mar 18
1
OST0006 : inactive device
I installed 1 MDS , 2 OSS/OST and 2 Lustre Client. My MDS shows:
[code]
[root at MDS ~]# lctl list_nids
10.94.214.185 at tcp
[root at MDS ~]#
[/code]
On Lustre Client1:
[code]
[root at lustreclient1 lustre]# lfs df -h
UUID bytes Used Available Use% Mounted on
lustre-MDT0000_UUID 4.5G 274.3M 3.9G 6% /mnt/lustre[MDT:0]
lustre-OST0000_UUID
2007 Mar 20
15
How to bypass failed OST without blocking?
Hi
I want Lustre to behave as follows when an OST fails: if a file
has stripe data on the failed OST, any operation on that file should
return an I/O error without blocking, while I can still create and
read/write new files, or read/write files that have no stripe
data on the failed OST, without blocking.
What should I do? How do I configure this?
thanks!
swin
2013 Mar 18
1
lustre showing inactive devices
I installed 1 MDS , 2 OSS/OST and 2 Lustre Client. My MDS shows:
[code]
[root at MDS ~]# lctl list_nids
10.94.214.185 at tcp
[root at MDS ~]#
[/code]
On Lustre Client1:
[code]
[root at lustreclient1 lustre]# lfs df -h
UUID bytes Used Available Use% Mounted on
lustre-MDT0000_UUID 4.5G 274.3M 3.9G 6%
/mnt/lustre[MDT:0]
2007 Nov 06
4
Checksum Algorithm
Hi,
We have seen a huge performance drop in 1.6.3, due to the checksum being enabled by default. I looked at the algorithm being used, and it is actually a CRC32, which is a very strong algorithm for detecting all sorts of problems, such as single bit errors, swapped bytes, and missing bytes.
I've been experimenting with using a simple XOR algorithm. I've been able to recover
2007 Nov 12
8
More failover issues
In 1.6.0, when creating a MDT, you could specify multiple --mgsnode options
and it would failover between them. 1.6.3 only seems to take the last one
and --mgsnode=192.168.1.252 at o2ib:192.168.1.253 at o2ib doesn't seem to failover
to the other node. Any ideas how to get around this?
Robert
Robert LeBlanc
College of Life Sciences Computer Support
Brigham Young University
leblanc at
2008 Feb 14
9
how do you mount mountconf (i.e. 1.6) lustre on your servers?
As any of you using version 1.6 of Lustre know, Lustre servers can now
be started simply by mounting the devices they are using. Even
an /etc/fstab entry can be used if you can have the mount delayed until
the network is started.
Given this change, you will also have noticed that we have eliminated the
initscript for Lustre that used to exist for releases prior to 1.6.
I'd like to take a
2010 Apr 07
6
using ipoib with xcp
Hello,
I have been playing with the XCP for a while now, and must say I'm very
excited about the technology. I had no prior experience with Xen so it has
taken me a while to understand the concepts, but now I feel most important
issues are solved and I've purchased some hardware to build my (tiny) cloud
on.
The box is a Supermicro 1026TT-IBXF, so I have 2 x Ethernet and 1 x