Displaying 20 results from an estimated 200 matches similar to: "obdfilter/datafs-OST0000/recovery_status"
2010 Aug 14
0
Lost OSTs, remounted, now /proc/fs/lustre/obdfilter/$UUID/ is empty
Hello,
We had a problem with our disk controller that required a reboot. 2 of
our OSTs remounted and went through the recovery window but clients
hang trying to access them. Also /proc/fs/lustre/obdfilter/$UUID/ is
empty for that OST UUID.
LDISKFS FS on dm-5, internal journal on dm-5:8
LDISKFS-fs: delayed allocation enabled
LDISKFS-fs: file extents enabled
LDISKFS-fs: mballoc enabled
2007 Aug 30
2
Announcing an updated document version (v1.7) of the Lustre 1.6 Operations Manual
Dear Lustre users,
CFS is pleased to announce an updated document version (v1.7) of the
Lustre? 1.6 Operations Manual, available in both PDF and HTML formats at
http://www.manual.lustre.org.
This edition of the Operations Manual includes the following enhancement:
* Addition of mballoc3 content to the Lustre Proc chapter
If you have any questions, suggestions, or recommended edits to the
2007 Aug 30
2
Announcing an updated document version (v1.7) of the Lustre 1.6 Operations Manual
Dear Lustre users,
CFS is pleased to announce an updated document version (v1.7) of the
Lustre? 1.6 Operations Manual, available in both PDF and HTML formats at
http://www.manual.lustre.org.
This edition of the Operations Manual includes the following enhancement:
* Addition of mballoc3 content to the Lustre Proc chapter
If you have any questions, suggestions, or recommended edits to the
2007 Aug 30
2
Announcing an updated document version (v1.7) of the Lustre 1.6 Operations Manual
Dear Lustre users,
CFS is pleased to announce an updated document version (v1.7) of the
Lustre? 1.6 Operations Manual, available in both PDF and HTML formats at
http://www.manual.lustre.org.
This edition of the Operations Manual includes the following enhancement:
* Addition of mballoc3 content to the Lustre Proc chapter
If you have any questions, suggestions, or recommended edits to the
2010 Sep 13
2
1.8.4 and write-through cache
Afternoon
I upgraded our oss''s from 1.8.3 to 1.8.4 on Saturday (due to
https://bugzilla.lustre.org/show_bug.cgi?id=22755) and suffered a
great deal of pain.
We have 30 oss''s of multiple vintages. The basic difference between them is
* md on first 20 nodes
* 3ware 9650SE ML12 on last 10 nodes
After the upgrade to 1.8.4 we were seeing terrible throughput on the
nodes with
2008 Feb 12
0
Lustre-discuss Digest, Vol 25, Issue 17
Hi,
i just want to know whether there are any alternative file systems for HP SFS.
I heard that there is Cluster Gateway from Polyserve. Can anybody plz help me in finding more abt this Cluster Gateway.
Thanks and Regards,
Ashok Bharat
-----Original Message-----
From: lustre-discuss-bounces at lists.lustre.org on behalf of lustre-discuss-request at lists.lustre.org
Sent: Tue 2/12/2008 3:18 AM
2007 Nov 07
9
How To change server recovery timeout
Hi,
Our lustre environment is:
2.6.9-55.0.9.EL_lustre.1.6.3smp
I would like to change recovery timeout from default value 250s to
something longer
I tried example from manual:
set_timeout <secs> Sets the timeout (obd_timeout) for a server
to wait before failing recovery.
We performed that experiment on our test lustre installation with one
OST.
storage02 is our OSS
[root at
2010 Aug 06
1
Depreciated client still shown on OST exports
Some clients have been removed several weeks ago but are still listed in:
ls -l /proc/fs/lustre/obdfilter/*/exports/
This was found after tracing back mystery tcp packets to the OSS.
Although this is causing no damage, it raises the question of when
former clients will be cleared from the OSS. Is there a way to manually
remove these exports from the OSS?
--
Regards,
David
2010 Aug 11
3
Failure when mounting Lustre
Hi,
I get the following error when I try to mount lustre on the clients.
Permanent disk data:
Target: lustre-OSTffff
Index: unassigned
Lustre FS: lustre
Mount type: ldiskfs
Flags: 0x72
(OST needs_index first_time update )
Persistent mount opts: errors=remount-ro,extents,mballoc
Parameters: mgsnode=164.107.119.231 at tcp
sh: losetup: command not found
mkfs.lustre: error 32512 on losetup:
2008 Jan 10
4
1.6.4.1 - active client evicted
Hi!
We''ve started to poke and prod at Lustre 1.6.4.1, and it seems to
mostly work (we haven''t had it OOPS on us yet like the earlier
1.6-versions did).
However, we had this weird incident where an active client (it was
copying 4GB files and running ls at the time) got evicted by the MDS
and all OST''s. After a while logs indicate that it did recover the
connection
2010 Jul 13
4
Enable async journals
Hi all,
we use SLES 11 and Lustre 1.8.1.1 + patches and like convert a lustre FS
using external journals to one with async journals enabled.
Question is whether the procedure:
umount <filesystem> on all clients
umount <osts> on all OSSes
e2fsck <ost-device> on all OSSes for all all OSTs
tune2fs -O ^has_journal <ost-device> on all
2013 Oct 24
0
Re: [zfs-discuss] Problems getting Lustre started with ZFS
> You need to use unique index numbers for each OST, i.e. OST0000,
> OST00001, etc.
I cannot see how to control this? I am creating new OST''s but they are
all getting the same index number.
Could this be a problem with the mgs?
Thanks,
Andrew
>
> Ned
>
> To unsubscribe from this group and stop receiving emails from it, send an email to
2008 Mar 07
2
Multihomed question: want Lustre over IB andEthernet
Chris,
Perhaps you need to perform some write_conf like command. I''m not sure if this is needed in 1.6 or not.
Shane
----- Original Message -----
From: lustre-discuss-bounces at lists.lustre.org <lustre-discuss-bounces at lists.lustre.org>
To: lustre-discuss <lustre-discuss at lists.lustre.org>
Sent: Fri Mar 07 12:03:17 2008
Subject: Re: [Lustre-discuss] Multihomed
2008 Feb 14
9
how do you mount mountconf (i.e. 1.6) lustre on your servers?
As any of you using version 1.6 of Lustre knows, Lustre servers can now
be started simply my mounting the devices it is using. Even
an /etc/fstab entry can be used if you can have the mount delayed until
the network is started.
Given this change, you have also notices that we have eliminated the
initscript for Lustre that used to exist for releases prior to 1.6.
I''d like to take a
2008 Mar 06
0
oss umount hangs forever
Hello,
I''m not sure about this, when a device is set read-only,
are journal commit still allowed then, or is this the reason,
why the umount hangs forever?
[44825.302262] LustreError: Skipped 572 previous similar messages
[44882.668079] Lustre: Failing over pfs1work-OST0026
[44882.674578] Lustre: *** setting obd pfs1work-OST0026 device ''unknown-block(9,7)'' read-only
2014 Nov 13
0
OST acting up
whoops, sent from wrong email address, form right address now:
Hello,
I am using Lustre 2.4.2 and have an OST that doesn't seem to be written to.
When I check the MDS with 'lctl dl' I do not see that OST in the list.
However when I check the OSS that OST belongs to I can see it is mounted
and up;
0 UP osd-zfs l2-OST0003-osd l2-OST0003-osd_UUID 5
3 UP obdfilter l2-OST0003
2013 Feb 12
2
Lost folders after changing MDS
OK, so our old MDS had hardware issues so I configured a new MGS / MDS on a VM (this is a backup lustre filesystem and I wanted to separate the MGS / MDS from OSS of the previous), and then did this:
For example:
mount -t ldiskfs /dev/old /mnt/ost_old
mount -t ldiskfs /dev/new /mnt/ost_new
rsync -aSv /mnt/ost_old/ /mnt/ost_new
# note trailing slash on ost_old/
If you are unable to connect both
2007 Nov 12
8
More failover issues
In 1.6.0, when creating a MDT, you could specify multiple --mgsnode options
and it would failover between them. 1.6.3 only seems to take the last one
and --mgsnode=192.168.1.252 at o2ib:192.168.1.253 at o2ib doesn''t seem to failover
to the other node. Any ideas how to get around this?
Robert
Robert LeBlanc
College of Life Sciences Computer Support
Brigham Young University
leblanc at
2010 Jul 07
0
How to evict a dead client?
Dear, everyone
We have stuck with the problem that the OSS connect one dead client or one with changed IP address all the time until we reboot the dead client. From the OSS log message, we can get the information as follows:
Jul 7 14:45:07 com01 kernel: Lustre: 12180:0:(socklnd_cb.cLustre: 12180:0:(socklnd_cb.c:915:ksocknal_launch_packet()) No usable routes to 12345-202.Lustre:
2008 Jan 02
9
lustre quota problems
Hello,
I''ve several problems with quota on our testcluster:
When I set the quota for a person to a given value (e.g. the values which
are provided in the operations manual), I''m able to write exact the amount
which is set with setquota.
But when I delete the files(file) I''m not able to use this space again.
Here is what I''ve done in detail:
lfs checkquota