thr3ads.net - similar to: "lustre showing inactive devices"

Displaying 20 results from an estimated 200 matches similar to: "lustre showing inactive devices"

2013 Mar 18

OST0006 : inactive device

I installed 1 MDS , 2 OSS/OST and 2 Lustre Client. My MDS shows: [code] [root at MDS ~]# lctl list_nids 10.94.214.185 at tcp [root at MDS ~]# [/code] On Lustre Client1: [code] [root at lustreclient1 lustre]# lfs df -h UUID bytes Used Available Use% Mounted on lustre-MDT0000_UUID 4.5G 274.3M 3.9G 6% /mnt/lustre[MDT:0] lustre-OST0000_UUID

No space left on device on not full filesystem

2010 Jul 08

No space left on device on not full filesystem

Hello, We have running lustre 1.8.1 and have met "No space lest on device" error when uploading 500 Gb small files (less then 100 Kb each). The problem seems to depends on the number of files. If we remove one file, we can create one new file, even with Gb size; but if we haven''t remove something we can''t create even very little file, as an example using touch

Bad reporting inodes free

2012 Sep 27

Bad reporting inodes free

Hello, When I run a "df -i" in my clients I get 95% indes used or 5% inodes free: Filesystem Inodes IUsed IFree IUse% Mounted on lustre-mds-01:lustre-mds-02:/cetafs 22200087 20949839 1250248 95% /mnt/data But if I run lfs df -i i get: UUID Inodes IUsed IFree I

Lost folders after changing MDS

2013 Feb 12

Lost folders after changing MDS

OK, so our old MDS had hardware issues so I configured a new MGS / MDS on a VM (this is a backup lustre filesystem and I wanted to separate the MGS / MDS from OSS of the previous), and then did this: For example: mount -t ldiskfs /dev/old /mnt/ost_old mount -t ldiskfs /dev/new /mnt/ost_new rsync -aSv /mnt/ost_old/ /mnt/ost_new # note trailing slash on ost_old/ If you are unable to connect both

OSTs inactive on one client (only)

2013 Apr 29

OSTs inactive on one client (only)

Hi everyone, I have seen this question here before, but without a very satisfactory answer. One of our half a dozen clients has lost access to a set of OSTs: > lfs osts OBDS:: 0: lustre-OST0000_UUID ACTIVE 1: lustre-OST0001_UUID ACTIVE 2: lustre-OST0002_UUID INACTIVE 3: lustre-OST0003_UUID INACTIVE 4: lustre-OST0004_UUID INACTIVE 5: lustre-OST0005_UUID ACTIVE 6: lustre-OST0006_UUID ACTIVE

Speeding up configuration log regeneration?

2013 Oct 17

Speeding up configuration log regeneration?

Hi, We run four-node Lustre 2.3, and I needed to both change hardware under MGS/MDS and reassign an OSS ip. Just the same, I added a brand new 10GE network to the system, which was the reason for MDS hardware change. I ran tunefs.lustre --writeconf as per chapter 14.4 in Lustre Manual, and everything mounts fine. Log regeneration apparently works, since it seems to do something, but

How to remove OST permanently?

2007 Nov 23

How to remove OST permanently?

All, I''ve added a new 2.2 TB OST to my cluster easily enough, but this new disk array is meant to replace several smaller OSTs that I used to have of which were only 120 GB, 500 GB, and 700 GB. Adding an OST is easy, but how do I REMOVE the small OSTs that I no longer want to be part of my cluster? Is there a command to tell luster to move all the file stripes off one of the nodes?

lustre quota problems

2008 Jan 02

lustre quota problems

Hello, I''ve several problems with quota on our testcluster: When I set the quota for a person to a given value (e.g. the values which are provided in the operations manual), I''m able to write exact the amount which is set with setquota. But when I delete the files(file) I''m not able to use this space again. Here is what I''ve done in detail: lfs checkquota

strange lustre errors

2008 Mar 06

strange lustre errors

Hi, On a few of the hpc cluster nodes, i am seeing a new lustre error that is pasted below. The volumes are working fine and there is nothing on the oss and mds to report. LustreError: 5080:0:(import.c:607:ptlrpc_connect_interpret()) data3-OST0000_UUID at 192.168.2.98@tcp changed handle from 0xfe51139158c64fae to 0xfe511392a35878b3; copying, but this may foreshadow disaster

1.6.4.1 - active client evicted

2008 Jan 10

1.6.4.1 - active client evicted

Hi! We''ve started to poke and prod at Lustre 1.6.4.1, and it seems to mostly work (we haven''t had it OOPS on us yet like the earlier 1.6-versions did). However, we had this weird incident where an active client (it was copying 4GB files and running ls at the time) got evicted by the MDS and all OST''s. After a while logs indicate that it did recover the connection

How To change server recovery timeout

2007 Nov 07

How To change server recovery timeout

Hi, Our lustre environment is: 2.6.9-55.0.9.EL_lustre.1.6.3smp I would like to change recovery timeout from default value 250s to something longer I tried example from manual: set_timeout <secs> Sets the timeout (obd_timeout) for a server to wait before failing recovery. We performed that experiment on our test lustre installation with one OST. storage02 is our OSS [root at

bad 1.6.3 striped write performance

2007 Nov 26

bad 1.6.3 striped write performance

Hi, I''m seeing what can only be described as dismal striped write performance from lustre 1.6.3 clients :-/ 1.6.2 and 1.6.1 clients are fine. 1.6.4rc3 clients (from cvs a couple of days ago) are also terrible. the below shows that the OS (centos4.5/5) or fabric (gigE/IB) or lustre version on the servers doesn''t matter - the problem is with the 1.6.3 and 1.6.4rc3 client kernels

Need to unmount an LV from host system

2013 Mar 05

Need to unmount an LV from host system

Greetings - Ok, I made a mistake that I need to fix. Fortunately it is not a destructive mistake, but I need some advice on how to correct the problem. CentOS 6.3 host system named Earth I was creating some new logical volumes within my exiting volume group for a new virtual machine using the LVM GUI. When I created the LV that I plan to use for root partition of the new VM (Bacteria) I

df -h shows system at 100% used, but there is space left

2009 Apr 24

df -h shows system at 100% used, but there is space left

Has anyone seen output like this before? Notice on my /dev/md0 That it reports 100% used, but also 5.3 Gb available. What would cause this? # df -h Filesystem Size Used Avail Use% Mounted on /dev/sda2 71G 1.6G 66G 3% / /dev/sda1 99M 22M 73M 23% /boot tmpfs 506M 0 506M 0% /dev/shm /dev/md0 903G 851G 5.3G 100%

Strange Behavior using FUSE client

2011 Jul 15

Strange Behavior using FUSE client

I've recently setup a distributed/replicated cluster and have had an issue with seeing the directories on the cluster. Also, a df -h only shows data from one of the three bricks. The strange behavior doesn't end there. If I log into the 'primary' server as root, then do an ls on the client, the directories appear. However, df -h is still incorrect. I'm not sure exactly

Luster clients getting evicted

2008 Feb 04

Luster clients getting evicted

on our cluster that has been running lustre for about 1 month. I have 1 MDT/MGS and 1 OSS with 2 OST''s. Our cluster uses all Gige and has about 608 nodes 1854 cores. We have allot of jobs that die, and/or go into high IO wait, strace shows processes stuck in fstat(). The big problem is (i think) I would like some feedback on it that of these 608 nodes 209 of them have in dmesg

Determine available disk space for Win 95

2001 Nov 12

Determine available disk space for Win 95

I have a mixture of Win95 and Win98 computers. From the Samba host I mount the Win95/98 host using a script containing commands similar to the following text. mount -t smbfs //pent14/pent14-e /dosworld/pent14/e -o username=fred\%xxxxxxx After I have mounted all the drives I issue the following command. df -h Results show the available space on the Win98 hosts properly, however all the

Lustre-discuss Digest, Vol 25, Issue 17

2008 Feb 12

Lustre-discuss Digest, Vol 25, Issue 17

Hi, i just want to know whether there are any alternative file systems for HP SFS. I heard that there is Cluster Gateway from Polyserve. Can anybody plz help me in finding more abt this Cluster Gateway. Thanks and Regards, Ashok Bharat -----Original Message----- From: lustre-discuss-bounces at lists.lustre.org on behalf of lustre-discuss-request at lists.lustre.org Sent: Tue 2/12/2008 3:18 AM

ocfs doesn't free space?

2004 Sep 01

ocfs doesn't free space?

an ocfs-volume was nearly full (only 800MB free). i deleted some datafiles to free space: $ df -h . Filesystem Size Used Avail Use% Mounted on /dev/sdp1 10G 5.3G 4.8G 53% /db/DPS so there are more than 4GB available. $ sqlplus /nolog SQL*Plus: Release 9.2.0.4.0 - Production on Wed Sep 1 12:57:48 2004 Copyright (c) 1982, 2002, Oracle Corporation. All rights

lustre error

2008 Feb 22

lustre error

Dear All, Yesterday evening or cluster has stopped. Two of our nodes tried to take the resource from each other, they haven''t seen the other side, if I saw well. I stopped heartbeat, resources, start it again, and back to online, worked fine. This morning I saw this in logs: Feb 22 03:25:07 node4 kernel: Lustre: 7:0:(linux-debug.c:98:libcfs_run_upcall()) Invoked LNET upcall

similar to: lustre showing inactive devices