Hi everyone,

on starting an OST I get this message in the log:

  Lustre: zn_atlas-OST0000: underlying device cciss/c1d0p1 should be tuned for larger I/O requests: max_sectors = 1024 could be up to max_hw_sectors=2048

/dev/cciss/c1d0p1 is a RAID6 on an HP Smart Array P800 controller with 12 750 GB SATA drives. The stripe size is 128 KB according to the hpacucli script. The OST was created with:

  mkfs.lustre --fsname=zn_atlas --ost \
    --mkfsoptions="-E stride=32 -E stripe-width=320 -J device=/dev/vg00/j-ost0 -i 1048576" \
    --mgsnode=192.168.224.2@o2ib1,141.34.218.7@tcp0 /dev/cciss/c1d0p1

What can I do?

Regards,
Götz Waschk
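For context, the stride and stripe-width values in that command follow from the reported controller geometry; a minimal sanity check, assuming the default 4 KB ldiskfs block size and 10 data disks in the 12-drive RAID6:

  CHUNK_KB=128        # controller stripe size reported by hpacucli
  BLOCK_KB=4          # assumed ldiskfs block size (the default)
  DATA_DISKS=10       # 12 drives minus 2 parity disks in RAID6
  echo "stride=$(( CHUNK_KB / BLOCK_KB ))"                      # 128 / 4 = 32
  echo "stripe-width=$(( CHUNK_KB / BLOCK_KB * DATA_DISKS ))"   # 32 * 10 = 320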
On Fri, 2009-04-17 at 13:08 +0200, Götz Waschk wrote:
> Lustre: zn_atlas-OST0000: underlying device cciss/c1d0p1 should be
> tuned for larger I/O requests: max_sectors = 1024 could be up to
> max_hw_sectors=2048
> What can I do?

IIRC, that's in reference to /sys/block/$device/queue/max_sectors_kb. If you inspect it, it should report 1024. You can simply echo a new value into it, the same way you can with /proc variables.

b.
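A minimal sketch of that tuning for the device above (assumptions: sysfs spells the cciss device as cciss!c1d0, replacing '/' with '!', and the queue settings live on the whole device rather than the partition; check the exact name with ls /sys/block):

  DEV='cciss!c1d0'
  cat /sys/block/$DEV/queue/max_sectors_kb        # current per-request limit
  cat /sys/block/$DEV/queue/max_hw_sectors_kb     # what the hardware allows
  # raise the current limit to the hardware limit
  cat /sys/block/$DEV/queue/max_hw_sectors_kb > /sys/block/$DEV/queue/max_sectors_kb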
On Fri, Apr 17, 2009 at 07:25:30AM -0400, Brian J. Murrell wrote:
>On Fri, 2009-04-17 at 13:08 +0200, Götz Waschk wrote:
>> Lustre: zn_atlas-OST0000: underlying device cciss/c1d0p1 should be tuned for larger I/O requests: max_sectors = 1024 could be up to max_hw_sectors=2048

we have a similar problem.

  Lustre: short-OST0001: underlying device md0 should be tuned for larger I/O requests: max_sectors = 1024 could be up to max_hw_sectors=1280

>> What can I do?
>IIRC, that's in reference to /sys/block/$device/queue/max_sectors_kb.
>If you inspect that it should report 1024. You can simply echo a new
>value into that the way you can with /proc variables.

sadly, that sys entry doesn't exist:

  cat: /sys/block/md0/queue/max_sectors_kb: No such file or directory

do you have any other suggestions? perhaps the devices below md need looking at? they all report /sys/block/sd*/queue/max_sectors_kb == 512. we have an md raid6 8+2.

  uname -a
  Linux sox2 2.6.18-92.1.10.el5_lustre.1.6.6.fixR5 #2 SMP Wed Feb 4 16:58:30 EST 2009 x86_64 x86_64 x86_64 GNU/Linux

(which is 1.6.6 + the patch from bz 15428 which is (I think) now in 1.6.7.1)

  cat /proc/mdstat
  ...
  md0 : active raid6 sdc[0] sdl[9] sdk[8] sdj[7] sdi[6] sdh[5] sdg[4] sdf[3] sde[2] sdd[1]
        5860595712 blocks level 6, 64k chunk, algorithm 2 [10/10] [UUUUUUUUUU]
        in: 64205147 reads, 97489370 writes; out: 3730773413 reads, 3281459807 writes
        2222983790 in raid5d, 498868 out of stripes, 4280451425 handle called
        reads: 0 for rmw, 709671189 for rcw. zcopy writes: 1573400576, copied writes: 20983045
        0 delayed, 0 bit delayed, 0 active, queues: 0 in, 0 out
        0 expanding overlap

cheers,
robin
On Apr 17, 2009 08:40 -0400, Robin Humble wrote:
> we have a similar problem.
> Lustre: short-OST0001: underlying device md0 should be tuned for larger I/O requests: max_sectors = 1024 could be up to max_hw_sectors=1280
>
> sadly, that sys entry doesn't exist:
> cat: /sys/block/md0/queue/max_sectors_kb: No such file or directory
>
> do you have any other suggestions?
> perhaps the devices below md need looking at?
> they all report /sys/block/sd*/queue/max_sectors_kb == 512.
> we have an md raid6 8+2.

Since MD RAID is really composed of underlying disks, and doing the mapping from /dev/md0 -> /sys/block/sd* is difficult, mount.lustre can't do the tuning itself. Instead, you should add a few lines to /etc/rc.local like:

  for DEV in sdc sdl sdk sdj sdi sdh sdg sdf sde sdd; do
      cp /sys/block/$DEV/queue/{max_hw_sectors_kb,max_sectors_kb}
  done

> uname -a
> Linux sox2 2.6.18-92.1.10.el5_lustre.1.6.6.fixR5 #2 SMP Wed Feb 4 16:58:30 EST 2009 x86_64 x86_64 x86_64 GNU/Linux
> (which is 1.6.6 + the patch from bz 15428 which is (I think) now in 1.6.7.1)
>
> cat /proc/mdstat
> ...
> md0 : active raid6 sdc[0] sdl[9] sdk[8] sdj[7] sdi[6] sdh[5] sdg[4] sdf[3] sde[2] sdd[1]
>       5860595712 blocks level 6, 64k chunk, algorithm 2 [10/10] [UUUUUUUUUU]
>       in: 64205147 reads, 97489370 writes; out: 3730773413 reads, 3281459807 writes
>       2222983790 in raid5d, 498868 out of stripes, 4280451425 handle called
>       reads: 0 for rmw, 709671189 for rcw. zcopy writes: 1573400576, copied writes: 20983045
>       0 delayed, 0 bit delayed, 0 active, queues: 0 in, 0 out
>       0 expanding overlap
>
> cheers,
> robin

Cheers, Andreas
--
Andreas Dilger
Sr. Staff Engineer, Lustre Group
Sun Microsystems of Canada, Inc.
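A variation on the same idea, not from the thread itself: the member names can be derived from sysfs instead of being hardcoded, assuming the kernel exposes /sys/block/md0/slaves and the array members are whole disks, as in the mdstat output above:

  MD=md0
  for LINK in /sys/block/$MD/slaves/*; do
      DEV=$(basename $LINK)                      # e.g. sdc, sdd, ...
      # copy the hardware limit into the current per-request limit
      cat /sys/block/$DEV/queue/max_hw_sectors_kb > /sys/block/$DEV/queue/max_sectors_kb
  done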
Even if max_hw_sectors_kb=32767?

--
Stuart Midgley Ph.D.
Manager Production Services
DownUnder GeoSolutions
stuartm at dugeo.com

On 18/04/2009, at 3:06, Andreas Dilger <adilger at sun.com> wrote:

> Since MD RAID is really composed of underlying disks, and doing the
> mapping from /dev/md0 -> /sys/block/sd* is difficult, mount.lustre
> can't do the tuning itself. Instead, you should add a few lines to
> /etc/rc.local like:
>
> for DEV in sdc sdl sdk sdj sdi sdh sdg sdf sde sdd; do
>     cp /sys/block/$DEV/queue/{max_hw_sectors_kb,max_sectors_kb}
> done
>
> Cheers, Andreas
> --
> Andreas Dilger
> Sr. Staff Engineer, Lustre Group
> Sun Microsystems of Canada, Inc.