thr3ads.net - Ocfs2 users - [Ocfs2-users] Diagnosing poor write performance [Mar 2016]

If this information is useful, please help other people find it:
Share via:

Eric Ren

2016-Mar-31 02:17 UTC

[Ocfs2-users] Diagnosing poor write performance

Hi,
>>
>> How did you perform the testing? It really matters. If you write a file
on
>> shared disk from one node, and read this file from another node,
without,
>> or with very little interval, the writing IO speed could decrease by
~20
>> times according my previous testing(just as a reference). It's a
extremely
>> bad situation for 2 nodes cluster, isn't?
>>
>> But it's incredible that in your case writing speed drop by
>3000 times!
>
>
> I simply used 'dd' to create a file with /dev/zero as a source. If
there is
> a better way to do this I am all ears.
Alright, you just did a local IO on ocfs2, then the performance 
shouldn't be that bad. I guess the ocfs2 volume has been used over 60%? 
or seriously fragmented?
Please give info with `df -h`, and super block with debugfs.ocfs2, and 
also the exact `dd` command you performed. Additionally, perform `dd` on 
each node.

You know, ocfs2 is a shared disk fs. So 3 basic testing cases I can 
think of are:
1. only one node of cluster do IO;
2. more than one nodes of cluster perform IO, but each nodes just 
read/write its own file on shared disk;
3. like 2), but some nodes read and some write a same file on shared disk.

The above model is much theoretically simplified though. The practical 
scenarios could be much more complicated, like fragmentation issue that 
your case much likely is.

>> Could you firstly do test on LVM, then DRBD, and then OCFS2? Let's
blame
>> on them more fairly.
>>
> If I do a similar write of a file to a directory that exists on a LVM LV I
> get roughly 100 megabytes/sec.
>
> I can't write straight to the DRBD device, as that would entail wiping
the
> customer's OCFS2 filesystem, which I cannot do.
OK, it's product environment. I can understand.

Eric

Graeme Donaldson

2016-Mar-31 05:52 UTC

head link

[Ocfs2-users] Diagnosing poor write performance

On 31 March 2016 at 04:17, Eric Ren <zren at suse.com> wrote:
> Hi,
>
>> How did you perform the testing? It really matters. If you write a file
on
>>> shared disk from one node, and read this file from another node,
without,
>>> or with very little interval, the writing IO speed could decrease
by ~20
>>> times according my previous testing(just as a reference). It's
a
>>> extremely
>>> bad situation for 2 nodes cluster, isn't?
>>>
>>> But it's incredible that in your case writing speed drop by
>3000 times!
>>>
>>
>>
>> I simply used 'dd' to create a file with /dev/zero as a source.
If there
>> is
>> a better way to do this I am all ears.
>>
>
> Alright, you just did a local IO on ocfs2, then the performance
shouldn't
> be that bad. I guess the ocfs2 volume has been used over 60%? or seriously
> fragmented?
> Please give info with `df -h`, and super block with debugfs.ocfs2, and
> also the exact `dd` command you performed. Additionally, perform `dd` on
> each node.
>
> You know, ocfs2 is a shared disk fs. So 3 basic testing cases I can think
> of are:
> 1. only one node of cluster do IO;
> 2. more than one nodes of cluster perform IO, but each nodes just
> read/write its own file on shared disk;
> 3. like 2), but some nodes read and some write a same file on shared disk.
>
> The above model is much theoretically simplified though. The practical
> scenarios could be much more complicated, like fragmentation issue that
> your case much likely is.

Here is all the output requested: http://pastebin.com/raw/BnJAQv9T

It's interesting to me that you guessed the usage is over 60%. It is indeed
sitting at 65%. Is the solution as simple as ensuring that an OCFS2
filesystem doesn't go over the 60% usage mark? Or am I getting ahead of
myself a little?

Thanks for your effort so far!

Graeme.
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
http://oss.oracle.com/pipermail/ocfs2-users/attachments/20160331/8d6b8bcc/attachment.html

Ocfs2 users - Mar 2016 - Diagnosing poor write performance

[Ocfs2-users] Diagnosing poor write performance

[Ocfs2-users] Diagnosing poor write performance