Hello!
It recently struck me how unfair many file creation tests are
to Lustre. The problem lies only with the tests, because they portray
Lustre creation rates as lower than the actual speed an application
would see. (The same goes for opens, btw.)
The problem is that a typical create-rate test does an open-close
sequence in a loop, and close is a synchronous RPC on Lustre in the
majority of cases.
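For reference, a minimal sketch of the kind of loop I mean (generic,
not any particular benchmark); with a synchronous close, every
iteration pays for a second round trip to the MDS:

    #include <fcntl.h>
    #include <stdio.h>
    #include <unistd.h>

    int main(void)
    {
        char name[64];
        int i, fd;

        for (i = 0; i < 10000; i++) {
            snprintf(name, sizeof(name), "file.%d", i);
            fd = open(name, O_CREAT | O_WRONLY, 0644); /* create: one MDS RPC */
            if (fd < 0)
                return 1;
            close(fd); /* on Lustre: a second, synchronous MDS RPC */
        }
        return 0;
    }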
By making close asynchronous we would make Lustre appear faster
at creates by removing all of that overhead. Of course no real
application would benefit, because I am not aware of any with a tight
open-close loop. Real applications open some files at some point, do
I/O for some extended time, and only then do the closes happen.
I know this idea was considered in some of the CMD cases, but it
did not pan out for some reason (I am not familiar with that
implementation).
Anyway, I performed a test on the ORNL Jaguar system, running an
application that creates 10000 files (open-creat, with the
O_LOV_DELAY_CREATE flag to reduce OST influence, since we are working
separately on addressing that) and then closes those 10000 files, all
in two timed loops. The app was run at a scale of 1 to 64 clients (in
power-of-2 increments).
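Roughly, the timed part looked like this (a sketch, not the actual
test source; O_LOV_DELAY_CREATE comes from <lustre/lustre_user.h>,
and the fd limit has to be raised to hold 10000 open files at once):

    #include <sys/time.h>
    #include <fcntl.h>
    #include <stdio.h>
    #include <unistd.h>
    #include <lustre/lustre_user.h> /* O_LOV_DELAY_CREATE */

    #define NFILES 10000

    static double now(void)
    {
        struct timeval tv;
        gettimeofday(&tv, NULL);
        return tv.tv_sec + tv.tv_usec / 1e6;
    }

    int main(void)
    {
        static int fd[NFILES];
        char name[64];
        double t0, t1, t2;
        int i;

        t0 = now();
        for (i = 0; i < NFILES; i++) { /* timed loop 1: creates/opens */
            snprintf(name, sizeof(name), "file.%d", i);
            fd[i] = open(name, O_CREAT | O_WRONLY | O_LOV_DELAY_CREATE, 0644);
        }
        t1 = now();
        for (i = 0; i < NFILES; i++) /* timed loop 2: closes */
            close(fd[i]);
        t2 = now();

        printf("opens: %f sec, closes: %f sec\n", t1 - t0, t2 - t1);
        return 0;
    }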
From the test it is easily observable that the closes add roughly
50% on top of the open time, cutting the apparent creation rate by
about a third.
E.g. at scale 1, 10k opens take 1.946946 sec and the 10k subsequent
closes take 1.031471 sec (5136 real creates/sec vs. 3357 creates/sec
as "reported by the usual test").
At scale 8, 80k opens take 6.21 sec and the 80k subsequent closes
take 3.51 sec (12800 real creates/sec vs. 8230 creates/sec as
"reported by the usual test").
Now of course, even if we make closes completely asynchronous,
they would still be competing with opens for CPU on the MDS, still
inducing some penalty. So for this type of test we would ideally want
all closes to go to some separate portal with only one handling
thread, to minimize CPU consumption. But that is not really ideal for
real workloads, of course; there the real impact could be made by
NRS, where opens from a job would get prioritized ahead of closes
from the same job.
Anyway, I am thinking it is a good idea to implement async closes,
if only to make us look better (read: more realistic) in these tests.
And for a proper implementation to work, we need to get rid of the
close-sending serialization (since spawning a separate close thread
for every close would be stupid).
I think the close serialization is not needed anyway. If a close
reply was lost, the close would be resent, and we can just suppress
the resulting error, seeing how the resent close merely tried to
close a nonexistent handle. On recovery we care even less: there is
nothing to close after a server restart.
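For the suppression, I imagine something along these lines on the
client side (a pure sketch: the struct, the resend marker, and the
-ESTALE value are stand-ins for whatever the real request flag and
server return code turn out to be):

    #include <errno.h>

    struct close_req {
        int resent; /* hypothetical: set when this close RPC was resent */
    };

    /* Interpret the server reply to an (async, unserialized) close.
     * If the RPC was a resend, the first attempt may already have
     * closed the handle on the MDS, so "no such handle" is not a
     * real error and should be suppressed. */
    static int close_interpret(struct close_req *req, int rc)
    {
        if (rc == -ESTALE && req->resent)
            rc = 0; /* the original close already did the work */
        return rc;
    }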
(I am not sure what SOM implications that might have, but I
suspect none: there is some extra state in the mfd that could tell us
whether we already executed this close, and we can probably
reconstruct the necessary reply state for the resend from it.
Vitaly?)
Any comments or concerns from anyone?
Bye,
Oleg