On Mon, 2006-08-07 at 12:19 -0700, Brian McCallister wrote:
> I am trying to understand why Mongrel so forcefully disables http
> pipelining. The docs say because the spec is unclear, and it hurts
> performance. These reasons smell... wrong. The HTTP spec is pretty
> clear, and, er, I cannot find anywhere else that claims there is a
> performance drawback, and lots of studies (and personal benchmarks
> across years of writing webapps) showing how much it helps.
>
The problem is related to performance, resources, and usage.
First, Ruby's IO subsystem isn't that great at processing HTTP-style
protocols, since you have to parse off chunks, then parse more chunks,
and because there's no decent ring buffer this requires tons of string
creation. I've worked on this a bit, but it's a real pain, so I focused
on just making Mongrel work well in the simple case.
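To make that concrete, here's a rough sketch (not Mongrel's actual
parser; the 1024-byte read size is arbitrary) of how naive HTTP reading
in Ruby piles up String objects because there's no ring buffer to parse
into:

  require 'socket'

  # Every readpartial allocates a fresh String, and every << can force a
  # reallocation and copy of the growing buffer; the split calls below
  # allocate yet more Strings per request.
  def read_request_head(sock)
    buffer = ""
    buffer << sock.readpartial(1024) until buffer.include?("\r\n\r\n")
    head, _rest = buffer.split("\r\n\r\n", 2)
    head.split("\r\n")
  end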
Second, Ruby only has 1024 file descriptors available for *all* files;
in practical usage on a Rails server that's about 500 sockets before the
server tanks badly. Allowing clients to keep sockets open means clients
can very easily crash Ruby just by never closing them. As it is now,
Mongrel has to boot clients that take too long in order to keep service
levels high, and it would get much more complex in a pipeline/keepalive
situation where the sockets are kept open. Throw in threading issues
around Rails, socket and file usage by random authors, and problems with
how pipelined resources are dealt with (not by Mongrel, but by the
frameworks) and you've got a total mess. It's just simpler to process
one request and go away.
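For illustration only (this is not Mongrel's code; the 10-second window
and 16KB read size are made-up numbers), the "boot clients that take too
long, then close" pattern looks roughly like this:

  require 'socket'

  # Give the client a short window to deliver its request, then close the
  # socket no matter what so the descriptor goes back into the pool.
  def handle_client(client, patience = 10)
    if IO.select([client], nil, nil, patience)
      request = client.readpartial(16 * 1024)
      # ... parse it and hand it to the framework ...
    end
  ensure
    client.close unless client.closed?   # one request, then go away
  end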
Third, Mongrel is more often used behind another, more capable server,
and off localhost. Mongrel's not intended to be a full-blown server, but
rather just small enough and fast enough to get a Ruby application
going. Rather than waste a lot of resources on making Mongrel handle
all the nuances of the HTTP RFC, I implemented what worked the fastest
in *this* situation.
This also indirectly helps with a common queuing problem where a series
of pipelined requests causes one backend to be taken over by a single
client, thus shutting out many others. It turns out that in a clustering
situation most of the requests Mongrel handles are better off sprayed
around to multiple servers so that all clients get a fair chance at
service.
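That "spraying" is nothing fancier than round-robin; a toy sketch with
made-up backend addresses:

  # Each request goes to the next backend in the list, so a single client
  # can't pin one Mongrel with a long run of requests.
  BACKENDS = %w[127.0.0.1:8000 127.0.0.1:8001 127.0.0.1:8002]  # hypothetical cluster
  picker = BACKENDS.cycle

  6.times { puts picker.next }   # 8000, 8001, 8002, 8000, 8001, 8002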
Those would be the reasons right now. Things may change in the future
when the technology landscape for Mongrel changes, but until then it's
enough work to just get this simplest case going well.
> The only common case I can think of for getting a possible
> performance boost from forcing a connection close is if you know with
> certainty that there are no followup resource requests to the same
> domain, and the cost of maintaining connection state in memory is too
> high for the app server. This holds true for folks like Yahoo! or
> whatnot who use a CDN for resources (and use pipelining on the CDN
> connections) and separate app servers for the dynamic page elements,
> but... it seems to be a strange assumption for a web server to force
> on users.
>
Again, forcing the connection closed works better in this situation
because it's expected to work on localhost, and there's no statistically
significant difference in performance in that situation.
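If you want to check that for yourself, here's a minimal benchmark
sketch (the host, port, and request count are placeholders, and it uses
plain Net::HTTP rather than anything Mongrel-specific): it hits a local
server once over a single reused connection and once with a fresh
connection per request:

  require 'net/http'
  require 'benchmark'

  HOST, PORT, N = 'localhost', 3000, 500   # placeholder target and count

  Benchmark.bm(12) do |bm|
    # One persistent connection reused for every request.
    bm.report('keep-alive:') do
      Net::HTTP.start(HOST, PORT) { |http| N.times { http.get('/') } }
    end
    # A brand new TCP connection for every request.
    bm.report('close each:') do
      N.times { Net::HTTP.start(HOST, PORT) { |http| http.get('/') } }
    end
  end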
But, if you're reading through the spec you might be able to help me
out, since I'm writing a test suite for this very purpose (and then
exploits around it). If you can, help me find the answers to these:
1) Can a client perpetually send pipelined requests, eating up available
socket descriptors (remember, Ruby's only got 1024 available FDs, about
500 sockets in practical usage)?
2) Can a client send 20 or 30 requests right away and not process any
responses, and then suddenly close?
3) Can a client "trickle" requests (send them very slowly and very
chunked) in such a way that the server has to perform tons of
processing? (See the sketch after this list.)
4) Who closes? It's not clear whether the client closes or the server
closes, who's allowed to close, when, and in what situations. This is
really unclear but incredibly important in a TCP/IP protocol, and in the
HTTP RFC it's hidden in little SHOULD and MAY statements in all sorts of
irrelevant sections.
5) What are the official size limits of each element of HTTP? Can
someone send a 1M header element?
6) Why are servers required to be nice to malicious clients? All over
the spec are places where the server is required to read all of the
client's garbage and then politely return an error. With DDoS you'd
think this would change. So when is it appropriate for a server to be
mean in order to protect itself?
7) What's the allowed time limit for a client to complete its request?
8) Are pipelined requests all sent at once and then all processed at
once? Or are they sent/processed/sent/processed in keeping with HTTP
request/response semantics?
  a) If a client can pipeline 20 requests, but request #3 causes an
error that requires the client be closed, does the server have to
process the remaining 17 before responding (see #6)?
  b) If a client does request/response, then why have pipelining at all?
  c) How does a client make 20 requests and then, after getting #6,
abort the remaining 14?
  d) What does the server do with all the resources it's gathered up if
the socket is closed?
  e) The server can't just start sending, since client receive buffers
and server send buffers are finite and set by the OS. If that's the
case, then either the server has to queue up all the responses and send
them when the client is done, or the client has to do request/response.
  f) If they do request/response, how do they synchronize the
processing? It's a catch-22 if you say they can send 20 pipelined
requests, but in actuality, due to send/recv buffers, they also have to
process responses at the same time. Without a clear decision on this
it's very difficult, and pretty much either side can just stop
processing without the other side knowing.
9) If both sides just keep sockets open and process whatever comes their
way, then what prevents a malicious client or server from doing nothing
and eating up resources?
10) If there are pipelined requests and responses, then why are there
chunked encoding, multipart MIME, byte ranges, and other mechanisms for
doing nearly the same thing?
11) If it's not explicitly declared that both sides will pipeline, and
neither side needs to declare the size of its content, then what
prevents both sides from sending tons of junk? How does either side
really know the end of a request?
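As a concrete example of question 3, a hypothetical "trickle" client
might look like this (the target host/port and the half-second delay are
made up); it drips a perfectly valid request one byte at a time while
the server has to hold a socket and parser state open the whole while:

  require 'socket'

  sock = TCPSocket.new('localhost', 3000)          # hypothetical target
  "GET / HTTP/1.1\r\nHost: localhost\r\n\r\n".each_char do |ch|
    sock.write(ch)
    sleep 0.5                                      # very slowly, very chunked
  end
  puts sock.readpartial(4096)
  sock.close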
That's from my latest notes. As you can see, most of the problems
encountered tend to come from a lack of clarity in the areas of:
* Asynchronous vs. Synchronous processing.
* Request/Response vs. Batch vs. spray and pray. :-)
* Abuse of resources by clients.
* Changes in the technology landscape since 1999 that put servers at a
major disadvantage (DDoS, baby).
* A lack of understanding of the needs of web applications like Mongrel,
which typically run on localhost or on highly controlled networks where
much of this isn't necessary and only adds complexity.
* Not anticipating that the *real* performance problem in web
applications is *not* TCP/IP connection time, but rather the slow
nature of dynamic page generation (can we get something other than ETag,
please?).
> Anyway, trying to understand why it works this way. Anyone know?
Yeah, you know what we should do, and you might get a kick out of this:
I'm working on a test suite in RFuzz that explores all the parts of the
RFC. I've got sections 3 and 4 laid out and ready to be filled in, with
more to come. It basically goes through each part and makes sure a
server is compliant. I'm also working up attacks and DDoS operations
that exploit the ambiguous parts of the RFC using RFuzz.
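To give a flavor of what one of those compliance checks might look like,
here's a sketch using plain Test::Unit and a raw socket rather than
RFuzz's actual API; the target host/port and the expectation that a 1MB
header gets a 4xx (question 5 above) are my assumptions, not anything
the RFC pins down:

  require 'test/unit'
  require 'socket'

  class OversizedHeaderTest < Test::Unit::TestCase
    HOST, PORT = 'localhost', 3000   # hypothetical server under test

    # Question 5: what happens when a client sends a 1MB header element?
    def test_huge_header_is_refused
      sock = TCPSocket.new(HOST, PORT)
      sock.write("GET / HTTP/1.1\r\nHost: #{HOST}\r\nX-Junk: #{'a' * 1_048_576}\r\n\r\n")
      status = sock.readpartial(1024)[/\AHTTP\/1\.[01] (\d{3})/, 1].to_s
      assert_match(/\A4\d\d\z/, status, "expected a 4xx refusal for a 1MB header")
    ensure
      sock.close if sock && !sock.closed?
    end
  end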
If you want, hook up with me off list and maybe we can fill out the
RFuzz test suite that does this part of the RFC, then work out the
exploits, *then* beef up Mongrel to deal with it. Could be fun.
--
Zed A. Shaw
http://www.zedshaw.com/
http://mongrel.rubyforge.org/
http://www.railsmachine.com/ -- Need Mongrel support?