Apache :: Sporadic hangs with Event MPM?

neilgunton · Joined: 12 Feb 2016 Posts: 2 Location: Albany, Oregon

I am running Apache 2.4.18 (front-end caching reverse proxy, Event MPM, SSL) and 2.2.31 (mod_perl back end, prefork MPM), both compiled from source and both on the same dedicated server, to host a community website that I have been developing and running for the last 15 years.

I have noticed recently that with the Event MPM enabled on the front end, I would see occasional, sporadic hangs when trying to load the website in my browser. It would just sit there saying "connecting". Then it would often clear up by itself, sometimes after a few seconds, sometimes longer. Sometimes I would have to restart Apache. I don't know whether it would eventually have recovered if I had just left it, but on a live website my first priority is to get the site working again.

When I am at my console and able to access the server during those moments, I run htop and can see that Apache is not serving requests. Tailing the server log shows that nobody is getting any pages, and htop shows no CPU activity either. It's just sitting there, so it's not just a browser or DNS issue on the client side. Then suddenly it all clears up and the requests come through as usual, as if nothing had happened.
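For anyone wanting to confirm stalls like this after the fact rather than by watching a live tail, here is a minimal Python sketch that scans an access log for silent periods. It assumes the log uses the Common/Combined Log Format timestamp (`[12/Feb/2016:10:15:32 -0800]`) with English month names. The function name `find_gaps` and the 30-second threshold are illustrative choices, not anything from the setup described above.

```python
import re
from datetime import datetime, timedelta

# Bracketed timestamp as written by the Common/Combined Log Format,
# e.g. [12/Feb/2016:10:15:32 -0800]. Other LogFormat settings need another regex.
TIMESTAMP_RE = re.compile(r"\[(\d{2}/\w{3}/\d{4}:\d{2}:\d{2}:\d{2} [+-]\d{4})\]")


def find_gaps(lines, threshold=timedelta(seconds=30)):
    """Return (before, after, gap) tuples for silent periods longer than threshold."""
    gaps = []
    latest = None  # most recent timestamp seen so far
    for line in lines:
        match = TIMESTAMP_RE.search(line)
        if not match:
            continue  # skip lines without a recognizable timestamp
        current = datetime.strptime(match.group(1), "%d/%b/%Y:%H:%M:%S %z")
        if latest is not None and current - latest > threshold:
            gaps.append((latest, current, current - latest))
        # Entries are written when a request finishes, so timestamps can be
        # slightly out of order; only ever move the reference point forward.
        if latest is None or current > latest:
            latest = current
    return gaps
```

Calling it as `find_gaps(open("access.log"))` (the path is a placeholder) lists every window in which nothing was logged. On a site with steady traffic, a gap of a minute or more in the middle of the day would line up with the kind of hang described here.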

Then one day recently I had a brainstorm and decided to try switching the build on the front end to Worker MPM instead of Event. Nothing else changed, just the MPM. And lo and behold, no more hangs since then.

I am aware that the Event MPM behaves like Worker when SSL is in use. But not everybody accesses my site via SSL, so I thought it might help at least some of the time. Shouldn't matter in any case, should it?

Looking back, I can remember that when I first started using the Event MPM (sometime last summer, I think, right about when I was getting slammed by hits from China and had to revamp my config to handle the traffic better), setting MaxConnectionsPerChild for the front end to anything but 0 meant that eventually the child server that had those threads would just sit there in the status page for a while. It was as if it had a problem closing down the threads. Apache was still serving requests, but these "zombie" threads kept showing up in the status page. I worked around it by simply setting MaxConnectionsPerChild to 0.

I thought, from what I had read, that Event was a better choice than Worker given the heavy traffic I was getting from China. (It was really bad. I think some DNS got redirected to my server for whatever reason, because I was getting tons of requests that were clearly for other domains and related to BitTorrent etc., which I don't run.) Anyway, I eventually solved the DDoS problem by banning China completely at the firewall level, so that's not so much of an issue any more. They still hit me daily, but not at a volume that affects performance. I handle this using iptables + ipset, with IP blocks from http://www.ipdeny.com/ipblocks/.
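The iptables + ipset approach can be sketched roughly as follows. This is only an illustration of the idea, not the actual setup used here: it assumes a downloaded zone file with one IPv4 CIDR block per line, and the function name `build_ipset_restore` and set name `blocked_country` are made up for the example.

```python
import ipaddress


def build_ipset_restore(cidr_lines, set_name="blocked_country"):
    """Turn CIDR strings into text suitable as input for `ipset restore`."""
    commands = [f"create {set_name} hash:net"]
    for raw in cidr_lines:
        entry = raw.strip()
        if not entry or entry.startswith("#"):
            continue  # ignore blank lines and comments
        # Validate the entry and normalize it to its network address;
        # raises ValueError on anything that is not an IPv4 network.
        network = ipaddress.IPv4Network(entry, strict=False)
        commands.append(f"add {set_name} {network}")
    return "\n".join(commands) + "\n"
```

The output would be piped into `ipset restore`, and a single rule such as `iptables -I INPUT -m set --match-set blocked_country src -j DROP` then drops everything from the set. The benefit over one iptables rule per block is that the kernel does a hash lookup instead of walking thousands of rules.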

So the question is: has anyone else noticed any issues with the Event MPM in terms of occasional, sporadic hangs? I won't bother posting my entire configuration here for now, because it's quite large and I'm not sure it would be relevant, given that I am seeing such a difference between the Event and Worker MPMs. Should there be any difference at all if you switch between them but leave everything else the same? Surely not. Unfortunately I can't be of much help in tracing what the code is doing, since the hangs are so occasional and so unpredictable, and nothing crashes (I think). It just hangs for a while and then eventually recovers. There's never anything in the log files afterwards. During the hangs I am still able to access the server remotely via SSH without any issues, so it's not a network problem. It's just Apache that stops responding.

Just wondering if anyone else has noticed this as being an issue with Event vs Worker. Everything's good now with Worker, so I'm happy to leave it like that (and I'm finding that the status page is a bit more informative now too), I just thought I'd bring this up in case the Apache people weren't aware of a bug somewhere in there.
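On the status page: mod_status also has a machine-readable view (the `?auto` query string) that includes a `Scoreboard:` line with one character per worker slot. A small sketch like the one below could be used to log the slot states over time and see what the workers are doing when a hang starts. The function name `summarize_scoreboard` is made up for the example, and where the status page is mounted depends on your config.

```python
from collections import Counter

# Scoreboard key as printed on the mod_status page.
SCOREBOARD_KEY = {
    "_": "waiting for connection",
    "S": "starting up",
    "R": "reading request",
    "W": "sending reply",
    "K": "keepalive (read)",
    "D": "DNS lookup",
    "C": "closing connection",
    "L": "logging",
    "G": "gracefully finishing",
    "I": "idle cleanup of worker",
    ".": "open slot with no current process",
}


def summarize_scoreboard(status_text):
    """Count worker states from the text of a `?auto` status response."""
    for line in status_text.splitlines():
        if line.startswith("Scoreboard:"):
            board = line.split(":", 1)[1].strip()
            counts = Counter(board)
            return {
                SCOREBOARD_KEY.get(char, f"unknown ({char})"): count
                for char, count in counts.items()
            }
    return {}  # no Scoreboard line found
```

A child that is winding down after reaching MaxConnectionsPerChild would be expected to show up as "gracefully finishing" slots, so a count that stays high for a long time might correspond to the lingering threads mentioned above.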

My server runs 64-bit Debian Wheezy. The website gets around 500,000 to 700,000 or more HTTP requests per day (around 100,000 to 150,000 page requests). So it's not that busy, but it gets steady, reasonable traffic. It's a bicycle touring journal community website with forums, so basically a lot of text and pics, but no video or anything like that. The server also runs DNS, MySQL and Sendmail. When the hangs happen, the server is nowhere near its capacity or hitting any limits with traffic.

Thanks for any insights,

Neil

James Blond

When it comes to SSL connections, I haven't noticed any difference between Event and Worker.

You wrote that you use it as a caching reverse proxy. It might be an issue with the cache cleaning process. At least, I had that problem.

neilgunton · Joined: 12 Feb 2016 Posts: 2 Location: Albany, Oregon

James Blond

I don't have that problem with the Event MPM.
I also compile Apache and all dependencies from source. See https://github.com/JBlond/debian_build_apache24