__z9ia
31.9K
· June 2013 ·
Jackjon

Scale PHP on Ec2 to 30,000 Concurrent Users / Server

RockThePost.com is a LAMP stack hosted on Ec2. We're preparing to be featured in an email which will be sent to ~1M investors... all at the same time. For our 2 person engineering department, that meant we had to do a quick sanity check to see just how many people we can support concurrently.

Our app uses PHP's Zend Framework 2. Our web servers are two m1.medium Ec2 machines with an ELB in front of them setup to split the load. On the backend, we're running a master/slave MySQL database configuration. Very typical for most of the small data shops I've worked at.

Here are my opinions and thoughts in random order.

  • Use PHP's APC feature. I don't understand why this isn't on by default but having APC enabled is really a requirement in order for a website to have a chance at performing well.

  • Put everything that's not a .php request on a CDN. No need to bog down your origin server with static file requests. Our deployer puts everything on S3 and we abs path everything to CloudFront. Disclaimer: CloudFront has had some issues lately and we've recently set the site to instead serve all static materials directly from S3 until the CloudFront issues are resolved.

  • Don't make connections to other servers in your PHP code, such as the database and memcache servers, unless it's mandatory and there's really NO other way to do what you're trying to do. I'd guess that the majority of PHP web apps out there use a MySQL database for the backend session management and a memcache pool for caching. Making connections to other servers during your execution flow is not efficient. It blocks, runs up the CPU, and tends to hold up the line, as it were. Instead, use the APC key/value store for storing data in PHP and Varnish for full page caching.

  • I repeat... Use Varnish. In all likelihood, most of the pages on your site do not change, or hardly ever change. Varnish is the Memcache/ModRewrite of web server caching. Putting Varnish on my web server made the single biggest difference to my web app when I was load testing it.

  • Use a c1.xlarge. The m1.medium has only 1 CPU to handle all of the requests. I found that upgrading to a c1.xlarge, which has 8 CPU's, really paid off. During my load test, I did an apt-get install nmon and then ran nmon and monitored the CPU. You can literally watch the server use all 8 of its cores to churn out pages.

Google Analytics shows us how many seconds an average user spends on an average page. Using this information, we ran some load tests with Siege and were able to extrapolate that with the above recommendations, we should be able to handle 30,000 users concurrently per web server on the "exterior" of the site. For "interior" pages which are PHP powered, we're expecting to see something more along the lines of 1,000 concurrent sessions per server. In advance of the email blast, we'll launch a bunch of c1.xlarges and then kill them off slowly as we get more comfortable with the actual load we're seeing on blast day.

Finally, we will do more work to make our code itself more scaleable... however, the code is only about 4 months old and this is the first time we've ever been challenged to actually make it scale.

About RockThePost.com

We help entrepreneurs raise money for a new venture. If you're thinking about crowdfunding your next venture, use coupon code "HACKERNEWS" for a 50% discount on any of our plans.

Sign in or sign up to add your response.

9 Responses

7394
 dsc9440 small

Nice article Jonathan, it supports my own opinions.

over 1 year ago ·
7397
Jackjon

@luceos Thank you!

over 1 year ago ·
7422
Jackjon

ck2k on hacker news also added the following comment to this article which I thought was interesting.. he wrote:
.....
Stop using APC and switch to the "new" Zend Opcache.
It can serve 10-30% more requests per second than APC and has several layers of optimization.
It's bundled with PHP 5.5 but will work just fine with PHP 5.3 and 5.4
ps. also turn off file stat in the opcache on production servers
pps. make sure you are using PHP-FPM

over 1 year ago ·
7660
93d1c969262406ec439a8071fbacb57b

Nice article!
I just want to ask a question: How do you "use the APC key/value store for storing data in PHP" instead of using a mysql database, can you provide a use case?

over 1 year ago ·
7671
Jackjon

ntdunglc - Sure - lets say for example you want to display the list of the 50 states. It makes no sense to query your database for that. It's really slow. Put the list of the 50 states in APC with a high cache TTL. There probably won't be any new states joining the union any time soon. ;)

over 1 year ago ·
7680
93d1c969262406ec439a8071fbacb57b

Thank you !

over 1 year ago ·
7838

Nice and misleading title, you claim to power 30k concurrent connections on PHP then near the end you state it's on "exterior" pages which are either coming from a CDN or in your varnish cache.

over 1 year ago ·
8008
Ad3b632cd2093fb46d9da42bf93041a2

Interesting article, but you say 'just use APC to store key-value data', how do you get it in there / manage it in the first place? What happens if you bring a new server up / need to restart one - surely APC isn't safe to use there?

over 1 year ago ·
14514
B4814d6790e91f01c77cac9d25db12b6

While it certainly helps an enormous amount on a single server, APC (and all other in-process caches) isn't a scalable solution because it can't maintain cache coherence beyond a single host. As soon as you have more than one web server, you need to use something distributed such as memcached or redis.
For front ends, both nginx and haproxy are faster than varnish and will scale to higher connection counts. They don't necessarily do the same thing, but there is a lot of overlap for proxying, balancing, failover, caching, SSL unwrapping etc that they can make a better job of than varnish.

over 1 year ago ·
Featured Programming Job

Senior Rails Developer
·
Brooklyn, NYC
·
Full Time
Search all programming jobs