Facebook's Architecture: Built to Scale

Adam Rifkin stashed this in Scaling

MySQL 5.6 Replication: Enabling the Next Generation of Web & Cloud Services

Source: Collected by Michaël Figuière, Software Engineer at Xebia. Posted 14th January by Zaur Amikishiyev.

Adam Rifkin
11:53 AM Apr 10 2012

Stashed in: DevOps, Facebook!

It turns out the original source is not the above blog post, but Quora:

PHP + HipHop, Thrift, Java, MySQL, Memcached, ~~Cassandra~~ Hadoop's HBase, Hive, Scribe, Scribe-HDFS, BigPipe, Varnish, Haystack, Cell, Erlang.

Facebook runs over 60,000 servers.

Their Oregon datacenter is based on entirely self-designed hardware, aka the Open Compute Project.

Numbers: 300 TB of data is stored in Memcached processes. Their Hadoop and Hive cluster has 3,000 servers, each with 8 cores, 32 GB RAM and 12 TB of disk, for a total of 24k cores, 96 TB RAM and 36 PB of disk.
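The cluster totals quoted above follow directly from the per-server specs. Here is a quick sanity check in Python; the function name `cluster_totals` is made up for illustration, and decimal units (1 TB = 1000 GB, 1 PB = 1000 TB) are an assumption, since the post doesn't say which convention it uses.

```python
# Per-server figures as quoted in the post.
SERVERS = 3000
CORES_PER_SERVER = 8
RAM_GB_PER_SERVER = 32
DISK_TB_PER_SERVER = 12


def cluster_totals(servers, cores, ram_gb, disk_tb):
    """Multiply per-server specs out to cluster-wide totals.

    Assumes decimal units: 1 TB = 1000 GB and 1 PB = 1000 TB.
    """
    return {
        "cores": servers * cores,
        "ram_tb": servers * ram_gb / 1000,
        "disk_pb": servers * disk_tb / 1000,
    }


totals = cluster_totals(
    SERVERS, CORES_PER_SERVER, RAM_GB_PER_SERVER, DISK_TB_PER_SERVER
)
print(totals)  # {'cores': 24000, 'ram_tb': 96.0, 'disk_pb': 36.0}
```

So the 24k cores / 96 TB RAM / 36 PB disk figures are consistent with 3,000 identical machines.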

Scaling: 100 billion hits/day, 50 billion photos, 3 trillion objects cached, 130 TB of logs per day as of July 2010.
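To put the daily figures in perspective, here is a rough conversion to average per-second rates. This is only a back-of-the-envelope sketch: it assumes traffic is spread evenly over the day (real traffic has peaks) and that 1 TB = 10^12 bytes; the helper name `per_second` is made up for illustration.

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400


def per_second(per_day):
    """Average rate per second, assuming an even spread across the day."""
    return per_day / SECONDS_PER_DAY


hits_per_sec = per_second(100e9)        # 100 billion hits/day
log_bytes_per_sec = per_second(130e12)  # 130 TB of logs/day, decimal TB

print(f"{hits_per_sec:,.0f} hits/s")                   # 1,157,407 hits/s
print(f"{log_bytes_per_sec / 1e9:.2f} GB/s of logs")   # 1.50 GB/s of logs
```

That works out to over a million hits per second on average, with roughly 1.5 GB of log data written every second.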

Adam Rifkin
12:02 PM Apr 10 2012

Quite the beast! It amazes me that this has been built, managed and maintained by ~3000 employees. It would be nice to know how the system evolved from PHP/MySQL to what it is today, and when certain technologies were brought in to handle the issues and challenges they faced.

Eric Thedaker
12:08 PM Apr 10 2012

Actually, you'd probably enjoy "Scaling to 500 million users and beyond" by Robert Johnson on the Facebook blog.

He's brilliant and was there for a lot of the scaling decisions.

You can read the whole evolution by scanning the Facebook Engineering Blog, too.

Adam Rifkin
12:14 PM Apr 10 2012
