Apache Kafka is the circulatory system in use at LinkedIn.

Mo Data stashed this in Big Data Technologies

https://engineering.linkedin.com/kafka/running-kafka-scale

If data is the lifeblood of high technology, Apache Kafka is the circulatory system in use at LinkedIn. We use Kafka for moving every type of data around between systems, and it touches virtually every server, every day. The complexity of the infrastructure, as well as the reasoning behind choices that have been made in its implementation, has developed out of a need to move a large amount of data around quickly and reliably.

What is Kafka?Apache Kafka is a publish/subscribe messaging system with a twist: it combines queuing with message retention on disk. Think of it as a commit log that is distributed over several systems in a cluster. Messages are organized into topics and partitions, and each topic can support multiple publishers (producers) and multiple subscribers (consumers). Messages are retained by the Kafka cluster in a well-defined manner for each topic:

For a specific amount of time (measured in days at LinkedIn)
For a specific total size of messages in a partition
Based on a key in the message, storing only the most recent message per key

Kafka provides reliability, resiliency, and retention, all while performing at high throughput.

There have been many papers and talks on Kafka, including a talk given at ApacheCon 2014 by Clark Haskins and myself. If you are not yet familiar with Kafka, you may want to check those links out to learn the basics of how it operates.

How Big is Big?Kafka itself is not concerned with the content of the messages themselves. Data of many different types can easily coexist on the same cluster, divided into topics for each type of data. Producers and consumers only need to concern themselves with the topics they are interested in. LinkedIn goes one step further, and defines four categories of messages: queuing, metrics, logs and tracking data that each live in their own cluster.

When combined, the Kafka ecosystem at LinkedIn is sent over 800 billion messages per day which amounts to over 175 terabytes of data. Over 650 terabytes of messages are then consumed daily, which is why the ability of Kafka to handle multiple producers and multiple consumers for each topic is important. At the busiest times of day, we are receiving over 13 million messages per second, or 2.75 gigabytes of data per second. To handle all these messages, LinkedIn runs over 1100 Kafka brokers organized into more than 60 clusters.

QueuingQueuing is the standard messaging type that most people think of: messages are produced by one part of an application and consumed by another part of that same application. Other applications aren't interested in these messages, because they're for coordinating the actions or state of a single system. This type of message is used for sending out emails, distributing data sets that are computed by another online application, or coordinating with a backend component.

MetricsMetrics handles all measurements generated by applications in their operation. It includes everything from OS and hardware statistics to application-specific measurements which are critical to ensuring the proper functioning of the system. This is the eyes and ears of LinkedIn, providing visibility into the status of all servers and applications, and driving our internal monitoring and alerting systems. If you’d like to know more about our metrics, you can read about the original design of our Autometrics system, as well as a recent post by Stephen Bisordi on where Autometrics is headed next.

LoggingLogging includes application, system, and public access logs. Originally, metrics and logging coexisted on the same cluster for convenience. We now keep logging data separate simply because of how much there is. The logging data is produced into Kafka by applications, and then read by other systems for log aggregation purposes.

TrackingTracking includes every action taken on the front lines of LinkedIn's infrastructure, whether by users or applications. These actions need to be communicated to other applications as well as to stream processing in Apache Samza and batch processing in Apache Hadoop. This is the bread and butter of big data: the information we need to keep search indices up to date, track usage of paid services, and measure numerous growth vectors in real time. All four types of messaging are critically important to the proper functioning of LinkedIn, however tracking data is the most visible as it is often seen at the executive levels and is what drives revenue.

Tiers and AggregationLike all large websites, LinkedIn operates out of multiple datacenters. Some applications, such as those serving a specific user's requests, are only concerned with what is going on in a single datacenter. Many other applications, such as those maintaining the indices that enable search, need a view of what is going on in all datacenters.

For each message category, LinkedIn has a cluster named local containing messages created in the datacenter. There is also an aggregate cluster, which combines messages from all local clusters for a given category. We use the Kafka mirror maker application to copy messages forward, from local into aggregate. This avoids any message loops between local clusters.

Kafka Tier Architecture Figure 1: Layout of tiered Kafka architecture

Moving the data within the Kafka infrastructure reduces network costs and latency by allowing us to copy the messages a minimum number of times (once per datacenter). Consumers access the data locally, which simplifies their configuration and allows them to not worry about many types of cross-datacenter network problems. The producer and consumer complete the concept of tiers within our Kafka infrastructure. The producer is the first tier, the local cluster (across all datacenters) is the second, and each of the aggregate clusters is an additional tier. The consumer itself is the final tier.

This tiered infrastructure solves many problems, but it greatly complicates monitoring Kafka and assuring its health. While a single Kafka cluster, when running normally, will not lose messages, the introduction of additional tiers, along with additional components such as mirror makers, creates myriad points of failure where messages can disappear. In addition to monitoring the Kafka clusters and their health, we needed to create a means to assure that all messages produced are present in each of the tiers, and make it to the critical consumers of that data.

Auditing CompletenessKafka Audit is an internal tool at LinkedIn that helps to make sure all messages produced are copied to every tier without loss. Message schemas contain a header containing critical data common to every message, such as the message timestamp, the producing service, and the originating host. As an individual producer sends messages into Kafka, it keeps a count of how many messages it has sent during the current time interval. Periodically, it transmits that count as a message to a special auditing topic. This gives us information about how many messages each producer attempted to send into a specific topic.

One of our Kafka infrastructure applications, called the Kafka Console Auditor, consumes all messages from all topics in a single Kafka cluster. Like the producer, it periodically sends messages into the auditing topic stating how many messages it consumed from that cluster for each topic for the last time interval. By comparing these counts to the producer counts, we are able to determine that all of the messages produced actually got to Kafka. If the numbers differ, then we know that a producer is having problems, and we are able to trace that back to the specific service and host that is failing. Each Kafka cluster has its own console auditor that verifies its messages. By comparing each tier's counts against each other, we can assure that every tier has the same number of messages present. This assures that we have neither loss, nor duplication, of messages and can take immediate action if there is a problem.

Audit Overview Figure 2: Overview of audit of Kafka messages

Certain critical consumers of messages, such as the Hadoop grids, also write back auditing information as a separate tier. This allows us to not only monitor to make sure the producers are all working and that Kafka is passing messages, but also validates that the consumer is receiving all of those messages. Should there be a problem with the application copying messages from Kafka to Hadoop, it will show up in the Kafka Audit tool as an error specific to the tier names that Hadoop uses. This final piece provides us with end-to-end assurance that every message produced was ultimately consumed.

Bringing It All TogetherThis may seem like a lot of complexity to layer on top of a simple Kafka cluster -- giving us an overwhelming task of making sure that all applications at LinkedIn do things the same way -- but we have an ace in the hole. LinkedIn has a Kafka engineering team comprised of some of the top open source Kafka developers. They provide internal support to LinkedIn's development community, assisting them with using Kafka in a consistent and maintainable manner. They are a common point of contact for anyone who wants to know how to implement a producer or consumer, or deep dive into specific design concerns around how to use Kafka in the best way for their application.

The Kafka development team also provides an additional benefit for LinkedIn, which is a set of custom libraries that layer over the open source Kafka libraries and tie all of the extras together. For example, almost all producers of Kafka messages within LinkedIn use a library called TrackerProducer. When the application calls it to send a message, it takes care of inserting message header fields, schema registration, as well as tracking and sending the auditing messages. Likewise, the consumer library takes care of fetching schemas from the registry and deserializing Avro messages. The majority of the Kafka infrastructure applications, such as the console auditor, are also maintained by the development team.

Figure 3: Kafka as a Service at LinkedIn

Moving ForwardAs Mammad Zadeh, our Director of Engineering, recently posted, LinkedIn’s commitment to Apache Kafka remains strong. The engineering team is actively working with the open source community on evolving Kafka. This includes the addition of strong security controls, quotas, and making sure that LinkedIn is able to scale to 1 trillion messages a day and beyond. Samza, our stream processing framework layered over Kafka, has recently graduated from Apache Incubator to a top-level project.

The SRE team is working in lockstep with engineering, matching our deep operational experience with the developers’ knowledge of the code. SRE is also working on continuing to automate the process of running Kafka, creating tooling for tasks such as moving partitions that will graduate into integrated components of Kafka. We are also constantly evaluating the best tuning for running Kafka at scale and communicating our findings back to the community at large.

We are active on the Apache Kafka mailing lists, and LinkedIn is proud to host both the Apache Kafka Meetup and the Bay Area Samza Meetup, alternating monthly. Join us in person, or stream the meetups remotely, to find out more about what LinkedIn, and other companies, are doing with Kafka and Samza!

<a rel="nofollow" target="_blank" href="https://engineering.linkedin.com/kafka/running-kafka-scale">https://engineering.linkedin.com/kafka/running-kafka-scale</a>

If data is the lifeblood of high technology, <a rel="nofollow" target="_blank" href="http://kafka.apache.org/">Apache Kafka</a>
 is the circulatory system in use at LinkedIn. We use Kafka for moving 
every type of data around between systems, and it touches virtually 
every server, every day. The complexity of the infrastructure, as well 
as the reasoning behind choices that have been made in its 
implementation, has developed out of a need to move a large amount of 
data around quickly and reliably.
What is Kafka?Apache Kafka is a publish/subscribe messaging system with a twist: it
 combines queuing with message retention on disk. Think of it as a 
commit log that is distributed over several systems in a cluster. 
Messages are organized into topics and partitions, and each topic can 
support multiple publishers (producers) and multiple subscribers 
(consumers). Messages are retained by the Kafka cluster in a 
well-defined manner for each topic:

<ul><li>For a specific amount of time (measured in days at LinkedIn)</li>
<li>For a specific total size of messages in a partition</li>
<li>Based on a key in the message, storing only the most recent message per key</li></ul>

Kafka provides reliability, resiliency, and retention, all while performing at high throughput.

There have been <a rel="nofollow" target="_blank" href="https://cwiki.apache.org/confluence/display/KAFKA/Kafka+papers+and+presentations">many papers and talks on Kafka</a>, including a <a rel="nofollow" target="_blank" href="http://www.slideshare.net/ToddPalino/enterprise-kafka-kafka-as-a-service">talk given at ApacheCon 2014</a> by <a rel="nofollow" target="_blank" href="https://www.linkedin.com/in/clarkhaskins">Clark Haskins</a>
 and myself. If you are not yet familiar with Kafka, you may want to 
check those links out to learn the basics of how it operates.How Big is Big?Kafka itself is not concerned with the content of the messages 
themselves. Data of many different types can easily coexist on the same 
cluster, divided into topics for each type of data. Producers and 
consumers only need to concern themselves with the topics they are 
interested in. LinkedIn goes one step further, and defines four 
categories of messages: queuing, metrics, logs and tracking data that 
each live in their own cluster.

When combined, the Kafka ecosystem at LinkedIn is sent over 800 
billion messages per day which amounts to over 175 terabytes of data. 
Over 650 terabytes of messages are then consumed daily, which is why the
 ability of Kafka to handle multiple producers and multiple consumers 
for each topic is important. At the busiest times of day, we are 
receiving over 13 million messages per second, or 2.75 gigabytes of data
 per second. To handle all these messages, LinkedIn runs over 1100 Kafka
 brokers organized into more than 60 clusters.QueuingQueuing is the standard messaging type that most people think 
of: messages are produced by one part of an application and consumed by 
another part of that same application. Other applications aren't 
interested in these messages, because they're for coordinating the 
actions or state of a single system. This type of message is used for 
sending out emails, distributing data sets that are computed by another 
online application, or coordinating with a backend component.MetricsMetrics handles all measurements generated by applications in 
their operation. It includes everything from OS and hardware statistics 
to application-specific measurements which are critical to ensuring the 
proper functioning of the system. This is the eyes and ears of LinkedIn,
 providing visibility into the status of all servers and applications, 
and driving our internal monitoring and alerting systems. If you’d like 
to know more about our metrics, you can read about the <a rel="nofollow" target="_blank" href="https://engineering.linkedin.com/52/autometrics-self-service-metrics-collection">original design of our Autometrics system</a>, as well as a recent post by Stephen Bisordi on <a rel="nofollow" target="_blank" href="https://engineering.linkedin.com/metrics/scaling-collection-self-service-metrics">where Autometrics is headed next</a>.LoggingLogging includes application, system, and public access logs. 
Originally, metrics and logging coexisted on the same cluster for 
convenience. We now keep logging data separate simply because of how 
much there is. The logging data is produced into Kafka by applications, 
and then read by other systems for log aggregation purposes.TrackingTracking includes every action taken on the front lines of 
LinkedIn's infrastructure, whether by users or applications. These 
actions need to be communicated to other applications as well as to 
stream processing in <a rel="nofollow" target="_blank" href="http://samza.apache.org/">Apache Samza</a> and batch processing in <a rel="nofollow" target="_blank" href="http://hadoop.apache.org/">Apache Hadoop</a>.
 This is the bread and butter of big data: the information we need to 
keep search indices up to date, track usage of paid services, and 
measure numerous growth vectors in real time. All four types of 
messaging are critically important to the proper functioning of 
LinkedIn, however tracking data is the most visible as it is often seen 
at the executive levels and is what drives revenue.Tiers and AggregationLike all large websites, LinkedIn operates out of multiple 
datacenters. Some applications, such as those serving a specific user's 
requests, are only concerned with what is going on in a single 
datacenter. Many other applications, such as those maintaining the 
indices that enable search, need a view of what is going on in all 
datacenters.

For each message category, LinkedIn has a cluster named local 
containing messages created in the datacenter. There is also an 
aggregate cluster, which combines messages from all local clusters for a
 given category. We use the Kafka mirror maker application to copy 
messages forward, from local into aggregate. This avoids any message 
loops between local clusters.

<img src="//img.pandawhale.com/post-60475-Kafka-Tier-Architecture-Kf3o.png" alt="Kafka Tier Architecture" />Figure 1: Layout of tiered Kafka architecture

Moving the data within the Kafka infrastructure reduces network costs
 and latency by allowing us to copy the messages a minimum number of 
times (once per datacenter). Consumers access the data locally, which 
simplifies their configuration and allows them to not worry about many 
types of cross-datacenter network problems. The producer and consumer 
complete the concept of tiers within our Kafka infrastructure. The 
producer is the first tier, the local cluster (across all datacenters) 
is the second, and each of the aggregate clusters is an additional tier.
 The consumer itself is the final tier.

This tiered infrastructure solves many problems, but it greatly 
complicates monitoring Kafka and assuring its health. While a single 
Kafka cluster, when running normally, will not lose messages, the 
introduction of additional tiers, along with additional components such 
as mirror makers, creates myriad points of failure where messages can 
disappear. In addition to monitoring the Kafka clusters and their 
health, we needed to create a means to assure that all messages produced
 are present in each of the tiers, and make it to the critical consumers
 of that data.Auditing CompletenessKafka Audit is an internal tool at LinkedIn that helps to make sure 
all messages produced are copied to every tier without loss. Message 
schemas contain a header containing critical data common to every 
message, such as the message timestamp, the producing service, and the 
originating host. As an individual producer sends messages into Kafka, 
it keeps a count of how many messages it has sent during the current 
time interval. Periodically, it transmits that count as a message to a 
special auditing topic. This gives us information about how many 
messages each producer attempted to send into a specific topic.

One of our Kafka infrastructure applications, called the Kafka 
Console Auditor, consumes all messages from all topics in a single Kafka
 cluster. Like the producer, it periodically sends messages into the 
auditing topic stating how many messages it consumed from that cluster 
for each topic for the last time interval. By comparing these counts to 
the producer counts, we are able to determine that all of the messages 
produced actually got to Kafka. If the numbers differ, then we know that
 a producer is having problems, and we are able to trace that back to 
the specific service and host that is failing. Each Kafka cluster has 
its own console auditor that verifies its messages. By comparing each 
tier's counts against each other, we can assure that every tier has the 
same number of messages present. This assures that we have neither loss,
 nor duplication, of messages and can take immediate action if there is a
 problem.

<img src="//img.pandawhale.com/post-60475-Audit-Overview-LR3m.png" alt="Audit Overview" />Figure 2: Overview of audit of Kafka messages

Certain critical consumers of messages, such as the Hadoop grids, 
also write back auditing information as a separate tier. This allows us 
to not only monitor to make sure the producers are all working and that 
Kafka is passing messages, but also validates that the consumer is 
receiving all of those messages. Should there be a problem with the 
application copying messages from Kafka to Hadoop, it will show up in 
the Kafka Audit tool as an error specific to the tier names that Hadoop 
uses. This final piece provides us with end-to-end assurance that every 
message produced was ultimately consumed.Bringing It All TogetherThis may seem like a lot of complexity to layer on top of a simple 
Kafka cluster -- giving us an overwhelming task of making sure that all 
applications at LinkedIn do things the same way -- but we have an ace in
 the hole. LinkedIn has a Kafka engineering team comprised of some of 
the top open source Kafka developers. They provide internal support to 
LinkedIn's development community, assisting them with using Kafka in a 
consistent and maintainable manner. They are a common point of contact 
for anyone who wants to know how to implement a producer or consumer, or
 deep dive into specific design concerns around how to use Kafka in the 
best way for their application.

The Kafka development team also provides an additional benefit for 
LinkedIn, which is a set of custom libraries that layer over the open 
source Kafka libraries and tie all of the extras together. For example, 
almost all producers of Kafka messages within LinkedIn use a library 
called <code>TrackerProducer</code>. When the application calls it to 
send a message, it takes care of inserting message header fields, schema
 registration, as well as tracking and sending the auditing messages. 
Likewise, the consumer library takes care of fetching schemas from the 
registry and deserializing Avro messages. The majority of the Kafka 
infrastructure applications, such as the console auditor, are also 
maintained by the development team.

<img src="//img.pandawhale.com/post-60475-LinkedIn-Kafka-Service-d4fA.png" alt="LinkedIn Kafka Service" />Figure 3: Kafka as a Service at LinkedIn
Moving ForwardAs <a rel="nofollow" target="_blank" href="https://www.linkedin.com/in/mammadz">Mammad Zadeh</a>, our Director of Engineering, <a rel="nofollow" target="_blank" href="https://engineering.linkedin.com/kafka/kafka-linkedin-current-and-future">recently posted</a>,
 LinkedIn’s commitment to Apache Kafka remains strong. The engineering 
team is actively working with the open source community on evolving 
Kafka. This includes the addition of strong security controls, quotas, 
and making sure that LinkedIn is able to scale to 1 trillion messages a 
day and beyond. Samza, our stream processing framework layered over 
Kafka, has recently <a rel="nofollow" target="_blank" href="https://engineering.linkedin.com/samza/apache-samza-graduates-apache-incubator">graduated from Apache Incubator to a top-level project</a>.

The SRE team is working in lockstep with engineering, matching our 
deep operational experience with the developers’ knowledge of the code. 
SRE is also working on continuing to automate the process of running 
Kafka, creating tooling for tasks such as moving partitions that will 
graduate into integrated components of Kafka. We are also constantly 
evaluating the best tuning for running Kafka at scale and communicating 
our findings back to the community at large.

We are active on the Apache Kafka mailing lists, and LinkedIn is proud to host both the <a rel="nofollow" target="_blank" href="http://www.meetup.com/http-kafka-apache-org/">Apache Kafka Meetup</a> and the <a rel="nofollow" target="_blank" href="http://www.meetup.com/Bay-Area-Samza-Meetup/">Bay Area Samza Meetup</a>,
 alternating monthly. Join us in person, or stream the meetups remotely,
 to find out more about what LinkedIn, and other companies, are doing 
with Kafka and Samza!

Mo Data
1:18 AM Mar 23 2015

Stashed in: Big Data!, @timberners_lee, Singularity!

To save this post, select a stash from drop-down menu or type in a new one:

Man there are a lot of Apache projects!

Adam Rifkin
1:21 AM Mar 23 2015

Yes, around 150 apparently http://projects.apache.org/indexes/alpha.html

Open Source is a profound movement, changing the world we live in.

What more might we achieve if we work in a distributed collaborative environment.

This was Tim Berners Lee's vision for the web, a powerful force for social change and individual creativity.

When he started building, did he have the notion of Google search, Facebook and Twitter social media, Smartphones, Sensors.

Seems like Arthur C Clarke might have, so TBL might have too.

What comes next though?

My guess is robots + neuroscience = cyborgs. Will allow us to boldly go...

Mo Data
1:41 AM Mar 23 2015

Well I find it interesting that Apache is a totally separate organization from TimBL's W3C organization.

Apache seems more interested in software whereas W3C seems more interested in policy.

And yes, a lot has happened in the 25 years since the Web was invented.

The next 25 years will test whether the Singularity is in fact something real.

Adam Rifkin
1:45 AM Mar 23 2015

I think that the separation has allowed them both to operate - so each plays a role that is not conflicted. The IETF is also a player in this with yet another role.

Seems like standards and protocols are necessary and valuable and they can actually work when the balance between utility and enforcement are correct.

Feels like it's time for the laws of robotics now

Mo Data
2:00 AM Mar 23 2015

Yeah, I agree. Hopefully the robots will first do no harm.

Adam Rifkin
2:04 AM Mar 23 2015

Are we too late for that? Feels like (at least in the public domain) we are teetering on the edge of autonomous killing machines. There is enough machine intelligence to deploy them so they are effective at the hunt and kill part, but the reasoning part feels like it's not ready.

http://www.wired.com/2015/02/can-now-build-autonomous-killing-machines-thats-bad-idea/

https://www.hrw.org/news/2014/11/28/killer-robots-why-world-should-ban-autonomous-weapons-systems

Mo Data
2:22 AM Mar 23 2015

The machines already know how to kill. Now we have to teach them a moral compass.

Adam Rifkin
2:37 AM Mar 23 2015

Apache Kafka is the circulatory system in use at LinkedIn.

Mo Data stashed this in Big Data Technologies

You May Also Like: