Paper Title

How Fast Can We Insert? An Empirical Performance Evaluation of Apache Kafka

Authors

Guenter Hesse, Christoph Matthies, Matthias Uflacker

Abstract

Message brokers see widespread adoption in modern IT landscapes, with Apache Kafka being one of the most employed platforms. These systems feature well-defined APIs for use and configuration and present flexible solutions for various data storage scenarios. Their ability to scale horizontally enables users to adapt to growing data volumes and changing environments. However, one of the main challenges concerning message brokers is the danger of them becoming a bottleneck within an IT architecture. To prevent this, knowledge about the amount of data a message broker using a specific configuration can handle needs to be available. In this paper, we propose a monitoring architecture for message brokers and similar Java Virtual Machine-based systems. We present a comprehensive performance analysis of the popular Apache Kafka platform using our approach. As part of the benchmark, we study selected data ingestion scenarios with respect to their maximum data ingestion rates. The results show that we can achieve an ingestion rate of about 420,000 messages/second on the used commodity hardware and with the developed data sender tool.
