Apache Samza

Apache Samza is an open-source near-realtime, asynchronous computational framework for stream processing developed by the Apache Software Foundation in Scala and Java.

Apache Samza
Developer(s)Apache Software Foundation
Stable release
1.2.0 / 11 June 2019 (2019-06-11)
RepositorySamza Repository
Written inScala, Java
Operating systemCross-platform
TypeDistributed stream processing
LicenseApache License 2.0
Websitesamza.apache.org

Samza allows you to build stateful applications that process data in real-time from multiple sources including Apache Kafka.

Samza provides fault tolerance, isolation and stateful processing. Unlike batch systems (like Hadoop or Spark) it provides continuous computation and output, which result in sub-second[1] response times.

There are many players in the field of real time stream processing and Samza is one of the mature products[2][3][4] . It has been added to Apache in 2013[5].

Samza is used by multiple companies[6]. The biggest installation is in Linkedin.

History

Apache Samza has been developed in conjunction with Apache Kafka. Both were originally developed by LinkedIn.[7]

See also

References

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.