Databricks Accelerates Apache Spark’s Structured Streaming and Launches Production Platform

by Angela Guess

According to a new press release, “Databricks, the company founded by the creators of the popular Apache Spark project, today announced the general availability of Structured Streaming, a high-level API that enables stream processing at up to five times higher throughput than other engines, on its cloud platform. Databricks is also contributing new code to Apache Spark that lowers the latency of Structured Streaming to the sub-millisecond range and greatly accelerates its throughput. ‘With Structured Streaming, customers can now get best-in-class latency while simultaneously benefitting from Spark’s much simpler streaming APIs and lowering the operational cost of their streaming applications by up to five times,’ said Matei Zaharia, cofounder and chief technologist at Databricks. ‘We are excited to keep working with the open source community to build out Structured Streaming and to deliver continuous application capabilities to our customers.’”

The release goes on, “Available today on Databricks’ managed cloud service when users choose ‘Databricks Runtime 3.0,’ Structured Streaming makes it easier for users to build end-to-end streaming applications that integrate with storage, serving systems and batch jobs in a consistent and fault-tolerant way. Additional features of Structured Streaming include: custom stateful processing for complex business logic such as sessionization; production monitoring for streaming jobs, alerting and management; connection to common data sources, including S3, Kinesis and Kafka.”
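
For readers unfamiliar with the term, sessionization means grouping a stream of events into per-user sessions separated by periods of inactivity. The following is a minimal plain-Python sketch of that idea only. It does not use Spark or the Structured Streaming API, and the function name `sessionize`, the `gap_seconds` parameter and the sample events are all hypothetical, chosen for illustration.

```python
from collections import defaultdict


def sessionize(events, gap_seconds=1800):
    """Group (user_id, timestamp) events into per-user sessions.

    A new session starts whenever the time since the same user's
    previous event exceeds gap_seconds. This batch version is an
    illustration only; a streaming engine would keep this state
    incrementally as events arrive.
    """
    # Collect timestamps per user.
    by_user = defaultdict(list)
    for user_id, ts in events:
        by_user[user_id].append(ts)

    sessions = {}
    for user_id, stamps in by_user.items():
        stamps.sort()  # events may arrive out of order
        user_sessions = [[stamps[0]]]
        for ts in stamps[1:]:
            if ts - user_sessions[-1][-1] > gap_seconds:
                user_sessions.append([ts])    # inactivity gap: open a new session
            else:
                user_sessions[-1].append(ts)  # still within the current session
        sessions[user_id] = user_sessions
    return sessions


# Hypothetical sample data: timestamps are seconds since an arbitrary start.
events = [("alice", 0), ("alice", 600), ("bob", 100), ("alice", 4000), ("bob", 200)]
result = sessionize(events)
```

With the default 30-minute gap, the three "alice" events split into two sessions, because the third arrives more than 1,800 seconds after the second, while both "bob" events fall into one.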

Read more at Globe Newswire.
