Companies in the category 'Data Streaming'
These are companies that provide open source platforms or tools for processing and analyzing continuous streams of data in real-time.
AI orchestration for data streaming
Pandio provides AI orchestration solutions, focusing on distributed messaging for artificial intelligence. Their Kafka API-compatible managed solution offers superior scalability and reliability for real-time data streaming, advanced analytics, and business insights.
Real-time data processing with Python
Pathway is a Python ETL framework for real-time stream processing, LLM pipelines, and RAG applications, powered by a scalable Rust engine based on Differential Dataflow. The company is also developing BDH, a post-transformer frontier AI model designed to solve the fundamental memory and continual learning limitations of transformer-based architectures.
Postgres CDC to streams and queues
Sequin is an open-source change data capture (CDC) platform that streams changes from Postgres databases to messaging systems, queues, and search indexes such as Kafka, SQS, Redis, and HTTP endpoints. Built on Elixir, it provides exactly-once processing guarantees, backfill support, and real-time streaming without requiring complex infrastructure. Sequin connects to any Postgres database version 14 or higher and supports a wide range of sink destinations.
Cloud-native Kafka on S3, 10x cost savings
AutoMQ is a cloud-native, diskless alternative to Apache Kafka that runs on S3 or any S3-compatible object storage. Founded in 2022 by the original teams behind Apache RocketMQ and the Linux LVS project, AutoMQ re-engineers Kafka's architecture by decoupling storage to object storage, transforming the classic shared-nothing architecture into a shared storage architecture. This enables elastic auto-scaling in seconds, eliminates cross-AZ traffic costs, and reduces infrastructure costs by up to 90% compared to traditional Kafka deployments. AutoMQ maintains 100% compatibility with the Apache Kafka API, allowing seamless migration without application changes. It offers sub-10ms P99 latency, infinite scalable storage via cloud object storage, and built-in Prometheus/OpenTelemetry metrics export.
COSS Weekly Newsletter
Stay up to date with the latest news, funding rounds, and announcements from the COSS universe.
Check out COSS Weekly on the webConduktor: data hub for operational data & AI.
Conduktor offers tools for managing and monitoring Apache Kafka, providing a user-friendly interface to simplify Kafka operations and improve developer productivity.
Open source cloud data platform
Aiven provides managed open-source data infrastructure services on all major clouds. It allows businesses to deploy, operate, and scale open-source data technologies such as Apache Kafka, PostgreSQL, and more. Aiven primarily serves enterprise customers by offering reliable and scalable cloud solutions.
Unified streaming data platform for Apache Flink
Ververica is a stream processing company founded by the original creators of Apache Flink that provides a unified streaming data platform powered by VERA, a cloud-native engine. The company offers enterprise-grade solutions for real-time data processing, analytics, and stream processing applications through self-managed, managed service, and bring-your-own-cloud deployment options.
Incremental computing engine for AI, ML, and data.
Feldera provides an incremental computation engine that enables organizations to process complex SQL queries and analytics workloads with instant incremental updates, eliminating the need for full recomputation when data changes.
Real-time streaming platform for data lakehouses
RisingWave Labs develops a cloud-native streaming database designed for real-time data processing and analytics. It leverages technologies such as Rust and SQL to provide scalable and efficient data solutions for enterprises.
Fast, Kafka-compatible streaming platform
Redpanda is a modern streaming data platform designed to handle real-time data with high performance and reliability. It is built on a C++ architecture and focuses on providing a Kafka-compatible API for seamless integration with existing systems. The company primarily serves enterprise customers with its robust and scalable solutions.
Streaming platform for real-time data analytics.
InfinyOn provides a real-time data streaming platform designed to accelerate data-driven applications. The platform is built on Rust and focuses on delivering high-performance and low-latency data processing for enterprises. It serves primarily enterprise customers.

