Companies in the category 'Data Lake'
These are companies that provide an open source platform for storing large amounts of raw data in its native format until it is needed.
Intelligent lakehouse platform for data access.
Dremio offers open-source data lake solutions that enable organizations to query and analyze large datasets with high performance. Their platform supports various data sources and integrations.
Git-like version control for data lakes
lakeFS develops an open-source data version control system that transforms object storage into Git-like repositories for managing data lakes. The platform enables teams to manage data with Git-like operations, providing versioning, reproducibility, and control for enterprise AI and ML initiatives at scale.
Unified data & AI governance platform.
DataStrato provides a framework for managing data and AI assets. It's technology enables enterprise users query across different lakehouse formats like Apache Iceberg, Apache Hudi, and more with lakehouse federation. It also provides enterprise-ready connectors to engines like Trino, Apache Spark, Apache Flink, and more for various data lakes.
Open-source real-time data analytics platform
Matano offers open-source solutions for real-time data processing and analytics, enabling organizations to analyze large datasets efficiently. Their platform supports various data sources and integrations.
COSS Weekly Newsletter
Stay up to date with the latest news, funding rounds, and announcements from the COSS universe.
Check out COSS Weekly on the web
