Data Warehouses & Lakes
Compare 119 data warehouses & lakes tools to find the right one for your needs
🔧 Tools
VAST Data
A data platform that unifies storage, database, and compute for AI and deep learning.
StarRocks
An open-source, high-performance analytical database designed for real-time analytics.
ClickHouse Cloud
A fully managed, serverless cloud data warehouse based on the open-source ClickHouse database.
DuckDB
An in-process SQL OLAP database management system.
QuestDB
An open-source time-series database for applications in financial services, IoT, and DevOps.
Materialize
A streaming database that computes and maintains materialized views on streaming data.
dbt Cloud
A cloud-based development framework for transforming data in the warehouse.
Kensu
A data observability platform that provides real-time insights into the health of your data pipelines.
Firebolt
A high-performance, cloud-native data warehouse designed for sub-second analytics at scale.
Ahana Cloud for Presto
A fully managed service for Presto on AWS, making it easy to run interactive SQL queries on your data lake.
Upsolver
A data lakehouse platform that simplifies the process of building and managing real-time data pipelines.
MinIO
An open-source, high-performance object storage server.
ClickHouse
An open-source, column-oriented database management system for online analytical processing (OLAP).
Apache Pinot
An open-source, distributed OLAP data store designed to provide real-time analytics at scale.
Firebolt
A cloud data warehouse designed for high-performance, sub-second analytics at scale.
Rockset
A real-time analytics database for building data-intensive applications.
Atlan
An active metadata platform that helps data teams collaborate, govern, and discover data.
Immuta
A data security platform that provides automated data access control and privacy protection.
Snowflake
A cloud-native data platform that provides a single, integrated platform for data warehousing, data lakes, data engineering, and data science.
Yellowbrick Data Warehouse
A modern, elastic data warehouse for private, public, and hybrid clouds.
SingleStore
A distributed, relational database for both transactional and analytical workloads.
Exasol
A high-performance, in-memory, MPP database for analytics and data warehousing.
Amazon Web Services (AWS) Data Lake
A comprehensive set of services for building and managing a data lake on AWS.
Google Cloud Data Lake
A suite of Google Cloud services for building a scalable and secure data lake.
Snowflake Data Cloud
A cloud data platform that provides a data warehouse-as-a-service.
Snowflake Data Cloud
A cloud-native data platform for data warehousing, data lakes, data engineering, and data sharing.
Google Cloud BigLake
A storage engine that unifies data warehouses and lakes with fine-grained access control.
ChaosSearch
A data lake platform that turns your cloud object storage into a hot, searchable, SQL and AI-powered analytical database.
Amazon S3
An object storage service that offers industry-leading scalability, data availability, security, and performance.
Google Cloud Storage
A scalable, secure, and highly available object storage service from Google Cloud.
Snowflake
A cloud data platform that provides a data warehouse-as-a-service designed for the cloud.
Cloudian HyperStore
An on-premises, S3-compatible object storage platform.
Wasabi Cloud Storage
A simple, affordable, and fast S3-compatible object storage service.
Qumulo
A file data platform for storing and managing unstructured data at scale.
Monte Carlo
An end-to-end data observability platform that helps prevent and resolve data downtime.
Soda
A data observability platform that provides end-to-end data quality monitoring.
data.world
An enterprise data catalog platform that uses a knowledge graph to map and understand data.
Okera
A universal data authorization platform that simplifies data access governance.
Privacera
A data security and governance platform that provides fine-grained access control across multi-cloud environments.
Google BigQuery
A fully managed, serverless data warehouse that enables scalable analysis over petabytes of data.
Databricks
A unified data analytics platform that combines data warehousing and data lakes into a lakehouse architecture.
Starburst Enterprise
An enterprise-grade distribution of the open-source Trino (formerly PrestoSQL) query engine for data lakehouse analytics.
Microsoft Azure Data Lake Storage
A cloud-based data lake solution for big data analytics.
Databricks Lakehouse Platform
A unified platform for data engineering, data science, and machine learning.
Azure Synapse Analytics
An integrated analytics service that accelerates time to insight across data warehouses and big data systems.
Starburst Enterprise
A fully managed and enterprise-grade data lake analytics platform built on open source Trino.
Azure Data Lake Storage
A highly scalable and secure data lake for high-performance analytics workloads.
Databricks
A unified data and AI platform for data engineering, data science, and machine learning.
Starburst
A fully managed data lake analytics platform built on open-source Trino.
Dell PowerScale
A scale-out NAS platform for storing and managing unstructured data.
Google Cloud BigQuery
A fully managed, serverless data warehouse that enables super-fast SQL queries using the processing power of Google's infrastructure.
Starburst
An enterprise-grade distribution of Trino (formerly PrestoSQL) with added features for security, connectivity, and manageability.
Databricks SQL
A serverless data warehouse built on the Databricks Lakehouse Platform, providing a SQL-native experience for BI and analytics.
Databricks
A unified data and AI platform that supports data mesh architectures through its Lakehouse Platform.
Google Cloud Dataplex
A service that unifies distributed data to automate data management and governance for analytics at scale.
Microsoft Azure Synapse Analytics
A unified analytics platform that brings together data integration, enterprise data warehousing, and big data analytics.
Oracle Autonomous Data Warehouse
A fully managed cloud data warehouse service that uses machine learning to automate database tuning, security, backups, updates, and other routine management tasks.
Vertica
A unified analytics platform that supports SQL, Python, R, and more, for data warehousing and machine learning at scale.
Dremio
An open data lakehouse platform that enables high-performance BI and analytics directly on data lake storage.
Greenplum
An open-source, massively parallel processing (MPP) data warehouse based on PostgreSQL.
Dremio
A data lakehouse platform that enables fast and easy analytics on data lake storage.
Amazon Redshift
A fully managed, petabyte-scale data warehouse service in the cloud.
Dremio
A data lakehouse platform that provides fast and easy analytics on your data lake storage.
Vertica Unified Analytics Platform
A unified analytics platform for data warehousing and machine learning at scale.
Dremio
A data lakehouse platform that enables high-performance BI and analytics directly on data lake storage.
Vertica
A unified analytics platform for data warehousing and data lakes.
Apache Hadoop (HDFS)
An open-source framework for distributed storage and processing of large datasets.
NetApp StorageGRID
A software-defined object storage solution for unstructured data.
Trino
A high-performance, distributed SQL query engine for big data analytics on large data sets.
Apache Druid
A real-time analytics database designed for fast slice-and-dice analytics on large data sets.
Apache Kylin
An open-source, distributed analytical data warehouse for big data.
Amazon Athena
An interactive query service that makes it easy to analyze data in Amazon S3 using standard SQL.
Denodo
A data virtualization platform that provides a unified view of data from disparate sources.
Starburst Galaxy
A fully managed data lake analytics platform for running fast queries on data anywhere.
Denodo Platform
A data virtualization platform that enables a logical data mesh by creating a unified view of disparate data sources.
Microsoft Purview
A unified data governance service that helps you manage and govern your on-premises, multicloud, and SaaS data.
Alation
A data intelligence platform that combines a data catalog with data governance, analytics stewardship, and data literacy.
Dremio
A data lakehouse platform that provides fast query performance and self-service analytics on data lake storage.
Teradata Vantage
A multi-cloud data platform that unifies data warehouses, data lakes, and analytics into a single ecosystem.
SAP Datasphere
A comprehensive data service built on the SAP Business Technology Platform (BTP) that enables a business data fabric.
Oracle Cloud Infrastructure (OCI) Data Lake
A suite of OCI services for building and managing a data lake.
Teradata Vantage
A multi-cloud data platform that unifies data warehouses, data lakes, and analytics.
Teradata VantageCloud Lake
A cloud-native lakehouse that unifies analytics, data, and teams.
IBM watsonx.data
A data store built on an open lakehouse architecture to scale AI and analytics workloads.
Oracle Big Data Lakehouse
An integrated data platform that combines the best of data lakes and data warehouses.
Actian Avalanche
A hybrid cloud data platform for data warehousing, analytics, and integration.
Informatica Intelligent Data Management Cloud
A comprehensive cloud-native data management platform.
Qubole
A cloud-native data platform for data engineering, data science, and machine learning.
Teradata VantageCloud
A multi-cloud data platform that unifies data warehouses, data lakes, and analytics.
IBM Cloud Object Storage
A highly scalable and durable object storage service on the IBM Cloud.
Dremio
A SQL lakehouse platform that enables high-performance BI and analytics directly on data lake storage.
Presto
An open-source distributed SQL query engine for running interactive analytic queries against data sources of all sizes.
AWS Lake Formation
A service that makes it easy to set up a secure data lake in days.
Informatica Intelligent Data Management Cloud
An AI-powered, cloud-native platform for data integration, quality, and governance.
Talend Data Fabric
A unified platform that simplifies all aspects of working with data for insights.
SAP Datasphere
A comprehensive data service that enables every data professional to deliver seamless and scalable access to mission-critical business data.
Actian Avalanche
A fully managed hybrid cloud data platform for high-performance analytics.
Cloudera Data Platform
A hybrid data platform that provides a unified suite of analytical and data management services from the Edge to AI.
Cloudera Data Platform (CDP)
A hybrid data platform that enables analytics and machine learning across on-premises and cloud environments.
Cloudera Data Platform (CDP)
A hybrid data platform for the entire data lifecycle, from the Edge to AI.
Cloudera Data Platform (CDP)
A hybrid data platform that enables you to manage and secure the entire data lifecycle.
Oracle Cloud Infrastructure (OCI) Data Lake
A comprehensive data lake solution on Oracle Cloud Infrastructure.
Apache Impala
An open-source, massively parallel processing (MPP) SQL query engine for data stored in a computer cluster running Apache Hadoop.
IBM Db2 Big SQL
An enterprise-grade, hybrid ANSI-compliant SQL-on-Hadoop engine for advanced data query.
Qubole
A cloud-native data platform for machine learning, streaming, and ad-hoc analytics.
Collibra Data Intelligence Platform
A data intelligence platform that provides data governance, catalog, and privacy capabilities.
Cloudera Data Platform (CDP)
A hybrid data platform that enables secure data management and portable analytics across multi-cloud and on-premises environments.
IBM Db2 Warehouse
A client-managed, private cloud data warehouse for Docker container-supported infrastructures.
IBM watsonx.data
A data store that enables you to scale analytics and AI with a data lakehouse.
Zaloni Arena
A data lake management platform for data governance, cataloging, and self-service data access.
Apache Drill
An open-source, schema-free SQL query engine for querying non-relational and relational data stores.
Onehouse
A managed data lakehouse service that automates the management of open source data infrastructure.
Apache Iceberg
An open table format for huge analytic datasets in data lakes.
Delta Lake
An open-source storage layer that brings reliability to data lakes.
Apache Hudi
An open-source data lake platform for stream processing on big data.
Apache Doris
An open-source, real-time analytical database based on MPP architecture.
NextData
A data products platform designed to build, share, and manage data products in a data mesh.