Data Warehouses & Lakes

Compare 119 data warehouses & lakes tools to find the right one for your needs

📂 Subcategories

🔧 Tools

Compare and find the best data warehouses & lakes for your needs

VAST Data

The Data Platform for the AI Era.

A data platform that unifies storage, database, and compute for AI and deep learning.

View tool details →

StarRocks

A high-performance analytical database.

An open-source, high-performance analytical database designed for real-time analytics.

View tool details →

ClickHouse Cloud

The fastest and most resource efficient database for real-time analytics

A fully managed, serverless cloud data warehouse based on the open-source ClickHouse database.

View tool details →

DuckDB

The SQLite for Analytics.

An in-process SQL OLAP database management system.

View tool details →

QuestDB

The fastest open source time-series database.

An open-source time-series database for applications in financial services, IoT, and DevOps.

View tool details →

Materialize

The Streaming Database.

A streaming database that computes and maintains materialized views on streaming data.

View tool details →

dbt Cloud

The fastest way to build and deploy trusted data products.

A cloud-based development framework for transforming data in the warehouse.

View tool details →

Kensu

The Data Observability Company.

A data observability platform that provides real-time insights into the health of your data pipelines.

View tool details →

Firebolt

The Cloud Data Warehouse for Builders of Next-Gen Analytics Experiences

A high-performance, cloud-native data warehouse designed for sub-second analytics at scale.

View tool details →

Ahana Cloud for Presto

The Easiest Way to Run Presto on AWS.

A fully managed service for Presto on AWS, making it easy to run interactive SQL queries on your data lake.

View tool details →

Upsolver

The Data Lakehouse Platform for Real-Time Analytics.

A data lakehouse platform that simplifies the process of building and managing real-time data pipelines.

View tool details →

MinIO

High-Performance, S3-Compatible Object Storage.

An open-source, high-performance object storage server.

View tool details →

ClickHouse

The fastest and most resource efficient open-source database for real-time apps and analytics.

An open-source, column-oriented database management system for online analytical processing (OLAP).

View tool details →

Apache Pinot

Realtime distributed OLAP datastore.

An open-source, distributed OLAP data store designed to provide real-time analytics at scale.

View tool details →

Firebolt

The cloud data warehouse for sub-second analytics.

A cloud data warehouse designed for high-performance, sub-second analytics at scale.

View tool details →

Rockset

The Real-time Analytics Database.

A real-time analytics database for building data-intensive applications.

View tool details →

Atlan

The Active Metadata Platform for Data and AI.

An active metadata platform that helps data teams collaborate, govern, and discover data.

View tool details →

Immuta

The Data Security Platform.

A data security platform that provides automated data access control and privacy protection.

View tool details →

Snowflake

The AI Data Cloud

A cloud-native data platform that provides a single, integrated platform for data warehousing, data lakes, data engineering, and data science.

View tool details →

Yellowbrick Data Warehouse

The Data Warehouse for Distributed Clouds

A modern, elastic data warehouse for private, public, and hybrid clouds.

View tool details →

SingleStore

The Real-Time AI Database

A distributed, relational database for both transactional and analytical workloads.

View tool details →

Exasol

The Analytics Database

A high-performance, in-memory, MPP database for analytics and data warehousing.

View tool details →

Amazon Web Services (AWS) Data Lake

A data lake solution that automatically configures the core AWS services necessary to easily tag, search, share, transform, analyze, and govern specific subsets of data across a company or with other

A comprehensive set of services for building and managing a data lake on AWS.

View tool details →

Google Cloud Data Lake

A complete platform for building and managing a data lake on Google Cloud.

A suite of Google Cloud services for building a scalable and secure data lake.

View tool details →

Snowflake Data Cloud

One platform for all your data, all your workloads, and all your users.

A cloud data platform that provides a data warehouse-as-a-service.

View tool details →

Snowflake Data Cloud

The Data Cloud.

A cloud-native data platform for data warehousing, data lakes, data engineering, and data sharing.

View tool details →

Google Cloud BigLake

Unify data warehouses and lakes.

A storage engine that unifies data warehouses and lakes with fine-grained access control.

View tool details →

ChaosSearch

The Data Lake Platform for Log and Security Analytics.

A data lake platform that turns your cloud object storage into a hot, searchable, SQL and AI-powered analytical database.

View tool details →

Amazon S3

Object storage built to retrieve any amount of data from anywhere

Amazon S3 is an object storage service that offers industry-leading scalability, data availability, security, and performance.

View tool details →

Google Cloud Storage

Unified object storage for developers and enterprises.

A scalable, secure, and highly available object storage service from Google Cloud.

View tool details →

Snowflake

The Data Cloud.

A cloud data platform that provides a data warehouse-as-a-service designed for the cloud.

View tool details →

Cloudian HyperStore

Enterprise Object Storage.

An on-premises, S3-compatible object storage platform.

View tool details →

Wasabi Cloud Storage

Hot Cloud Storage.

A simple, affordable, and fast S3-compatible object storage service.

View tool details →

Qumulo

Unstructured Data Platform.

A file data platform for storing and managing unstructured data at scale.

View tool details →

Monte Carlo

The Data Observability Platform.

An end-to-end data observability platform that helps prevent and resolve data downtime.

View tool details →

Soda

Data quality and observability for the modern data stack.

A data observability platform that provides end-to-end data quality monitoring.

View tool details →

data.world

The Enterprise Data Catalog for Modern Data Work.

An enterprise data catalog platform that uses a knowledge graph to map and understand data.

View tool details →

Okera

Secure Data Access for All.

A universal data authorization platform that simplifies data access governance.

View tool details →

Privacera

Unified Data Security Governance.

A data security and governance platform that provides fine-grained access control across multi-cloud environments.

View tool details →

Google BigQuery

The AI data platform for all your data, everywhere

A fully-managed, serverless data warehouse that enables scalable analysis over petabytes of data.

View tool details →

Databricks

The Data and AI Company

A unified data analytics platform that combines data warehousing and data lakes into a lakehouse architecture.

View tool details →

Starburst Enterprise

The Data Lakehouse Analytics Platform

An enterprise-grade distribution of the open-source Trino (formerly PrestoSQL) query engine for data lakehouse analytics.

View tool details →

Microsoft Azure Data Lake Storage

A massively scalable and secure data lake for your high-performance analytics workloads.

A cloud-based data lake solution for big data analytics.

View tool details →

Databricks Lakehouse Platform

The only data lakehouse that unifies all your data, analytics and AI.

A unified platform for data engineering, data science, and machine learning.

View tool details →

Databricks Lakehouse Platform

The Data and AI Company.

A unified platform for data engineering, data science, and machine learning.

View tool details →

Azure Synapse Analytics

Limitless analytics service with unmatched time to insight.

An integrated analytics service that accelerates time to insight across data warehouses and big data systems.

View tool details →

Starburst Enterprise

The Analytics Engine for the Data Lakehouse.

A fully managed and enterprise-grade data lake analytics platform built on open source Trino.

View tool details →

Azure Data Lake Storage

Massively scalable and secure data lake for your high-performance analytics workloads.

A highly scalable and secure data lake for high-performance analytics workloads.

View tool details →

Databricks

The Data and AI Company.

A unified data and AI platform for data engineering, data science, and machine learning.

View tool details →

Starburst

The data lake analytics platform.

A fully managed data lake analytics platform built on open-source Trino.

View tool details →

Dell PowerScale

Unlock the potential of your unstructured data.

A scale-out NAS platform for storing and managing unstructured data.

View tool details →

Google Cloud BigQuery

A serverless, highly scalable, and cost-effective multi-cloud data warehouse designed for business agility.

A fully-managed, serverless data warehouse that enables super-fast SQL queries using the processing power of Google's infrastructure.

View tool details →

Starburst

The analytics engine for all your data.

An enterprise-grade distribution of Trino (formerly PrestoSQL) with added features for security, connectivity, and manageability.

View tool details →

Databricks SQL

Serverless data warehouse with the best price/performance.

A serverless data warehouse built on the Databricks Lakehouse Platform, providing a SQL-native experience for BI and analytics.

View tool details →

Databricks

The Data and AI Company.

A unified data and AI platform that supports data mesh architectures through its Lakehouse Platform.

View tool details →

Google Cloud Dataplex

An intelligent data fabric for unified data management and governance.

A service that unifies distributed data to automate data management and governance for analytics at scale.

View tool details →

Microsoft Azure Synapse Analytics

Limitless analytics service with unmatched time to insight

A unified analytics platform that brings together data integration, enterprise data warehousing, and big data analytics.

View tool details →

Oracle Autonomous Data Warehouse

The world's first and only autonomous database

A fully managed cloud data warehouse service that uses machine learning to automate database tuning, security, backups, updates, and other routine management tasks.

View tool details →

Vertica

The Unified Analytics Platform

A unified analytics platform that supports SQL, Python, R, and more, for data warehousing and machine learning at scale.

View tool details →

Dremio

The Easy and Open Data Lakehouse

An open data lakehouse platform that enables high-performance BI and analytics directly on data lake storage.

View tool details →

Greenplum

The Open Source Data Warehouse

An open-source, massively parallel processing (MPP) data warehouse based on PostgreSQL.

View tool details →

Dremio

The easy and open data lakehouse.

A data lakehouse platform that enables fast and easy analytics on data lake storage.

View tool details →

Amazon Redshift

Fast, easy, and secure cloud data warehousing at any scale.

A fully managed, petabyte-scale data warehouse service in the cloud.

View tool details →

Dremio

The Easy and Open Data Lakehouse.

A data lakehouse platform that provides fast and easy analytics on your data lake storage.

View tool details →

Vertica Unified Analytics Platform

The Unified Analytics Platform.

A unified analytics platform for data warehousing and machine learning at scale.

View tool details →

Dremio

The easy and open data lakehouse.

A data lakehouse platform that enables high-performance BI and analytics directly on data lake storage.

View tool details →

Vertica

Unified Analytics. Deployed Anywhere.

A unified analytics platform for data warehousing and data lakes.

View tool details →

Apache Hadoop (HDFS)

A framework for distributed processing of large data sets.

An open-source framework for distributed storage and processing of large datasets.

View tool details →

NetApp StorageGRID

Intelligent object storage for the hybrid cloud.

A software-defined object storage solution for unstructured data.

View tool details →

Trino

Fast distributed SQL query engine for big data analytics.

A high-performance, distributed SQL query engine for big data analytics, enabling users to query large data sets.

View tool details →

Apache Druid

A high performance, real-time analytics database.

A real-time analytics database designed for fast slice-and-dice analytics on large data sets.

View tool details →

Apache Kylin

Extreme OLAP Engine for Big Data.

An open-source, distributed Analytical Data Warehouse for Big Data.

View tool details →

Amazon Athena

Query data in S3 using SQL.

An interactive query service that makes it easy to analyze data in Amazon S3 using standard SQL.

View tool details →

Denodo

The leader in data virtualization.

A data virtualization platform that provides a unified view of data from disparate sources.

View tool details →

Starburst Galaxy

The shortest path from raw data to actionable insight - open, interoperable, and AI-ready from day one.

A fully managed data lake analytics platform for running fast queries on data anywhere.

View tool details →

Denodo Platform

The leader in data virtualization, providing agile, high-performance data integration and data abstraction across the broadest range of enterprise, cloud, big data, and unstructured data sources.

A data virtualization platform that enables a logical data mesh by creating a unified view of disparate data sources.

View tool details →

Microsoft Purview

A unified data governance solution to maximize the business value of your data.

A unified data governance service that helps you manage and govern your on-premises, multicloud, and SaaS data.

View tool details →

Alation

The Data Intelligence Platform.

A data intelligence platform that combines a data catalog with data governance, analytics stewardship, and data literacy.

View tool details →

Dremio

The Easy and Open Data Lakehouse.

A data lakehouse platform that provides fast query performance and self-service analytics on data lake storage.

View tool details →

Amazon Redshift

Fast, easy, and widely used cloud data warehouse

A fully managed, petabyte-scale data warehouse service in the cloud.

View tool details →

Teradata Vantage

The connected multi-cloud data platform for enterprise analytics

A multi-cloud data platform that unifies data warehouses, data lakes, and analytics into a single ecosystem.

View tool details →

SAP Datasphere

The Business Data Fabric

A comprehensive data service built on the SAP Business Technology Platform (BTP) that enables a business data fabric.

View tool details →

Oracle Cloud Infrastructure (OCI) Data Lake

A complete, open, and enterprise-grade data lake platform.

A suite of OCI services for building and managing a data lake.

View tool details →

Teradata Vantage

The connected multi-cloud data platform for enterprise analytics.

A multi-cloud data platform that unifies data warehouses, data lakes, and analytics.

View tool details →

Teradata VantageCloud Lake

The connected multi-cloud data platform for enterprise analytics.

A cloud-native lakehouse that unifies analytics, data, and teams.

View tool details →

IBM watsonx.data

The open data lakehouse for AI and analytics workloads.

A data store built on an open lakehouse architecture to scale AI and analytics workloads.

View tool details →

Oracle Big Data Lakehouse

A comprehensive, integrated, and open data platform.

An integrated data platform that combines the best of data lakes and data warehouses.

View tool details →

Actian Avalanche

The Hybrid Data Platform.

A hybrid cloud data platform for data warehousing, analytics, and integration.

View tool details →

Informatica Intelligent Data Management Cloud

The Enterprise Cloud Data Management Leader.

A comprehensive cloud-native data management platform.

View tool details →

Qubole

The Open Data Lake Company.

A cloud-native data platform for data engineering, data science, and machine learning.

View tool details →

Teradata VantageCloud

The connected multi-cloud data platform for enterprise analytics.

A multi-cloud data platform that unifies data warehouses, data lakes, and analytics.

View tool details →

IBM Cloud Object Storage

Flexible, cost-effective, and scalable cloud storage.

A highly scalable and durable object storage service on the IBM Cloud.

View tool details →

Dremio

The easy and open data lakehouse.

A SQL lakehouse platform that enables high-performance BI and analytics directly on data lake storage.

View tool details →

Presto

The fast and reliable SQL engine for all your data.

An open-source distributed SQL query engine for running interactive analytic queries against data sources of all sizes.

View tool details →

AWS Lake Formation

Build, secure, and manage data lakes in days.

A service that makes it easy to set up a secure data lake in days.

View tool details →

Informatica Intelligent Data Management Cloud

Your end-to-end platform for data management.

An AI-powered, cloud-native platform for data integration, quality, and governance.

View tool details →

Talend Data Fabric

A single suite of apps for data integration and integrity.

A unified platform that simplifies all aspects of working with data for insights.

View tool details →

SAP Datasphere

The business data fabric.

A comprehensive data service that enables every data professional to deliver seamless and scalable access to mission-critical business data.

View tool details →

Actian Avalanche

The Hybrid Data Platform

A fully managed hybrid cloud data platform for high-performance analytics.

View tool details →

Cloudera Data Platform

The Hybrid Data Platform

A hybrid data platform that provides a unified suite of analytical and data management services from the Edge to AI.

View tool details →

Cloudera Data Platform (CDP)

The hybrid data platform for analytics and machine learning anywhere.

A hybrid data platform that enables analytics and machine learning across on-premises and cloud environments.

View tool details →

Cloudera Data Platform (CDP)

The Hybrid Data Company.

A hybrid data platform for the entire data lifecycle, from the Edge to AI.

View tool details →

Cloudera Data Platform (CDP)

The hybrid data company.

A hybrid data platform that enables you to manage and secure the entire data lifecycle.

View tool details →

Oracle Cloud Infrastructure (OCI) Data Lake

Build a modern data platform for all your enterprise data.

A comprehensive data lake solution on Oracle Cloud Infrastructure.

View tool details →

Apache Impala

High-performance SQL for data in Apache Hadoop.

An open-source, massively parallel processing (MPP) SQL query engine for data stored in a computer cluster running Apache Hadoop.

View tool details →

IBM Db2 Big SQL

A hybrid SQL-on-Hadoop engine.

An enterprise-grade, hybrid ANSI-compliant SQL-on-Hadoop engine for advanced data query.

View tool details →

Qubole

The Open Data Lake Platform.

A cloud-native data platform for machine learning, streaming, and ad-hoc analytics.

View tool details →

Collibra Data Intelligence Platform

Accelerate trusted business outcomes by connecting the right data, insights, and algorithms to all Data Citizens.

A data intelligence platform that provides data governance, catalog, and privacy capabilities.

View tool details →

Cloudera Data Platform (CDP)

The hybrid data company.

A hybrid data platform that enables secure data management and portable analytics across multi-cloud and on-premises environments.

View tool details →

IBM Db2 Warehouse

The AI-ready data warehouse for hybrid multicloud

A client-managed, private cloud data warehouse for Docker container-supported infrastructures.

View tool details →

IBM watsonx.data

A fit-for-purpose data store, built on an open data lakehouse architecture.

A data store that enables you to scale analytics and AI with a data lakehouse.

View tool details →

Zaloni Arena

The Data Lake Company.

A data lake management platform for data governance, cataloging, and self-service data access.

View tool details →

Apache Drill

Schema-free SQL Query Engine for Hadoop, NoSQL and Cloud Storage.

An open-source, schema-free SQL query engine for querying non-relational and relational data stores.

View tool details →

Onehouse

The Managed Data Lakehouse.

A managed data lakehouse service that automates the management of open source data infrastructure.

View tool details →

Apache Iceberg

The open table format for huge analytic datasets.

An open table format for huge analytic datasets in data lakes.

View tool details →

Delta Lake

An open-source storage framework that enables building a Lakehouse architecture.

An open-source storage layer that brings reliability to data lakes.

View tool details →

Apache Hudi

The Data Lake Platform.

An open-source data lake platform for stream processing on big data.

View tool details →

Apache Doris

A new-generation real-time data warehouse.

An open-source, real-time analytical database based on MPP architecture.

View tool details →

NextData

The Data Mesh Company.

A data products platform designed to build, share, and manage data products in a data mesh.

View tool details →