Cheap raw bytes vs fast structured queries — and the middle.
Data Lake vs Warehouse vs Lakehouse: cheap raw bytes vs fast structured queries
match the datastore to the access pattern
Warehouse SQL is faster; pay per query/credit.
Data lakes hold raw data in object storage cheaply; warehouses hold structured, columnar data for fast SQL. Lakehouse architectures (Iceberg, Delta) store warehouse-like tables on lake storage.
Warehouse: Snowflake, BigQuery, Redshift — fast SQL, expensive compute.
Lake: S3 + Parquet — cheap storage, BYO compute (Spark, Trino).
Lakehouse: ACID tables on lake storage; bridges the gap.
Build a daily revenue report.