Comparisons to Other Products¶

Hologres V.S. Traditional Data Warehouse¶

Apache Hive¶

AWS Redshift¶

Snowflake¶

Hologres V.S. Real-time/Time-series Database¶

Druid, Clickhouse,¶

HBase, Cassandra¶

Kudu¶

Must be used with Impala
Point-lookup is slower than HBase and Cassandra, and analytics is slower than Druid and ClickHouse

Hologres V.S. Data Lake¶

Data Lake setups like Apache Iceberg, Apache Hudi, Delta Lake are mostly libraries plus a cloud distributed file system. They are poor man, open-source version of commercial data warehouses.

Given they are libraries maintaining data on top of commodity storage, Data Lakes have to be used with Spark or Presto.

Fits better with mini-batch processing engine like Spark, but not Flink No native streaming support Data latency depends on how frequently files are commited No point-lookup support They are libaraies not services, thus their capabilities are limited and operation cost is very high. E.g. No compaction to compact small files as their number grows No TTL support