Monday, 11 August 2025

Estimating the Number of Distinct Values

The usual way to get the exact number of distinct values:

    SELECT COUNT(DISTINCT customer_name) FROM table_name;

Using the estimation function, which returns an approximate count (HLL stands for HyperLogLog):

    SELECT HLL(customer_name) FROM table_name;


Benchmark on a table of 150,000,000 rows:

  •     Exact count with COUNT(DISTINCT ...) took 12 s
  •     Estimate with HLL took 5.6 s, roughly 2.1x faster
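
The speed-up comes at the cost of exactness, so it can be worth checking how far the estimate drifts on your own data. A minimal sketch, assuming a Snowflake-style engine where HLL is available as an aggregate function, and reusing the table_name and customer_name placeholders from above (the column aliases exact_count, estimated_count and error_pct are made up for illustration):

    -- Compute both counts in one pass, then derive the relative error.
    WITH counts AS (
        SELECT COUNT(DISTINCT customer_name) AS exact_count,
               HLL(customer_name)            AS estimated_count
        FROM table_name
    )
    SELECT exact_count,
           estimated_count,
           -- NULLIF avoids a division by zero on an empty table
           100.0 * ABS(estimated_count - exact_count)
                 / NULLIF(exact_count, 0) AS error_pct
    FROM counts;

If the error is acceptable for the use case (dashboards, rough cardinality checks), the estimate is usually the better trade; for billing or reconciliation, the exact count is likely still the safer choice.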


Data Sharing

  1. Create Share

         CREATE SHARE my_share;

  2. Grant privileges to share

         GRANT USAGE ON DATABASE my_db TO SHARE my_share;
         GRANT USAGE ...