Approximate count distincts

Source: TimescaleDB (GitHub), 2022-12-25 22:20:27

Approximate count distincts are typically used to find the number of unique values, or cardinality, in a large dataset. When you calculate the exact cardinality of a dataset, the time it takes to process the query grows with the size of the dataset. Finding the cardinality of a dataset that contains only 20 entries is very fast. Finding the cardinality of a dataset that contains 20 million entries, however, can take a significant amount of time and compute resources. Approximate count distincts do not calculate the exact cardinality of a dataset. Instead, they estimate the number of unique values, which reduces memory consumption and improves compute time by avoiding spilling intermediate results to secondary storage.
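To make the trade-off concrete, here is a minimal Python sketch of one estimation technique, K-Minimum Values (KMV). It is an illustration only and is not taken from TimescaleDB; the names `hash01` and `approx_count_distinct` and the parameter `k` are hypothetical. The idea is to hash every value to a pseudo-uniform number in [0, 1) and keep only the `k` smallest distinct hashes. If the `k`-th smallest hash is `m`, the dataset holds roughly `(k - 1) / m` distinct values, so memory stays bounded by `k` no matter how large the input is.

```python
import hashlib
import heapq


def hash01(value):
    """Map a value to a pseudo-uniform float in [0, 1) (hypothetical helper)."""
    digest = hashlib.sha256(str(value).encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") / 2**64


def approx_count_distinct(values, k=1024):
    """Estimate the number of distinct values with a K-Minimum Values sketch.

    Only the k smallest distinct hashes are kept, so memory use is O(k).
    """
    heap = []        # negated hashes: heap[0] is the largest hash currently kept
    kept = set()     # hashes currently in the heap, used to skip duplicates
    for value in values:
        h = hash01(value)
        if h in kept:
            continue                      # value already represented
        if len(heap) < k:
            heapq.heappush(heap, -h)
            kept.add(h)
        elif h < -heap[0]:
            # New hash is smaller than the largest kept one: swap them.
            evicted = -heapq.heappushpop(heap, -h)
            kept.discard(evicted)
            kept.add(h)
    if len(heap) < k:
        return len(heap)                  # fewer than k distinct values: exact
    kth_smallest = -heap[0]
    return int((k - 1) / kth_smallest)


if __name__ == "__main__":
    # 200,000 rows drawn from 50,000 distinct values.
    data = [i % 50_000 for i in range(200_000)]
    print("exact:   ", len(set(data)))
    print("estimate:", approx_count_distinct(data))
```

The exact count has to remember every distinct value it has seen, while the sketch keeps at most `k` hashes. The price is a relative error on the order of `1 / sqrt(k)`, so a larger `k` trades memory for accuracy.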

Copyright of this content belongs to TimescaleDB or its affiliates. To follow or fund the content or its related open-source projects, please visit TimescaleDB.
