Exploiting ordered dictionaries to efficiently construct histograms with q-error guarantees in SAP HANA

G Moerkotte, D DeHaan, N May, A Nica… - Proceedings of the 2014 …, 2014 - dl.acm.org
Proceedings of the 2014 ACM SIGMOD international conference on Management of …, 2014dl.acm.org
Histograms that guarantee a maximum multiplicative error (q-error) for estimates may
significantly improve the plan quality of query optimizers. However, the construction time for
histograms with maximum q-error was too high for practical use cases. In this paper we
extend this concept with a threshold, ie, an estimate or true cardinality θ, below which we do
not care about the q-error because we still expect optimal plans. This allows us to develop
far more efficient construction algorithms for histograms with bounded error. The test for θ, q …
Histograms that guarantee a maximum multiplicative error (q-error) for estimates may significantly improve the plan quality of query optimizers. However, the construction time for histograms with maximum q-error was too high for practical use cases. In this paper we extend this concept with a threshold, i.e., an estimate or true cardinality θ, below which we do not care about the q-error because we still expect optimal plans. This allows us to develop far more efficient construction algorithms for histograms with bounded error. The test for θ, q-acceptability developed also exploits the order-preserving dictionary encoding of SAP HANA. We have integrated this family of histograms into SAP HANA, and we report on the construction time, histograms size, and estimation errors on real-world data sets. In virtually all cases the histograms can be constructed in far less than one second, requiring less than 5% of space compared to the original compressed data.
ACM Digital Library
以上显示的是最相近的搜索结果。 查看全部搜索结果