Efficient discovery of periodic-frequent patterns in columnar temporal databases

P Ravikumar, P Likhitha, B Venus Vikranth Raj… - Electronics, 2021 - mdpi.com
Electronics, 2021mdpi.com
Discovering periodic-frequent patterns in temporal databases is a challenging problem of
great importance in many real-world applications. Though several algorithms were
described in the literature to tackle the problem of periodic-frequent pattern mining, most of
these algorithms use the traditional horizontal (or row) database layout, that is, either they
need to scan the database several times or do not allow asynchronous computation of
periodic-frequent patterns. As a result, this kind of database layout makes the algorithms for …
Discovering periodic-frequent patterns in temporal databases is a challenging problem of great importance in many real-world applications. Though several algorithms were described in the literature to tackle the problem of periodic-frequent pattern mining, most of these algorithms use the traditional horizontal (or row) database layout, that is, either they need to scan the database several times or do not allow asynchronous computation of periodic-frequent patterns. As a result, this kind of database layout makes the algorithms for discovering periodic-frequent patterns both time and memory inefficient. One cannot ignore the importance of mining the data stored in a vertical (or columnar) database layout. It is because real-world big data is widely stored in columnar database layout. With this motivation, this paper proposes an efficient algorithm, Periodic Frequent-Equivalence CLass Transformation (PF-ECLAT), to find periodic-frequent patterns in a columnar temporal database. Experimental results on sparse and dense real-world and synthetic databases demonstrate that PF-ECLAT is memory and runtime efficient and highly scalable. Finally, we demonstrate the usefulness of PF-ECLAT with two case studies. In the first case study, we have employed our algorithm to identify the geographical areas in which people were periodically exposed to harmful levels of air pollution in Japan. In the second case study, we have utilized our algorithm to discover the set of road segments in which congestion was regularly observed in a transportation network.
MDPI
以上显示的是最相近的搜索结果。 查看全部搜索结果