Prevalence of hypertensive diseases and treated hypertensive patients in Japan: A nationwide administrative claims database study

T Waki, K Miura, S Tanaka-Mizuno, Y Ohya… - Hypertension …, 2022 - nature.com
We investigated the prevalence of hypertensive patients and treated hypertensive patients
using a Japanese nationwide administrative claims database. We analyzed national …

Krypton: Real-Time Serving and Analytical SQL Engine at ByteDance

J Chen, R Shi, H Chen, L Zhang, R Li, W Ding… - Proceedings of the …, 2023 - dl.acm.org
In recent years, at ByteDance, we have started seeing more and more business scenarios
that require performing real-time data serving besides complex Ad Hoc analysis over large …

Physical Database Design for Manufacturing Business Analytics

N Nishikawa, S Fujiwara, Y Hayamizu… - … Conference on Big …, 2023 - ieeexplore.ieee.org
The manufacturing business field is accommodating a massive number of networked
sensors, which are offering a new horizon of microscopic observability; every single piece of …

Mirage: Generating Enormous Databases for Complex Workloads

Q Wang, H Li, Z Hu, R Zhang, C Yang… - 2024 IEEE 40th …, 2024 - ieeexplore.ieee.org
To optimize query parallelism techniques, substantial workloads are required with specific
query plans and customized output size for each operator (denoted as cardinality …

LakeHarbor: Making Structures First-Class Citizens in Data Lakes

H Yamada, M Kitsuregawa… - 2024 IEEE 40th …, 2024 - ieeexplore.ieee.org
This paper introduces LakeHarbor, a new data management paradigm that makes structures
(eg, indexes) first-class citizens in data lakes. The LakeHarbor paradigm enables a data …

Nested Loops Revisited Again

H Yamada, K Goda… - 2023 IEEE 39th …, 2023 - ieeexplore.ieee.org
Hash joins and sort-merge joins have been considered the algorithms of choice for
analytical relational queries in most parallel database systems because of their performance …

Powerful Analytics Platform for National-Scale Database of Health Care Insurance Claims

K Goda, M Kitsuregawa - Epidemiologic Research on Real-World Medical …, 2022 - Springer
Japan has been continuously building a national-scale insurance claims database by
collecting all the insurance claims data from all the public health care insurers since 2009 …

Dynamic Fault Tolerance for Multi-Node Query Processing

Y Bessho, Y Hayamizu, K Goda… - … on Information and …, 2022 - search.ieice.org
Parallel processing is a typical approach to answer analytical queries on large database. As
the size of the database increases, we often try to increase the parallelism by incorporating …

Research on the Optimization of Supply Chain Information Sys-tems for Export Cross-border E-commerce

C Han, Z Liu - Proceedings of the 2023 4th International Conference …, 2023 - dl.acm.org
With its unparalleled supply chain advantages and comprehensive industry support, China's
export cross-border e-commerce industry has experienced significant and sustained growth …

[PDF][PDF] PyJedAI Parallelization with MPIRE

IA Kontonis, G Papadakis, K Nikoletos - 2023 - pergamos.lib.uoa.gr
Entity resolution is a critical task in various applications, but it faces quadratic complexity. To
make entity resolution scalable to large datasets, blocking is typically employed. Syntactic …