An overview of data warehouse and data lake in modern enterprise data management

A Nambiar, D Mundra - Big data and cognitive computing, 2022 - mdpi.com
Data is the lifeblood of any organization. In today's world, organizations recognize the vital
role of data in modern business intelligence systems for making meaningful decisions and …

Milvus: A purpose-built vector data management system

J Wang, X Yi, R Guo, H Jin, P Xu, S Li, X Wang… - Proceedings of the …, 2021 - dl.acm.org
Recently, there has been a pressing need to manage high-dimensional vector data in data
science and AI applications. This trend is fueled by the proliferation of unstructured data and …

Big data management challenges in health research—a literature review

X Wang, C Williams, ZH Liu… - Briefings in …, 2019 - academic.oup.com
Big data management for information centralization (ie making data of interest findable) and
integration (ie making related data connectable) in health research is a defining challenge in …

Pocket: Elastic ephemeral storage for serverless analytics

A Klimovic, Y Wang, P Stuedi, A Trivedi… - … USENIX Symposium on …, 2018 - usenix.org
Serverless computing is becoming increasingly popular, enabling users to quickly launch
thousands of shortlived tasks in the cloud with high elasticity and fine-grain billing. These …

Amazon Redshift re-invented

N Armenatzoglou, S Basu, N Bhanoori, M Cai… - Proceedings of the …, 2022 - dl.acm.org
In 2013, AmazonWeb Services revolutionized the data warehousing industry by launching
Amazon Redshift, the first fully-managed, petabyte-scale, enterprise-grade cloud data …

Photon: A fast query engine for lakehouse systems

A Behm, S Palkar, U Agarwal, T Armstrong… - Proceedings of the …, 2022 - dl.acm.org
Many organizations are shifting to a data management paradigm called the" Lakehouse,"
which implements the functionality of structured data warehouses on top of unstructured …

Data warehouse systems

A Vaisman, E Zimányi - Data-Centric Systems and Applications, 2014 - Springer
Since the late 1970s, relational database technology has been adopted by most
organizations to store their essential data. However, nowadays, the needs of these …

Building an elastic query engine on disaggregated storage

M Vuppalapati, J Miron, R Agarwal, D Truong… - … USENIX Symposium on …, 2020 - usenix.org
We present operational experience running Snowflake, a cloud-based data warehousing
system with SQL support similar to state-of-the-art databases. Snowflake design is motivated …

Understanding data storage and ingestion for large-scale deep recommendation model training: Industrial product

M Zhao, N Agarwal, A Basant, B Gedik, S Pan… - Proceedings of the 49th …, 2022 - dl.acm.org
Datacenter-scale AI training clusters consisting of thousands of domain-specific accelerators
(DSA) are used to train increasingly-complex deep learning models. These clusters rely on a …

Viper: An efficient hybrid pmem-dram key-value store

L Benson, H Makait, T Rabl - 2021 - publishup.uni-potsdam.de
Key-value stores (KVSs) have found wide application in modern software systems. For
persistence, their data resides in slow secondary storage, which requires KVSs to employ …