The big data system, components, tools, and technologies: a survey

TR Rao, P Mitra, R Bhatt, A Goswami - Knowledge and Information …, 2019 - Springer
The traditional databases are not capable of handling unstructured data and high volumes
of real-time datasets. Diverse datasets are unstructured lead to big data, and it is laborious …

A large-scale analysis of hundreds of in-memory key-value cache clusters at twitter

J Yang, Y Yue, KV Rashmi - ACM Transactions on Storage (TOS), 2021 - dl.acm.org
Modern web services use in-memory caching extensively to increase throughput and reduce
latency. There have been several workload analyses of production systems that have fueled …

Elastic sketch: Adaptive and fast network-wide measurements

T Yang, J Jiang, P Liu, Q Huang, J Gong… - Proceedings of the …, 2018 - dl.acm.org
When network is undergoing problems such as congestion, scan attack, DDoS attack, etc.,
measurements are much more important than usual. In this case, traffic characteristics …

Tactical provenance analysis for endpoint detection and response systems

WU Hassan, A Bates, D Marino - 2020 IEEE Symposium on …, 2020 - ieeexplore.ieee.org
Endpoint Detection and Response (EDR) tools provide visibility into sophisticated intrusions
by matching system events against known adversarial behaviors. However, current solutions …

Netcache: Balancing key-value stores with fast in-network caching

X Jin, X Li, H Zhang, R Soulé, J Lee, N Foster… - Proceedings of the 26th …, 2017 - dl.acm.org
We present NetCache, a new key-value store architecture that leverages the power and
flexibility of new-generation programmable switches to handle queries on hot items and …

Shenango: Achieving high {CPU} efficiency for latency-sensitive datacenter workloads

A Ousterhout, J Fried, J Behrens, A Belay… - … USENIX Symposium on …, 2019 - usenix.org
Datacenter applications demand microsecond-scale tail latencies and high request rates
from operating systems, and most applications handle loads that have high variance over …

Datacenter {RPCs} can be general and fast

A Kalia, M Kaminsky, D Andersen - 16th USENIX Symposium on …, 2019 - usenix.org
It is commonly believed that datacenter networking software must sacrifice generality to
attain high performance. The popularity of specialized distributed systems designed …

Wisckey: Separating keys from values in ssd-conscious storage

L Lu, TS Pillai, H Gopalakrishnan… - ACM Transactions On …, 2017 - dl.acm.org
We present WiscKey, a persistent LSM-tree-based key-value store with a performance-
oriented data layout that separates keys from values to minimize I/O amplification. The …

Network requirements for resource disaggregation

PX Gao, A Narayan, S Karandikar, J Carreira… - … USENIX symposium on …, 2016 - usenix.org
Traditional datacenters are designed as a collection of servers, each of which tightly couples
the resources required for computing tasks. Recent industry trends suggest a paradigm shift …

Offloading distributed applications onto smartnics using ipipe

M Liu, T Cui, H Schuh, A Krishnamurthy… - Proceedings of the …, 2019 - dl.acm.org
Emerging Multicore SoC SmartNICs, enclosing rich computing resources (eg, a multicore
processor, onboard DRAM, accelerators, programmable DMA engines), hold the potential to …