[PDF][PDF] 大数据管理: 概念, 技术与挑战

孟小峰, 慈祥 - 2013 - idke.ruc.edu.cn
大数据管理:概念,技术与挑战 Page 1 大数据管理:概念,技术与挑战 孟小峰慈祥 (中国人民大学信息
学院北京100872) Big Data Management: Concepts, Techniques and Challenges Meng …

The rise of “big data” on cloud computing: Review and open research issues

IAT Hashem, I Yaqoob, NB Anuar, S Mokhtar, A Gani… - Information systems, 2015 - Elsevier
Cloud computing is a powerful technology to perform massive-scale and complex
computing. It eliminates the need to maintain expensive computing hardware, dedicated …

[HTML][HTML] A survey on platforms for big data analytics

D Singh, CK Reddy - Journal of big data, 2015 - Springer
The primary purpose of this paper is to provide an in-depth analysis of different platforms
available for performing big data analytics. This paper surveys different hardware platforms …

[PDF][PDF] 云计算: 体系架构与关键技术

罗军舟, 金嘉晖, 宋爱波, 东方 - 通信学报, 2011 - fs.gongkong.com
系统地分析和总结云计算的研究现状, 划分云计算体系架构为核心服务, 服务管理,
用户访问接口等3 个层次. 围绕低成本, 高可靠, 高可用, 规模可伸缩等研究目标 …

[PDF][PDF] 大数据分析——RDBMS 与MapReduce 的竞争与共生

覃雄派, 王会举, 杜小勇, 王珊 - 软件学报, 2012 - jos.org.cn
在科学研究, 计算机仿真, 互联网应用, 电子商务等诸多应用领域, 数据量正在以极快的速度增长,
为了分析和利用这些庞大的数据资源, 必须依赖有效的数据分析技术. 传统的关系数据管理技术 …

A unified form of fuzzy C-means and K-means algorithms and its partitional implementation

ID Borlea, RE Precup, AB Borlea, D Iercan - Knowledge-Based Systems, 2021 - Elsevier
This paper proposes as an element of novelty the Unified Form (UF) clustering algorithm,
which treats Fuzzy C-Means (FCM) and K-Means (KM) algorithms as a single configurable …

A survey of data partitioning and sampling methods to support big data analysis

MS Mahmud, JZ Huang, S Salloum… - Big Data Mining and …, 2020 - ieeexplore.ieee.org
Computer clusters with the shared-nothing architecture are the major computing platforms
for big data processing and analysis. In cluster computing, data partitioning and sampling …

Spark sql: Relational data processing in spark

M Armbrust, RS Xin, C Lian, Y Huai, D Liu… - Proceedings of the …, 2015 - dl.acm.org
Spark SQL is a new module in Apache Spark that integrates relational processing with
Spark's functional programming API. Built on our experience with Shark, Spark SQL lets …

Samza: stateful scalable stream processing at LinkedIn

SA Noghabi, K Paramasivam, Y Pan… - Proceedings of the …, 2017 - dl.acm.org
Distributed stream processing systems need to support stateful processing, recover quickly
from failures to resume such processing, and reprocess an entire data stream quickly. We …

Apache hadoop yarn: Yet another resource negotiator

VK Vavilapalli, AC Murthy, C Douglas… - Proceedings of the 4th …, 2013 - dl.acm.org
The initial design of Apache Hadoop [1] was tightly focused on running massive,
MapReduce jobs to process a web crawl. For increasingly diverse companies, Hadoop has …