An efficient distributed caching for accessing small files in HDFS

K Bok, H Oh, J Lim, Y Pae, H Choi, B Lee, J Yoo - Cluster Computing, 2017 - Springer
In this paper, we propose a distributed caching scheme to efficiently access small files in
Hadoop distributed file system. The proposed scheme reduces the volume of metadata to …

Enhancing HDFS with a full-text search system for massive small files

W Xu, X Zhao, B Lao, G Nong - The Journal of Supercomputing, 2021 - Springer
HDFS is a popular open-source system for scalable and reliable file management, which is
designed as a general-purpose solution for distributed file storage. While it works well for …

A network load sensitive block placement strategy of HDFS

L Meng, W Zhao, H Zhao, Y Ding - KSII Transactions on Internet …, 2015 - koreascience.kr
This paper investigates and analyzes the default block placement strategy of HDFS. HDFS is
a typical representative distributed file system to stream vast amount of data effectively at …

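Research on Building a Big Data Platform for Sci-Tech Literature Based on Distributed Technology

常志军, 钱力, 谢靖, 吴振新, 张鹄… - Data Analysis and …, 2021 - manu44.magtech.com.cn
[Objective] To address the storage and online access of massive article-level literature, large-scale data governance, and low service performance
by building a big data platform for sci-tech literature. [Methods] Building on distributed technology, analyze the characteristics of sci-tech big data and service orientation …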

Big Data Platform for Sci-Tech Literature Based on Distributed Technology

C Zhijun, Q Li, X Jing, W Zhenxin… - Data Analysis and …, 2021 - manu44.magtech.com.cn
[Objective] This research addresses the issues facing the storage and online access of
massive text-level documents, the governance of large-scale data, and the low service …