Vol.10, No.2, May 2021. ISSN: 2217-8309 eISSN: 2217-8333
TEM Journal
TECHNOLOGY, EDUCATION, MANAGEMENT, INFORMATICS Association for Information Communication Technology Education and Science |
Analysis and Experimental Study of HDFS Performance
Yordan Kalmukov, Milko Marinov, Tsvetelina Mladenova, Irena Valova
© 2021 Yordan Kalmukov, published by UIKTEN. This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. (CC BY-NC-ND 4.0)
Citation Information: TEM Journal. Volume 10, Issue 2, Pages 806-814, ISSN 2217-8309, DOI: 10.18421/TEM102-38, May 2021.
Received: 24 January 2021. Revised: 18 March 2021.
Abstract:
In the age of big data, the amount of data that people generate and use on a daily basis has far exceeded the storage and processing capabilities of a single computer system. That motivates the use of distributed big data storage and processing system such as Hadoop. It provides a reliable, horizontallyscalable, fault-tolerant and efficient service, based on the Hadoop Distributed File System (HDFS) and MapReduce. The purpose of this research is to experimentally determine whether (and to what extent) the network communication speed, the file replication factor, the files’ sizes and their number, and the location of the HDFS client influence the performance of the HDFS read/write operations.
Keywords –HDFS, Distributed file systems, Distributed and parallel computing, Hadoop cluster. |
----------------------------------------------------------------------------------------------------------- ----------------------------------------------------------------------------------------------------------- |