Litke, W. and Budka, M., 2015. Scaling beyond one rack and sizing of Hadoop platform. Scalable Computing: Practice and Experience, 16 (4), 423-435.
Full text available as: PDF, Scaling_beyond_one_rack_and_sizing_of_Hadoop_platform.pdf - Accepted Version (359kB)
Copyright to original material in this document is with the original owner(s). Access to this content through BURO is granted on condition that you use it only for research, scholarly or other non-commercial purposes. If you wish to use it for any other purposes, you must contact BU via BURO@bournemouth.ac.uk. Any third party copyright material in this document remains the property of its respective owner(s). BU grants no licence for further use of that third party material.
Abstract
This paper focuses on two aspects of configuration choices for the Hadoop platform. First, we establish the performance implications of expanding an existing Hadoop cluster beyond a single rack. In the second part of the testing, we focus on performance differences when deploying clusters of different sizes. The study also examines the disk latency constraints found on the test cluster during our experiments and discusses their impact on overall performance. The testing approaches described in this work offer insight into the Hadoop environment for companies looking either to expand their existing Big Data analytics platform or to implement one for the first time.
Item Type: | Article |
---|---|
ISSN: | 1895-1767 |
Uncontrolled Keywords: | Hadoop, Big Data analytics, scalability, benchmarking, teragen, terasort, teravalidate, inodes, platform bottlenecks, disk latency |
Group: | Faculty of Science & Technology |
ID Code: | 22861 |
Deposited By: | Symplectic RT2 |
Deposited On: | 09 Nov 2015 11:47 |
Last Modified: | 14 Mar 2022 13:54 |