
Evaluating Scalable Distributed Erlang for Scalability and Reliability.

Chechina, N., Mackenzie, K., Thompson, S., Trinder, P., Boudeville, O., Fordos, V., Liang, H., Hoch, C., Ghaffari, A. and Hernandez, M.M., 2017. Evaluating Scalable Distributed Erlang for Scalability and Reliability. IEEE Transactions on Parallel and Distributed Systems, 28 (8), 2244-2257.

Full text available as:

PDF: evaluating-scalable-distributed-erlang.pdf - Accepted Version (1MB)
Restricted to Repository staff only
Available under License Creative Commons Attribution Non-commercial No Derivatives.

PDF (OPEN ACCESS ARTICLE): 07820204.pdf - Published Version (1MB)
Available under License Creative Commons Attribution.

DOI: 10.1109/TPDS.2017.2654246

Abstract

Large scale servers with hundreds of hosts and tens of thousands of cores are becoming common. To exploit these platforms software must be both scalable and reliable, and distributed actor languages like Erlang are a proven technology in this area. While distributed Erlang conceptually supports the engineering of large scale reliable systems, in practice it has some scalability limits that force developers to depart from the standard language mechanisms at scale. In earlier work we have explored these scalability limitations, and addressed them by providing a Scalable Distributed (SD) Erlang library that partitions the network of Erlang Virtual Machines (VMs) into scalable groups (s_groups). This paper presents the first systematic evaluation of SD Erlang s_groups and associated tools, and how they can be used. We present a comprehensive evaluation of the scalability and reliability of SD Erlang using three typical benchmarks and a case study. We demonstrate that s_groups improve the scalability of reliable and unreliable Erlang applications on up to 256 hosts (6,144 cores). We show that SD Erlang preserves the class-leading distributed Erlang reliability model, but scales far better than the standard model. We present a novel, systematic, and tool-supported approach for refactoring distributed Erlang applications into SD Erlang. We outline the new and improved monitoring, debugging and deployment tools for large scale SD Erlang applications. We demonstrate the scaling characteristics of key tools on systems comprising up to 10 K Erlang VMs.

Item Type: Article
ISSN: 1045-9219
Uncontrolled Keywords: Scalability; reliability; actors; Erlang
Group: Faculty of Science & Technology
ID Code: 30277
Deposited By: Symplectic RT2
Deposited On: 29 Jan 2018 12:30
Last Modified: 14 Mar 2022 14:09
