Abstract
Localizing Failures in Distributed Synchronization
by: Manhoi Choy and Ambuj K. Singh
Abstract:
We investigate the fault-tolerance of distributed algorithms in asynchronousmessage passing systems with undetectable process failures. We introducefailure locality as a measure of the fault-tolerance of distributedalgorithms. This measure captures the general idea of allowing part of asystem to continue to function despite the failure of other parts of thesystem. We concentrate on two problems, the dining philosophers problem andthe committee coordination problem, and present algorithms with an optimalfailure locality.
Keywords:
Concurrency, Distributed Algorithms, Fault-tolerance, Lower Bounds,Synchronization.
Date:
February 1993
Document: 1993-03