Abstract
A major obstacle in implementing a rollback recovery scheme for fault tolerance in a concurrent distributed system is the domino effect. A low overhead checkpointing scheme is proposed to prevent this effect. Each process saves its state periodically. The state-save synchronization among processes is implemented by bounding clock drifts. A communication protocol that assures that all saved states are consistent is developed.
Original language | English (US) |
---|---|
Title of host publication | Proceedings - Symposium on Reliability in Distributed Software and Database Systems |
Editors | Anon |
Place of Publication | Piscataway, NJ, United States |
Publisher | Publ by IEEE |
Pages | 12-20 |
Number of pages | 9 |
State | Published - 1989 |
Externally published | Yes |
Event | Proceedings of the Eighth Symposium on Reliable Distributed Systems - Seattle, WA, USA Duration: Oct 10 1989 → Oct 12 1989 |
Other
Other | Proceedings of the Eighth Symposium on Reliable Distributed Systems |
---|---|
City | Seattle, WA, USA |
Period | 10/10/89 → 10/12/89 |
ASJC Scopus subject areas
- Software