*banner
 

Incremental Checkpointing with Application to Distributed Discrete Event Simulation
Thomas Huining Feng, Edward A. Lee

Citation
Thomas Huining Feng, Edward A. Lee. "Incremental Checkpointing with Application to Distributed Discrete Event Simulation". Winter Simulation Conference (WSC 2006), December, 2006.

Abstract
Checkpointing is widely used in robust fault-tolerant applications. We present an efficient incremental checkpointing mechanism. It requires to record only the state changes and not the complete state. After the creation of a checkpoint, state changes are logged incrementally as records in memory, with which an application can spontaneously roll back later. This incrementalism allows us to implement checkpointing with high performance. Only small constant time is required for checkpoint creation and state recording. Rollback requires linear time in the number of recorded state changes, which is bounded by the number of state variables times the number of checkpoints. We implement a Java source transformer that automatically converts an existing application into a behavior-preserving one with checkpointing functionality. This transformation is application-independent and application-transparent. A wide range of applications can benefit from this technique. Currently, it has been used for distributed discrete event simulation using the Time Warp technique.

Electronic downloads

Citation formats  
  • HTML
    Thomas Huining Feng, Edward A. Lee. <a
    href="http://chess.eecs.berkeley.edu/pubs/388.html"
    >Incremental Checkpointing with Application to
    Distributed Discrete Event Simulation</a>, Winter
    Simulation Conference (WSC 2006), December, 2006.
  • Plain text
    Thomas Huining Feng, Edward A. Lee. "Incremental
    Checkpointing with Application to Distributed Discrete Event
    Simulation". Winter Simulation Conference (WSC 2006),
    December, 2006.
  • BibTeX
    @inproceedings{FengLee06_IncrementalCheckpointingWithApplicationToDistributed,
        author = {Thomas Huining Feng and Edward A. Lee},
        title = {Incremental Checkpointing with Application to
                  Distributed Discrete Event Simulation},
        booktitle = {Winter Simulation Conference (WSC 2006)},
        month = {December},
        year = {2006},
        abstract = {Checkpointing is widely used in robust
                  fault-tolerant applications. We present an
                  efficient incremental checkpointing mechanism. It
                  requires to record only the state changes and not
                  the complete state. After the creation of a
                  checkpoint, state changes are logged incrementally
                  as records in memory, with which an application
                  can spontaneously roll back later. This
                  incrementalism allows us to implement
                  checkpointing with high performance. Only small
                  constant time is required for checkpoint creation
                  and state recording. Rollback requires linear time
                  in the number of recorded state changes, which is
                  bounded by the number of state variables times the
                  number of checkpoints. We implement a Java source
                  transformer that automatically converts an
                  existing application into a behavior-preserving
                  one with checkpointing functionality. This
                  transformation is application-independent and
                  application-transparent. A wide range of
                  applications can benefit from this technique.
                  Currently, it has been used for distributed
                  discrete event simulation using the Time Warp
                  technique.},
        URL = {http://chess.eecs.berkeley.edu/pubs/388.html}
    }
    

Posted by Thomas Huining Feng on 19 Feb 2008.
Groups: ptolemy
For additional information, see the Publications FAQ or contact webmaster at chess eecs berkeley edu.

Notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright.

©2002-2018 Chess