High performance computing systems are growing more powerful by using more components. As the system mean time before failure correspondingly drops, applications must checkpoint frequently to make progress. However, at scale, the cost in time and bandwidth of checkpointing to a parallel file system becomes prohibitive. A solution to this problem is multilevel checkpointing, which allows applications to take both frequent inexpensive checkpoints and less frequent, more resilient checkpoints, resulting in better efficiency and reduced load on the parallel file system.
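
To make the idea concrete, here is a minimal sketch of a two-level checkpoint schedule. It is an illustration only, not the API of any particular checkpointing library: the function names, the level labels `"local"` and `"pfs"`, and the intervals of 10 and 100 steps are all assumptions chosen for the example.

```python
# Hypothetical two-level schedule: cheap node-local checkpoints are taken
# often, and expensive parallel-file-system (PFS) checkpoints are taken rarely.
# Names and intervals are illustrative assumptions, not from a real library.

def checkpoint_level(step, local_every=10, pfs_every=100):
    """Return "pfs", "local", or None for the given time step."""
    if step <= 0:
        return None
    if step % pfs_every == 0:
        # The more resilient (and more costly) level takes priority.
        return "pfs"
    if step % local_every == 0:
        return "local"
    return None


def count_checkpoints(total_steps, local_every=10, pfs_every=100):
    """Count how many checkpoints of each level a run would take."""
    counts = {"local": 0, "pfs": 0}
    for step in range(1, total_steps + 1):
        level = checkpoint_level(step, local_every, pfs_every)
        if level is not None:
            counts[level] += 1
    return counts


if __name__ == "__main__":
    print(count_checkpoints(1000))
```

With these assumed intervals, a 1000-step run still checkpoints every 10 steps, but only one checkpoint in ten reaches the parallel file system, which is where the reduced load comes from.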