|
Semester Project: Study of Scalable Fault Tolerant MPI Implementations for Volatile Nodes
Several research efforts aim to provide MPI implementations featuring scalable, fault-tolerant protocols for volatile nodes (e.g., MPICH-V, Starfish, FT-MPI). We have teamed in groups and performed an in-depth literature study of the merits and capabilities of these approaches. We have compared and contrasted the most important approches and presented our findings in a presentation (up to 15 minutes): summing up the state of the art in the field, addressing issues such as how these different approches work (from the system point of view and from the application performance point of view), reporting reseach directions already underlined in the papers, and suggesting future reseach opportunities. We have used these papers for our projects. We would like to thank the authors of the papers for their terrific work. List of presentations:
Please feel free to send your comments at: mtaufer AT utep DOT edu. Thanks! |
Last Change: Mar 2005
Author: Taufer Michela