[Hpcresilience] [CFP] FTS'18 Workshop @CLUSTER'18, Belfast, UK (deadline extended: Jun 27th, 2018)

Bogdan Nicolae bogdan.nicolae at acm.org
Mon Jun 18 09:30:01 MDT 2018


Call for Papers: FTS'18: 4th International Workshop on Fault Tolerant Systems
September 10, 2018. Belfast, UK
(https://fts-workshop.github.io/fts18/)

Co-Located with IEEE CLUSTER 2018
(https://cluster2018.github.io/)

-------------------------------------------------------------------------------
IMPORTANT DATES

Paper abstract submission: Jun 21st, 2018
Full papers due: Jun 27th, 2018
Paper Acceptance Notification: Jul 20th, 2018
Camera-ready deadline: Jul 31st, 2018
Workshop: Sept 10th, 2018

-------------------------------------------------------------------------------
OVERVIEW

Fault tolerance is a cross-cutting issue that spans all layers of the hardware
and software stack, and hence requires coupled improvements in each layer and
co-design between the different layers. FTS aims to provide a venue for
researchers to share experiences across the hardware/software layers and for
attendees to get a holistic view of fault tolerance techniques, with a
particular focus on HPC and parallel computing.

-------------------------------------------------------------------------------
WORKSHOP SCOPE

We invite the submission of original work that is related to the topics below.
The papers can be either short (4 pages) position papers, or long (8 pages)
research papers.

Topics of interest include (in the context of fault tolerance):
- Techniques for predicting, detecting, and correcting silent data corruption
- Hardware failure prediction and recovery techniques
- Novel hardware design for fault mitigation
- Programming interfaces and libraries to facilitate resilience
- Resilient algorithm design for application level fault tolerance
- Failure modeling and analysis
- Checkpoint/restart model and replication model
- Fault injection techniques for improved tool development
- Hardware and software fault detection and reporting systems
- Fault tolerance for coupled executions, workflows, and in situ data analytics
- Algorithms for performance optimization in the presence of faults
- Techniques and algorithms for rollback recovery

-------------------------------------------------------------------------------
SUBMISSION INSTRUCTIONS

Camera-ready versions of accepted papers must comply with the IEEE Xplore
format for publication and must be at most 8 pages, including figures and
references. Authors of accepted papers may purchase up to 2 additional pages.
Submission implies the willingness of at least one of the authors to register
and present the paper.

Papers conforming to the above guidelines can be submitted through the
workshop's submission system:
https://easychair.org/conferences/?conf=fts18

-------------------------------------------------------------------------------
GENERAL CHAIRS

- Bogdan Nicolae, Argonne National Laboratory, USA
- Guillaume Aupy, INRIA, France

-------------------------------------------------------------------------------
PROGRAM COMMITTEE

- Aurelien Bouteiller, University of Tennessee Knoxville
- Wolfgang Frings, Juelich Supercomputing Centre
- Amina Guermouche, Telecom Sud-Paris
- Julien Bigot, CEA, Maison de la Simulation
- Esteban Meneses, Costa Rica Supercomputing Center
- Wesley Bland, Intel
- Thomas Ropars, Grenoble Alpes University
- Leonardo Bautista Gomez, Barcelona Supercomputing Center
- Fanny Dufossé, INRIA
- Franck Cappello, Argonne National Laboratory
- Dorian Arnold, Emory University