Hardback

Fault-Tolerance Techniques for High-Performance Computing

$276.99

Add to wishlist

This title is printed to order. This book may have been self-published. If so, we cannot guarantee the quality of the content. In the main most books will have gone through the editing process however some may not. We therefore suggest that you be aware of this before ordering this book. If in doubt check either the author or publisher’s details as we are unable to accept any returns unless they are faulty. Please contact us if you have any questions.

This timely text presents a comprehensive overview of fault tolerance techniques for high-performance computing (HPC). The text opens with a detailed introduction to the concepts of checkpoint protocols and scheduling algorithms, prediction, replication, silent error detection and correction, together with some application-specific techniques such as ABFT. Emphasis is placed on analytical performance models. This is then followed by a review of general-purpose techniques, including several checkpoint and rollback recovery protocols. Relevant execution scenarios are also evaluated and compared through quantitative models. Features: provides a survey of resilience methods and performance models; examines the various sources for errors and faults in large-scale systems; reviews the spectrum of techniques that can be applied to design a fault-tolerant MPI; investigates different approaches to replication; discusses the challenge of energy consumption of fault-tolerance methods in extreme-scale systems.

In Shop

Out of stock

Shipping & Delivery

Available to order, ships in 1-2 weeks

$9.00 standard shipping within Australia
FREE standard shipping within Australia for orders over $100.00
Express & International shipping calculated at checkout

MORE INFO

Stock availability can be subject to change without notice. We recommend calling the shop or contacting our online team to check availability of low stock items. Please see our Shopping Online page for more details.

Format

Hardback

Publisher

Springer International Publishing AG

Country

Switzerland

Date

15 July 2015

Pages

320

ISBN

9783319209425

Format

Hardback

Publisher

Springer International Publishing AG

Country

Switzerland

Date

15 July 2015

Pages

320

ISBN

9783319209425

Looking for something in particular?

Search our extensive online catalogue.

Cart ()

Readings Recommends

Subtotal (excludes shipping)

Re-send account confirmation

Reset your password

Reset your password

Fault-Tolerance Techniques for High-Performance Computing

In Shop

Shipping & Delivery

Looking for something in particular?

Readings E-News