Research Computing and Engineering

RCE 12: BLCR
Written by Brock Palen   
Friday, 19 June 2009 16:34

Brock Palen and Jeff Squyres speak with Paul Hargrove of the Berkeley Lab Checkpoint/Restart (BLCR) project about checkpointing, restarting, and migrating HPC applications.

Notes: All library code is LGPL; the kernel module and the (small) user-space utilities are GPL. SLURM 2.0 also supports BLCR, which is not mentioned in the show.

Since 2000, Paul has been a Principal Investigator in the Future Technologies Group (FTG) at Lawrence Berkeley National Laboratory (LBNL). His general area of work can be described as systems software and runtime environments for High Performance Computing (HPC). His current research interests include Checkpoint/Restart, Partitioned Global Address Space (PGAS) languages, and high-performance cluster networks. Current projects include Berkeley Lab Checkpoint/Restart (BLCR) for Linux, Global Address Space Networking (GASNet), and Berkeley Unified Parallel C (UPC). Paul received his Ph.D. from Stanford University in 2003.

 
 