RCE - Super Computers show

Summary: Supercomputers, high-performance computing (HPC), and engineering: all parts driving technology development for the future of the world.

Podcasts:

 RCE 85: FraunhoferFS (FhGFS) | File Type: audio/mpeg | Duration: 42:26

FraunhoferFS (FhGFS) is the high-performance parallel file system from the Fraunhofer Competence Center for High Performance Computing. Its distributed metadata architecture is designed to provide the scalability and flexibility required to run today's most demanding HPC applications.

 RCE 84: Scalable Checkpoint/Restart | File Type: audio/mpeg | Duration: 46:13

https://computation-rnd.llnl.gov/scr/ Multilevel checkpointing allows applications to take both frequent inexpensive checkpoints and less frequent, more resilient checkpoints, resulting in better efficiency and reduced load on the parallel file system. The slowest but most resilient level writes to the parallel file system, which can withstand an entire system failure. Faster checkpointing for the most common failure modes uses node-local storage, such as RAM, Flash, or disk, and applies cross-node redundancy schemes. Most failures only disable one or two nodes, and multinode failures often disable nodes in a predictable pattern. Thus, an application can usually recover from a less resilient checkpoint level, given well-chosen redundancy schemes.
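
The multilevel idea described above can be illustrated with a small sketch. This is not the SCR API: the function names, the level labels (`local`, `xor`, `pfs`), and the checkpoint intervals are all hypothetical, and XOR parity is used here only as one simple example of a cross-node redundancy scheme. It assumes every node's checkpoint block has the same length.

```python
from functools import reduce


def xor_bytes(a: bytes, b: bytes) -> bytes:
    """Byte-wise XOR of two equal-length blocks."""
    return bytes(x ^ y for x, y in zip(a, b))


def make_parity(blocks: list[bytes]) -> bytes:
    """Parity block over all nodes' checkpoints (hypothetical helper)."""
    return reduce(xor_bytes, blocks)


def recover(blocks: list[bytes], parity: bytes, lost: int) -> bytes:
    """Rebuild the checkpoint of a single failed node from the survivors."""
    survivors = [b for i, b in enumerate(blocks) if i != lost]
    return reduce(xor_bytes, survivors, parity)


def checkpoint_level(step: int, xor_every: int = 10, pfs_every: int = 100) -> str:
    """Pick a level for this step; the intervals are illustrative only."""
    if step % pfs_every == 0:
        return "pfs"    # slowest, survives a whole-system failure
    if step % xor_every == 0:
        return "xor"    # node-local plus cross-node parity
    return "local"      # cheapest, node-local storage only


# Three nodes each hold a node-local checkpoint; node 1 then fails.
node_data = [b"aaaa", b"bbbb", b"cccc"]
parity = make_parity(node_data)
restored = recover(node_data, parity, lost=1)
```

In this sketch a single-node failure is repaired from the parity block without touching the parallel file system, which is the case the description says is most common. Losing two nodes at once would defeat this simple parity and force a fall back to the `pfs` level.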

 RCE 83: Orio | File Type: audio/mpeg | Duration: 48:26

http://brnorris03.github.io/Orio/ Orio is an open-source, extensible framework for defining domain-specific languages and generating optimized code (C, Fortran, CUDA, OpenCL) for multiple architecture targets (e.g., CPUs, NVIDIA and AMD GPUs, Intel Phi), including support for empirical autotuning of the generated code.
