Monday, March 4 • 8:30am - 12:00pm
Workshop: Best Practices in Supercomputing Systems Management

8:30 – 8:40 Keith Gray (BP) Welcome and Introduction
8:40 – 9:00 Rosemary Francis (Ellexus) Benchmarking challenges
9:00 – 9:20 Carlos Rosales-Fernandez (Intel) Optimize for both memory and compute using Roofline automation and SIMD analysis tools
9:20 – 9:40 Ron Cogswell & Alex Morris (Shell) Benchmarking Seismic Code on AMD & Intel CPUs
9:40 – 10:00 Dave McMillan (Cray) Monitoring – A View into our Systems
10:30 – 10:50 Mike Townsley (Exxon) ExxonMobil’s new Spring Campus HPC Data Center
10:50 – 11:20 Tommy Minyard (TACC) Frontera: challenges & liquid cooling in the data center
11:20 – 11:35 Tommy Minyard (TACC) Storage, NVDIMMs & Apache Pass
11:35 – 11:55 Erik Engquist (Rice) Efficient data transfers
11:55 – 12:00 David Baldwin (Shell) Close

This session will share best practices in supercomputing systems management. In the last few years, our industry has made great progress in improving the reliability of very large MPI jobs on our clusters, but accurately benchmarking our algorithms to squeeze every last bit of performance out of our systems remains a challenge. With a changing landscape for filesystems and evolving services supported by the cloud, the choices we make as HPC professionals become more difficult but no less critical.

We are organizing a workshop to share best practices in filesystem monitoring and management and in performance management, with a focus on cluster health as a driver of application performance. Experts and practitioners from industry, academia and national laboratories will present their experiences on these subjects and lead a discussion.

Monday March 4, 2019 8:30am - 12:00pm CST