Talk at ETH Zurich: Katherine Yelick, Antisocial Parallelism: Avoiding, Hiding and Managing Communication
Katherine Yelick, University of California, Berkeley & Lawrence Berkeley National Laboratory, will give a presentation at ETH Zurich:
Antisocial Parallelism: Avoiding, Hiding and Managing Communication
Monday, October 13, 2014, 16:15 – 17:15, Location: CAB G 61
Abstract: Future computing system designs will be constrained by power density and total system energy, and will require new programming models and implementation strategies. Data movement in the memory system and interconnect will dominate running time and energy costs, making communication cost reduction the primary optimization criterion for compilers and programmers. Communication cost can be divided into latency costs, which are incurred per communication event, and bandwidth costs, which grow with total communication volume. Trends show growing gaps for both of these relative to computation, with the additional problem that communication congestion can conspire to worsen both in practice. In this talk I will discuss prior work and open problems in optimizing communication, starting with PGAS languages. This involves hiding communication cost through overlap and reducing communication frequency through caching and aggregation. Much of the compile-time work in this area was done in the Titanium language, where strong typing and data abstraction aid program analysis, while UPC compilers tend to rely on more dynamic optimizations. There are still many open questions in the design of languages and compilers, especially as communication hierarchies become deeper and more complex. Bandwidth reduction often requires more substantial algorithmic transformations, although some techniques, such as loop tiling, are well known. These can be applied as hand optimizations, through code generation strategies in autotuned libraries, or as fully automatic compiler transformations. Less obvious techniques for communication avoidance have arisen in the so-called “2.5D” parallel algorithms, which I will describe more generally as “.5D” algorithms. These ideas are applicable to many domains, from scientific computations to database operations. In addition to having provable optimality properties, these algorithms perform well on large-scale parallel machines. I will end by describing some recent work that lays the foundation for automating transformations to produce communication-optimal code for arbitrary loop nests.
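For context, the latency/bandwidth split described in the abstract is usually formalized as the standard alpha-beta cost model from the communication-avoiding algorithms literature. A minimal rendering, with the notation (alpha, beta, m, w) chosen here for illustration rather than taken from the talk:

    % Alpha-beta communication cost model (notation assumed here):
    % a phase that sends m messages totaling w words costs
    T_{\mathrm{comm}} = \alpha\, m + \beta\, w
    % \alpha: per-message latency cost, paid once per communication event
    % \beta:  per-word cost, which grows with total communication volume

Overlap and aggregation, as mentioned for the PGAS work, attack the alpha term; the bandwidth-avoiding algorithmic transformations discussed later attack the beta term.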
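Loop tiling, which the abstract cites as a well-known bandwidth-reduction technique, is easy to illustrate. Below is a minimal sketch in C, not code from the talk: a tiled matrix multiply on row-major n-by-n matrices, where the tile size TILE is an assumed tunable that should be chosen so a few tiles fit in cache, bounding the data moved between memory levels.

    #include <stddef.h>

    /* Illustrative tile size; in practice chosen (or autotuned) so
       that three TILE x TILE tiles fit in the target cache level. */
    #define TILE 64

    /* C += A * B for n-by-n row-major matrices, with the i/j/k loops
       tiled so each block of A, B, and C is reused from cache. */
    void matmul_tiled(size_t n, const double *A, const double *B, double *C)
    {
        for (size_t ii = 0; ii < n; ii += TILE)
            for (size_t kk = 0; kk < n; kk += TILE)
                for (size_t jj = 0; jj < n; jj += TILE)
                    /* One tile-tile product; the extra bounds handle
                       n not divisible by TILE. */
                    for (size_t i = ii; i < ii + TILE && i < n; i++)
                        for (size_t k = kk; k < kk + TILE && k < n; k++) {
                            double a = A[i * n + k];
                            for (size_t j = jj; j < jj + TILE && j < n; j++)
                                C[i * n + j] += a * B[k * n + j];
                        }
    }

The same tiling idea applies whether done by hand, by an autotuned library's code generator, or by a fully automatic compiler pass, as the abstract notes.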
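As one concrete instance of the “2.5D” idea, the published results for dense matrix multiplication (Solomonik and Demmel) trade c-fold replication of the matrices across p processors for reduced communication volume; the following summary reflects my reading of those results, not material from the talk itself.

    % 2.5D multiplication of n x n matrices on p processors, with each
    % matrix replicated c times (c = 1 recovers the usual 2D algorithms,
    % c = p^{1/3} the 3D ones): per-processor communication volume is
    W_{2.5\mathrm{D}} = O\!\left( \frac{n^2}{\sqrt{c\,p}} \right)
    % which matches, up to constant factors, the bandwidth lower bound
    W_{\min} = \Omega\!\left( \frac{n^3}{p\,\sqrt{M}} \right)
    % when the per-processor memory is M = c n^2 / p.

This is the sense in which these algorithms have provable optimality properties while also performing well on large-scale machines.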