15th JLESC Workshop

Name: 15th JLESC Workshop
Start: 2023-03-21T12:00:00+01:00
End: 2023-03-23T22:00:00+01:00
Location: LaBRI

21–23 Mar 2023

LaBRI

Europe/Paris timezone

Contact

Session

Short Talks on Distributed Resources

ST Res

22 Mar 2023, 14:00

LaBRI Amphi (LaBRI)

LaBRI Amphi

LaBRI

There are no materials yet.

7. Towards an application-driven dynamic resource approach for HPC

Dominik Huber (Technical University of Munich (TUM))

22/03/2023, 14:00

Programming languages and runtimes

Short talk

This short talk provides an introduction to the ongoing research at UGA /TUM (EuroHPC Time-X) on an application-driven dynamic resource approach for HPC. Time-X targets the area of parallel-in-time (PinT) integration, where resource dynamic strategies have been shown to improve the performance and efficiency of PinT algorithms.

However, current approaches to enable dynamic resources for...

21. Study of the folding of distributed experiments containing a distributed file system

Quentin Guilloteau

22/03/2023, 14:10

I/O, storage and in-situ processing

Short talk

The development and evaluation of grid or cluster middlewares, such as batch schedulers, require to deploy numerous machines to reach an environment close to the full scale of the production system.

To avoid these huge deployments, one can consider folding the system on itself by deploying several "virtual" resources onto one physical resource.

In this study, we investigate the...

65. Dynamic resources in MPI

SERGIO ISERTE AGUT (Barcelona Supercomputing Center)

22/03/2023, 14:20

Programming languages and runtimes

Short talk

Process malleability and dynamic resources have demonstrated, in several studies, to increase the productivity of HPC facilities, in terms of completed jobs per unit of time. In this regard, changing the number of resources assigned to an application during its execution accelerates global job processing. Furthermore, the users of malleable applications can also benefit from malleability when...

33. Optimize heterogenous storage resources use on HPC systems with simulations

Julien Monniot (INRIA)

22/03/2023, 14:30

I/O, storage and in-situ processing

Short talk

Large-scale infrastructures are increasingly required to store and retrieve massive amounts of data in order to execute scientific applications at scale. The severe need for I/O performance is now often handled by new intermediate tiers of storage resources, deployed throughout HPC systems (node-local storage, burst-buffers, …) and backed by more and more specialized hardware (NVRAM, NVMe, …)....

67. Composition of Scheduling and Control-Theory Techniques

Raphael Bleuse (Univ. Grenoble Alpes, Inria)

22/03/2023, 14:40

Programming languages and runtimes

Short talk

The management and allocation of resources to users in HPC infrastructures often relies on the RJMS.
One key component for an optimized resource allocation, with respect to some objectives, is the scheduler.

Scheduling theory is interesting as it provides algorithms with performance guarantees.
These guarantees come at the cost of tedious and complex modeling effort.
The growing...

60. Seamless Heterogeneous Memory Management Via The EcoHMEM Methodology

HATEM ELSHAZLY (Barcelona Supercomputing Center (BSC))

22/03/2023, 14:50

Programming languages and runtimes

Short talk

New memory technologies are an emerging to provide larger RAM sizes at reasonable cost and energy consumption. In addition to the conventional DRAM, recent memory infrastructures contain byte-addressable persistent memory (PMEM) technology that offers capacities higher than DRAM and better access times than Nand-based technologies such as SSDs.

In such hybrid infrastuctures, users have...

63. Cloud-Bursting and Autoscaling for Python-Native Scientific and AI Workflows

Mr Tingkai Liu (University of Illinois at Urbana-Champaign)

22/03/2023, 15:00

Programming languages and runtimes

Short talk

We have extended the Ray framework to enable automatic scaling of workloads on high-performance computing (HPC) clusters managed by SLURM© and bursting to a Cloud managed by Kubernetes®. Our implementation allows a single Python-based parallel workload to be run concurrently across an HPC cluster and a Cloud. The Python-level abstraction provided by our solution offers a transparent user...

Building timetable...

15th JLESC Workshop

Contact

Session

Short Talks on Distributed Resources

LaBRI Amphi

LaBRI

Description

Presentation materials

Choose timezone

15th JLESC Workshop

Contact

Description

Presentation materials