FlowVR
FlowVR is an open-source middleware based on the dataflow paradigm that augments parallel simulations running on thousands of cores with in situ processing capabilities and live steering.
Melissa
Melissa is an in situ solution for sensitivity analysis. It implements iterative algorithms to compute spatio-temporal statistic fields over the results of large-scale sensitivity studies. Melissa offers several benefits:
- Storage savings: no intermediate files, and a server-side memory requirement on the order of the output of one simulation run.
- Time savings: simulations run faster when sending data to the server than when writing their results to disk, and our one-pass algorithm does not need to read large amounts of data back from disk to compute the statistics.
- Ubiquitous: performance and scalability gains make it possible to compute multidimensional, time-varying statistics everywhere in space and time, instead of only for a limited sample of probes as is usually done.
- Adaptive: simulation groups can be defined, started, or interrupted online according to the behavior of past runs or the statistics already computed.
- Fault tolerance: Melissa relies on lightweight bookkeeping and a few heartbeats to detect issues and restart the server or the simulations, with limited loss of intermediate results.
- Elasticity: simulation groups are independent and connect dynamically to the parallel server when they start. They are submitted as independent jobs to the batch scheduler. Thus, the scheduler can adapt the resources allocated to the application during the execution.
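The one-pass idea behind the storage and time savings can be illustrated with a minimal sketch. It uses Welford's update, a well-known iterative scheme for mean and variance; whether Melissa uses this exact formula is an assumption, and the class and method names below are hypothetical, not part of Melissa's API. Each incoming field is folded into the running statistics and can then be discarded, so memory stays on the order of one field.

```python
class RunningFieldStats:
    """One-pass (Welford) mean and variance for each cell of a field.

    Illustrative only: not Melissa's actual implementation or API.
    """

    def __init__(self, n_cells):
        self.count = 0
        self.mean = [0.0] * n_cells
        self.m2 = [0.0] * n_cells  # running sum of squared deviations

    def update(self, field):
        """Fold the output of one simulation run into the statistics."""
        if len(field) != len(self.mean):
            raise ValueError("field size does not match")
        self.count += 1
        for i, x in enumerate(field):
            delta = x - self.mean[i]
            self.mean[i] += delta / self.count
            self.m2[i] += delta * (x - self.mean[i])

    def variance(self):
        """Unbiased sample variance per cell (needs at least two runs)."""
        if self.count < 2:
            raise ValueError("need at least two runs")
        return [m / (self.count - 1) for m in self.m2]


stats = RunningFieldStats(n_cells=3)
for field in ([1.0, 2.0, 3.0], [3.0, 2.0, 1.0], [2.0, 2.0, 5.0]):
    stats.update(field)  # the field is not needed after this call
```

Because no run is ever read back, the same update could be applied as results arrive from the simulations, in any order.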
OAR
OAR is a versatile resource and task manager (also called a batch scheduler) for HPC clusters and other computing infrastructures, such as distributed computing experimental testbeds where versatility is key (Grid’5000, IoT Lab FIT).
Kameleon
Kameleon is a simple but powerful tool for building software appliances. Kameleon builds start from scratch using scripted recipes and put an emphasis on reconstructibility.
Batsim
Batsim is a batch scheduler simulator. A batch scheduler, also known as a Resources and Jobs Management System (RJMS), is a system that manages resources in large-scale computing centers, notably by scheduling and placing jobs and by setting up energy policies. Batsim simulates the computing center’s behaviour. It is designed so that any event-based scheduling algorithm can be plugged into it, which makes it possible to compare decision algorithms from both production and academic settings.
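To make the notion of a pluggable, event-based scheduling algorithm concrete, here is a toy sketch. It is not Batsim's protocol or API: the `simulate` loop, the `fcfs` policy, and the job fields are all hypothetical names chosen for illustration. The simulator replays submission and completion events and asks the scheduler for a decision after each one, so a different policy can be swapped in by passing another function.

```python
import heapq
from collections import deque


def fcfs(queue, free_cores):
    """First-come-first-served: start queued jobs in order while they fit."""
    started = []
    while queue and queue[0]["cores"] <= free_cores:
        job = queue.popleft()
        free_cores -= job["cores"]
        started.append(job)
    return started


def simulate(jobs, total_cores, scheduler):
    """Replay submissions and completions, calling `scheduler` after each event."""
    events = [(j["submit"], i, "submit", j) for i, j in enumerate(jobs)]
    heapq.heapify(events)
    seq = len(events)  # unique tie-breaker so job dicts are never compared
    queue, free, start_times = deque(), total_cores, {}
    while events:
        now, _, kind, job = heapq.heappop(events)
        if kind == "submit":
            queue.append(job)
        else:  # "finish": the job releases its cores
            free += job["cores"]
        for started in scheduler(queue, free):
            free -= started["cores"]
            start_times[started["name"]] = now
            heapq.heappush(events, (now + started["runtime"], seq, "finish", started))
            seq += 1
    return start_times


jobs = [
    {"name": "A", "submit": 0, "cores": 4, "runtime": 10},
    {"name": "B", "submit": 1, "cores": 4, "runtime": 5},
    {"name": "C", "submit": 2, "cores": 2, "runtime": 3},
]
start_times = simulate(jobs, total_cores=6, scheduler=fcfs)
```

With strict first-come-first-served ordering, job C waits behind job B even though it would fit in the idle cores; a backfilling policy passed as `scheduler` would start it earlier, which is the kind of comparison such a simulator enables.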
All software developed by DataMove is released under Free/Libre Open-Source licences.