Latest News – Page 2

Update for the squeue command

All,

Users are now allowed to use the squeue command on the devel nodes
through sudo, to display information about all jobs.

sudo squeue

squeue is the only command for which information about all users is
made available to everyone, as this information does not breach the
GDPR-related constraints.

Cheers,

The PlaFRIM team

Update on miriel and sirocco[01-05] nodes

Standard nodes miriel[001-088] and K40M nodes sirocco[01-05] are now provided on a best-effort basis, without support. Each node will be removed from the platform once it fails to start.

Changes for the information available through the squeue command

All,

We are currently bringing the platform into compliance with the GDPR;
in particular, this limits access to the slurm database.
The squeue tool is affected by this limitation: users only see
information about their own jobs when making queries (this is also the
case for all other slurm commands: scontrol, sacct...).

The sinfo command can still be used to get an overview of the platform
usage. The state of the nodes and a filtered output of the squeue
command are available on the platform web page
https://www.plafrim.fr/jobs-monitoring/

This page is currently being updated to report as much information as
possible to the users. We will keep working on it in the coming days and
weeks, to hopefully meet everyone's expectations.

Please note that this page is only available once you have signed in on
the web site (use the WP identifiers you obtained when your account was
created).
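As a rough sketch of the command-line overview mentioned above, the following snippet prints a per-partition summary with sinfo, followed by your own jobs with squeue. The function name platform_overview is ours, for illustration only, and the fallback message is there so that the snippet does nothing harmful outside the platform.

```shell
# Print an overview of the platform, then the current user's own jobs.
# platform_overview is a hypothetical helper name, not a PlaFRIM tool.
platform_overview() {
    if command -v sinfo >/dev/null 2>&1; then
        # --summarize: one line per partition, with node counts by state
        sinfo --summarize
        # Without sudo, squeue only reports your own jobs anyway;
        # -u makes that explicit.
        squeue -u "${USER:-$(id -un)}"
    else
        echo "Slurm commands not found: run this on a PlaFRIM node"
        return 1
    fi
}

platform_overview || true
```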

Cheers,

The PlaFRIM team

PlaFRIM discussion team on mattermost

Dear all,

A PlaFRIM discussion team is now available on the Inria mattermost
server. PlaFRIM users have received an email with the link to join.

This team is intended for discussions on any subject related to the use
of the platform. Channels can be created for specific needs. In any
case, here are some rules to follow:

- DO NOT USE any channel to report tickets. Sending an email to the
technical team at plafrim-support AT inria.fr is the only way to submit
tickets.

- Refrain from NON-SERIOUS conversations and from trolling.

If you do not have an Inria email address, an external account must be
created for you to access the mattermost server. Please send an email
to nathalie.furmento AT labri.fr if you need such an account.

Cheers,

The PlaFRIM technical team

PlaFRIM is back

The shutdown is now over and PlaFRIM is available again.

Closure of PlaFRIM from April 24 to 27, 2020

Because of the annual total electrical shutdown of the building, scheduled for 25 April, we have to shut down the PlaFRIM infrastructure from the afternoon of 24 April.
Normal operation is expected to resume on the morning of Monday 27 April.
Thank you for your understanding.

Degraded mode

The current health situation forces us to work in degraded mode. We are doing what is necessary to keep PlaFRIM operational as long as the building remains open.
User follow-up and support will probably be slower than usual.
Of course, we can still be reached via plafrim-support@inria.fr.

Thank you for your understanding.

New parallel storage space

A new parallel BeeGFS file system (see https://www.beegfs.io/content/) is now available on PlaFRIM. You will find a /beegfs/<LOGIN> directory on all PlaFRIM nodes.
This parallel file system replaces the /lustre storage, which is no longer under warranty and whose use is no longer recommended.

The data in this directory are not backed up. The quota is 1 TB per user.
If you need to read or write large volumes of data and/or do so in parallel, use this file system rather than your /home/<LOGIN> directory.
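As an illustration, here is a minimal sketch that reports how much space a directory occupies relative to the 1 TB quota. It relies on du, so it only approximates what the file system's own quota accounting would report. The function name beegfs_usage and the BEEGFS_DIR variable are ours, and treating 1 TB as 1024^3 KiB is an assumption.

```shell
# Directory to inspect; defaults to the per-user BeeGFS directory.
# BEEGFS_DIR is a hypothetical override, handy for trying the snippet elsewhere.
BEEGFS_DIR="${BEEGFS_DIR:-/beegfs/${USER:-$(id -un)}}"
# 1 TB quota expressed in KiB (assumption: binary units).
QUOTA_KB=$((1024 * 1024 * 1024))

# Print the space used by a directory and its share of the quota.
beegfs_usage() {
    dir="$1"
    if [ ! -d "$dir" ]; then
        echo "$dir not found: run this on a PlaFRIM node" >&2
        return 1
    fi
    # du -sk prints "<size in KiB><TAB><path>"; keep the size only.
    used_kb=$(du -sk "$dir" | cut -f1)
    echo "$dir: ${used_kb} KiB used, $((100 * used_kb / QUOTA_KB))% of quota"
}

beegfs_usage "$BEEGFS_DIR" || true
```

Since the data are not backed up, results worth keeping should be copied elsewhere once a job has finished.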

Information on all available storage spaces can be found in the FAQ on the PlaFRIM website.

Please do not hesitate to contact us if you need further information.

New AMD nodes available

New AMD compute nodes are now available (see the Hardware Documentation):

- diablo[01-04]: 2 * AMD EPYC 7452 32-Core Processor and 256 GB RAM
- diablo05: 2 * AMD EPYC 7702 64-Core Processor and 1 TB RAM

To reserve them, use the “amd” or “diablo” constraints.
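As a sketch of how such a constraint might be passed to slurm, the snippet below writes a small batch script that requests one node with the "amd" constraint and submits it if sbatch is available. The file name amd_job.sh, the job name and the 10-minute time limit are placeholders of ours, not platform requirements.

```shell
# Write a minimal batch script requesting one AMD node.
# amd_job.sh is a hypothetical file name chosen for this example.
cat > amd_job.sh <<'EOF'
#!/bin/bash
#SBATCH --job-name=amd-test
#SBATCH --nodes=1
#SBATCH --constraint=amd
#SBATCH --time=00:10:00
# Show which node the job landed on and its CPU model.
hostname
lscpu | grep 'Model name'
EOF

# Submit only where slurm is available (e.g. on a PlaFRIM devel node).
if command -v sbatch >/dev/null 2>&1; then
    sbatch amd_job.sh
else
    echo "sbatch not found: submit amd_job.sh from a PlaFRIM node"
fi
```

Replacing amd with diablo in the --constraint line uses the other constraint named above. The same option can be given on the command line, for example salloc --constraint=amd for an interactive allocation.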

Please do not hesitate to contact us if you need further information.

End of network instability on bora[001-040] and devel[01-02]

The network switch was replaced on January 15. Since then, we have not observed any outages.
We can confirm that the access problems on the bora[001-040] and devel[01-02] nodes are solved.

Please do not hesitate to contact us if you need further information.