system news

Maintenance on Kebnekaise 2023-10-12 - 2023-10-13

  • Posted on: 4 October 2023
  • By: ake

We will have a maintenance window on Kebnekaise 2023-10-12 - 2023-10-13 to upgrade the batch system. It is due to an important security update of SLURM (the workload manager/batch scheduler).

From 2023-10-12 08:00 no jobs will be allowed to run, this means that jobs will not be allowed to start if their requested time limit reaches into this service window.

The maintenance window is from 2023-10-12 08:00 to 2023-10-13 17:00, but we hope to be finished before that.

2023-06-09 A mishap with Slurm caused a loss of the job accounting data for Kebnekaise jobs today between 00:00 and 16:40

  • Posted on: 9 June 2023
  • By: brorerik

2023-06-09 A mishap with Slurm caused a loss of the job accounting data for Kebnekaise jobs today between 00:00 and 16:40

We can see no other effect on running jobs and the job queue are now open again after having been DOWN for 1 hour

If you see some other negative effect send us a support case and we'll help solving the issue

Sorry for the inconvenience that this may have caused.

 

Best regards,

/Support

2023-01-30 07:00 Planned maintenance of the cooling systems and central file system (FINISHED 2023-02-02 20:30)

  • Posted on: 20 January 2023
  • By: brorerik

Akademiska hus have a planned maintenance of the cooling systems for the HPC2N Infrastructure computer hall on 2023-02-01

We'll coordinate an upgrade of the central file system around their maintenance to minimize the time the cluster is draining jobs.

The combined maintenance window will therefore start on 2023-01-30 07:00 and according to our planning end on 2023-02-03 16:00

All Kebnekaise nodes, central storage and the login nodes will be unavailable during this time.

2022-12-05 File system down, login not working (SOLVED 23:58)

  • Posted on: 5 December 2022
  • By: brorerik

We are currently experiencing file system server problems.

This is blocking logins and is also affecting running jobs.

We're working to get it back online but currently have no ETA for this.

UPDATE 23:58

The issues has now been resolved and all systems are working normally and the jobs queues are active,

UPDATE 17:30

The work with the file system verification continues, the job queues will not be up until late this evening or around 09.00 tomorrow.

UPDATE 13:10

File system down, 2022-10-06 login not working (SOLVED 2022-10-06 21:40)

  • Posted on: 6 October 2022
  • By: ake

Today at around 08:30 the file system started to have problems.

This is blocking logins and is also affecting running jobs.

We're working to get it back online.

 

UPDATE  2022-10-06 21:40)

The problem with the file system has been solved and Kebnekaise is now up and working normally again.

Due to the problems today some running jbs might have failed and need to be restarted by the user again.

We are sorry for the problems that this has caused

Regards,

/Support

Pages

Updated: 2024-11-01, 13:56