Storage on Viking

It is vital that users continue to manage their data on the Viking cluster. There are currently six areas where data can be stored. The summary below lists each area, the type of data you should store there, and whether it is backed up.

Home
    Backed up: No
    Location: /users/abc123
    Data type: User code, programs, local application cache, Python/conda/R environments
    Deletion policy: Never deleted

Scratch (users)
    Backed up: No
    Location: /mnt/scratch/users/abc123/
    Default quota*: 2TB
    Data type: Active research data used for jobs running on Viking
    Deletion policy: Data not touched in 90 days is deleted

Scratch (projects)
    Backed up: No
    Location: /mnt/scratch/projects
    Default quota*: 2TB
    Data type: Active shared project data for jobs running on Viking. Other types of shared data for workloads that an entire project can use. Useful for joint application testing and development.
    Deletion policy: Data not touched in 90 days is deleted unless discussed with the Viking team

Scratch (flash)
    Backed up: No
    Location: Set with lfs commands (see Scratch (Flash) below)
    Default quota*: 2TB
    Data type: Flash (fast) storage that can be used if your workloads are IO intensive
    Deletion policy: Data not touched in 90 days is deleted

Longship
    Backed up: No
    Location (Viking): /mnt/longship/users, /mnt/longship/projects
    Location (Campus, SFTP/Linux): /shared/longship/users, /shared/longship/projects
    Location (Campus, Windows): \\longship.york.ac.uk\users, \\longship.york.ac.uk\projects
    Default quota*: 2TB
    Data type: Warm data: data that is not actively in use but will be used on Viking in the future. Accessible from Viking AND campus. Can also be used to transfer data to/from Viking.
    Deletion policy: Not deleted

localtmp
    Backed up: No
    Location: /users/abc123/localtmp
    Data type: Temporary files for jobs
    Deletion policy: Clean-up period TBD

* Default quotas can be increased on request.

Scratch

Scratch is a high-performance file system that runs Lustre, an open-source parallel file system designed for High Performance Computing (HPC) environments. Scratch works best when the filesystem is not at capacity. To keep it efficient, we will be implementing a policy under which data that has not been touched in 90 days is deleted automatically. If you wish to keep your data, please ensure it is backed up to the University filestore (storage.york.ac.uk) or to the cloud, or move it to Longship (see below).

We will be making exceptions to this rule for certain project directories where groups share data between teams. This is to encourage users not to download the same datasets multiple times.
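To see which files might be affected by the 90-day policy, something like the following sketch can help. It assumes that "touched" refers to the last access time, which the text above does not confirm, and the function name `stale_files` is purely illustrative. It uses the standard `find` command rather than any Viking-specific tool.

```shell
#!/usr/bin/env bash
# Hypothetical helper: list regular files under a directory whose last
# access time is more than N days old (N defaults to 90).
# Assumption: the deletion policy is based on access time; it may instead
# use modification time, in which case -atime would become -mtime.
# Note: find counts age in whole days, so "+90" matches files whose age
# is at least 91 full days.
stale_files() {
    local dir="$1"
    local days="${2:-90}"
    find "$dir" -type f -atime +"$days" -print
}

# Example (replace abc123 with your own username):
# stale_files /mnt/scratch/users/abc123 90
```

Files listed this way are candidates for backing up or moving to Longship before they are removed.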

Scratch (Flash)

For IO-intensive workflows you may find better performance using our high-performance flash storage. This filesystem also runs Lustre. Its overall capacity is smaller, so we ask users to clean up and delete data after use. To use flash scratch, perform the following steps:

$ mkdir test-dir
$ lfs setstripe --pool flash test-dir

Then write to the directory that has the flash pool set. By default this stripes over just one Object Storage Target (OST). To stripe across more, for example 10 OSTs:

$ lfs setstripe --pool flash --stripe-count 10 test-dir

Note

It’s easiest to create a new directory with the striping set, then cp the files there. Don’t use mv, as this only updates the metadata and the underlying objects remain the same. For better Lustre performance it may be worth explicitly striping across multiple OSTs there as well. Striping in Lustre should only be used for large files; using it for small files can actually degrade performance. Further guidance on using Lustre to achieve maximum performance can be found here.

Attention

Please remember to delete your data after you’ve finished using the flash storage.
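The approach described in the note above (create a new directory with the flash pool set, then copy files in rather than moving them) can be sketched as a small shell function. The function name `copy_to_flash` and the example paths are hypothetical, and the sketch assumes the destination lies on the Lustre scratch filesystem; the `lfs setstripe` options are the ones shown earlier in this section.

```shell
#!/usr/bin/env bash
# Hypothetical helper: copy the contents of SRC into a NEW directory DEST
# that has the flash pool (and optionally a stripe count) set.
# Assumption: DEST is on the Lustre scratch filesystem.
copy_to_flash() {
    local src="$1"
    local dest="$2"
    local stripes="${3:-1}"   # default of one OST, as described above

    # Fail if DEST already exists, so the layout is applied to a fresh directory.
    mkdir "$dest" || return 1

    # Set the flash pool and stripe count before any data is written.
    lfs setstripe --pool flash --stripe-count "$stripes" "$dest" || return 1

    # Use cp, not mv: mv only updates metadata and leaves the objects where they are.
    cp -r "$src"/. "$dest"/
}

# Example (hypothetical paths):
# copy_to_flash /mnt/scratch/users/abc123/input /mnt/scratch/users/abc123/input-flash 10
# lfs getstripe /mnt/scratch/users/abc123/input-flash   # inspect the resulting layout
# rm -r /mnt/scratch/users/abc123/input-flash           # delete once the job has finished
```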

Longship

If you have data on scratch that you need to keep but might not use in the next 90 days, you can copy it to Longship. This is not a high-performance filestore, but it can be used to save data so that users do not have to copy datasets back and forth between Viking and campus. Longship is mounted read/write on the Viking login nodes and read-only on the compute nodes. If your job only needs to read a small number of files, you might be able to read them directly from Longship. You can also access Longship on campus. The hope is to simplify the data journey for users, whilst also taking a lead on curating your data.

Location

Below are the paths for accessing Longship from various locations.

Viking

/mnt/longship/users

/mnt/longship/projects

Campus (SFTP/Linux)

/shared/longship/users

/shared/longship/projects

Campus (Windows)

\\longship.york.ac.uk\users

\\longship.york.ac.uk\projects
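As an illustration of copying data from scratch to Longship, the sketch below checks that the destination is writable before copying, since Longship is read-only on the compute nodes. The function name `copy_to_longship` is hypothetical, and the per-user subdirectory `/mnt/longship/users/abc123` in the example is an assumption rather than a path confirmed above.

```shell
#!/usr/bin/env bash
# Hypothetical helper: copy a directory into a Longship location,
# preserving timestamps and permissions.
# Run this on a login node: Longship is read-only on the compute nodes.
copy_to_longship() {
    local src="$1"
    local dest_parent="$2"

    # Refuse to continue if the destination is missing or not writable.
    if [ ! -d "$dest_parent" ] || [ ! -w "$dest_parent" ]; then
        echo "Cannot write to $dest_parent (are you on a compute node?)" >&2
        return 1
    fi

    # -a copies recursively and keeps attributes; SRC ends up at DEST_PARENT/<name of SRC>.
    cp -a "$src" "$dest_parent"/
}

# Example (assumed per-user layout; replace abc123 with your username):
# copy_to_longship /mnt/scratch/users/abc123/dataset /mnt/longship/users/abc123
```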