Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Commit 195ab26

Browse files
committed
test
1 parent 9a1b4ed commit 195ab26

File tree

1 file changed

+26
-27
lines changed

1 file changed

+26
-27
lines changed

guides/inf_cluster_quick_start_07_10_24.md

Lines changed: 26 additions & 27 deletions
Original file line numberDiff line numberDiff line change
@@ -37,7 +37,7 @@ __At a minimum, it would be very helpful if you can do these THREE tasks:__
3737
wget https://repo.continuum.io/miniconda/Miniconda3-latest-Linux-x86_64.sh
3838
bash Miniconda3-latest-Linux-x86_64.sh
3939
```
40-
- Installing Conda can be _very_ slow, so please get this done before we start.
40+
- Installing Conda take a bit of time, so please get this done before we start.
4141
- Note that copying `${HOME}/miniconda3` between clusters typically breaks links and pathways. You should install `miniconda3` from scratch each time and then export and recreate each environment across clusters
4242
- You can use `virtualenv` or `poetry` if desired but I will assume that you know what you are doing / don't need help.
4343
@@ -56,7 +56,7 @@ This is an approximate diagram of how the cluster is configured for a different
5656
5757
## Quick Bash Environment Setup
5858
59-
1. There is now only one Informatics cluster called `mlp` (note that `ilcc-cluster` still works as an alias). Throughout this guide I will assume you have either set a variable called `CLUSTER_NAME` (or you'll just replace that in the instructions) e.g `export CLUSTER_NAME=mlp`.
59+
1. There is now only one Informatics cluster, and the head node is called `mlp` (note that `ilcc-cluster` still works as an alias). Throughout this guide I will assume you have either set a variable called `CLUSTER_NAME` (or you'll just replace that in the instructions) e.g `export CLUSTER_NAME=mlp`.
6060
6161
2. Run this line to ssh into the cluster: `ssh ${USER}@${CLUSTER_NAME}.inf.ed.ac.uk`
6262
@@ -87,17 +87,16 @@ This is an approximate diagram of how the cluster is configured for a different
8787
```
8888
git clone https://github.com/cdt-data-science/cluster-scripts
8989
cd ./cluster-scripts
90-
```
91-
- __Follow the instructions in `README.md`__
92-
93-
6. Re-source your Bash profile
94-
```
90+
echo 'export PATH=/home/$USER/cluster-scripts:$PATH' >> ~/.bashrc
9591
source ~/.bashrc
9692
```
9793
98-
7. You can now play around with commands on the cluster (try running `free-gpus`, `cluster-status`)
94+
6. You can now play around with commands on the cluster (try running `free-gpus`, `cluster-status`)
95+
96+
- __For more information, have a look at the `README.md`__ in the `cluster-scripts` directory.
97+
9998
100-
8. You are ready to go!
99+
7. You are ready to go!
101100
102101
103102
## What's Next? Practical examples!
@@ -114,10 +113,10 @@ All the examples below expect you have performed the prior setup.
114113
115114
Make the conda environment. This can take a bit of time (it’s harder for the distributed filesystem to deal with lots of small files than for your local machine’s hard drive) - go get a cup of tea.
116115
117-
1. Check local versions of cuda available - at time of writing cuda 12.5.0 is the latest version available: `ls -d /opt/cu*`. You should use this version for the `cudatoolkit=??.?.?` argument below.
116+
1. Check local versions of cuda available - at time of writing cuda 11.8.0 available: `ls -d /opt/cu*`. You should use this version for the `cudatoolkit=??.?.?` argument below.
118117
119118
2. Run the command to create a conda environment called `pt`:
120-
`conda create -y -n pt python=3 pytorch torchvision torchaudio cudatoolkit=12.5.0 -c pytorch` (more info about PyTorch installation here if this goes wrong: https://pytorch.org/get-started/locally/)
119+
`conda create -y -n pt python=3 pytorch torchvision torchaudio cudatoolkit=11.8.0 -c pytorch` (more info about PyTorch installation here if this goes wrong: https://pytorch.org/get-started/locally/)
121120
3. Activate the environment `conda activate pt`
122121
123122
#### Get some code to run MNIST experiments.
@@ -129,25 +128,25 @@ Get some code to run MNIST in PyTorch and run it:
129128
130129
##### Interactive jobs (without a GPU)
131130
132-
1. Get an interactive session (you shouldn’t do processing on the head node)
133-
- Find partitions which are used for Interactive sessions (they'll have interactive in the name). For example:
134-
```
135-
$ sinfo -o '%R;%N' | column -s';' -t
136-
137-
PARTITION NODELIST
138-
Teach-Interactive landonia[01,03]
139-
Teach-Standard landonia[02,04-13,19-20]
140-
Teach-Short
141-
Teach-LongJobs landonia[21-25]
142-
General_Usage letha06,meme
143-
PGR-Standard crannog[01-07],damnii[01-12]
144-
ILCC-Standard barre,duflo,greider,levi,mcclintock,nuesslein
145-
ILCC-CDT arnold,strickland
146-
MandI-Standard bonsall,buccleuch,chatelet,davie,elion,gibbs,livy,nicolson,quarry,snippy,tangmere,tomorden,yonath
131+
1. Get an interactive session (you shouldn’t do processing on the head node).
132+
- List all the partitions and nodes available:
133+
```
134+
$ sinfo -o '%R;%N' | column -s';' -t
135+
136+
PARTITION NODELIST
137+
Teach-Interactive landonia[01,03]
138+
Teach-Standard landonia[02,04-13,19-20]
139+
Teach-Short
140+
Teach-LongJobs landonia[21-25]
141+
General_Usage letha06,meme
142+
PGR-Standard crannog[01-07],damnii[01-12]
143+
ILCC-Standard barre,duflo,greider,levi,mcclintock,nuesslein
144+
ILCC-CDT arnold,strickland
145+
MandI-Standard bonsall,buccleuch,chatelet,davie,elion,gibbs,livy,nicolson,quarry,snippy,tangmere,tomorden,yonath
147146
```
148147
149148
- Use srun to get an interactive session on that partition. For example:
150-
```
149+
```
151150
srun --partition=ILCC-Standard --time=08:00:00 --mem=8000 --cpus-per-task=4 --pty bash
152151
```
153152

0 commit comments

Comments
 (0)