You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: examples/imagenet/README.md
+30-24Lines changed: 30 additions & 24 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -16,7 +16,7 @@ Notice that with the new Amp API **you never need to explicitly convert your mod
16
16
17
17
To train a model, create softlinks to the Imagenet dataset, then run `main.py` with the desired model architecture, as shown in `Example commands` below.
18
18
19
-
The default learning rate schedule is set for ResNet50. `main_amp.py` script rescales the learning rate according to the global batch size (number of distributed processes x per-process minibatch size).
19
+
The default learning rate schedule is set for ResNet50. `main_amp.py` script rescales the learning rate according to the global batch size (number of distributed processes \* per-process minibatch size).
20
20
21
21
## Example commands
22
22
@@ -26,59 +26,65 @@ The default learning rate schedule is set for ResNet50. `main_amp.py` script re
26
26
CPU data loading bottlenecks.
27
27
28
28
**Note:**`--opt-level``O1` and `O2` both use dynamic loss scaling by default unless manually overridden.
29
-
`--opt-level``O0` and `O3` (the "pure" training modes) do not use loss scaling by default, but they
30
-
can also be told to use loss scaling via manual overrides. Using loss scaling with `O0`
31
-
(pure FP32 training) does not really make sense, though, and will trigger a warning.
29
+
`--opt-level``O0` and `O3` (the "pure" training modes) do not use loss scaling by default.
30
+
`O0` and `O3` can be told to use loss scaling via manual overrides, but using loss scaling with `O0`
31
+
(pure FP32 training) does not really make sense, and will trigger a warning.
32
32
33
-
```bash
34
-
### Softlink training dataset into current directory
33
+
Softlink training and validation dataset into current directory
34
+
```
35
35
$ ln -sf /data/imagenet/train-jpeg/ train
36
-
### Softlink validation dataset into current directory
37
36
$ ln -sf /data/imagenet/val-jpeg/ val
38
37
```
39
38
40
-
Single-process "pure fp32" training
39
+
### `--opt-level O0` (FP32 training) and `O3` (FP16 training)
0 commit comments