Conversation
Signed-off-by: jwilber <[email protected]>
|
Important Review skippedAuto reviews are disabled on this repository. Please check the settings in the CodeRabbit UI or the You can disable this status message by setting the
✨ Finishing touches🧪 Generate unit tests (beta)
Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. Comment |
| thd_enabled: true | ||
| use_sequence_packing: true | ||
| fp8_enabled: true | ||
| fp8_recipe: transformer_engine.common.recipe.DelayedScaling |
There was a problem hiding this comment.
let's use Float8BlockScaling
| # Lingua 1B - FSDP2 with Context Parallelism | ||
| - config: L2_lingua_1b | ||
| task_cmd: train_fsdp2_cp | ||
| thd_enabled: false | ||
| use_sequence_packing: false | ||
| fp8_enabled: false | ||
| cp_enabled: true | ||
| cp_size: 2 | ||
| wandb_name: "llama3_lingua_1b__fsdp2__cp__${now:%Y%m%d-%H%M%S}__${gitsha:}" | ||
| job_name: "llama3-lingua-1b-fsdp2-cp" | ||
|
|
||
| # Lingua 1B - FSDP2 with Context Parallelism + FP8 | ||
| - config: L2_lingua_1b | ||
| task_cmd: train_fsdp2_cp | ||
| thd_enabled: false | ||
| use_sequence_packing: false | ||
| fp8_enabled: true | ||
| fp8_recipe: transformer_engine.common.recipe.DelayedScaling | ||
| fp8_format: HYBRID | ||
| cp_enabled: true | ||
| cp_size: 2 | ||
| wandb_name: "llama3_lingua_1b__fsdp2__cp__fp8__${now:%Y%m%d-%H%M%S}__${gitsha:}" | ||
| job_name: "llama3-lingua-1b-fsdp2-cp-fp8" |
There was a problem hiding this comment.
cp runs should probably use sequence packing, but we need to be careful to use cudnn>=9.18
Signed-off-by: jwilber <[email protected]>
Signed-off-by: jwilber <[email protected]>
Signed-off-by: jwilber <[email protected]>
Draft llama3 configs.
(not yet tested).
Also updated schedule for runs (and added this info to README):
┌─────────────────────┬───────────────────────┐
│ Model │ Schedule │
├─────────────────────┼───────────────────────┤
│ esm2_native_te_650m │ Mon/Wed/Fri (1am PST) │
├─────────────────────┼───────────────────────┤
│ esm2_native_te_15b │ Mon/Wed/Fri (1am PST) │
├─────────────────────┼───────────────────────┤
│ llama3_native_te_1b │ Tue/Thu (1am PST) │
├─────────────────────┼───────────────────────┤
│ codonfm_ptl_te │ Tue/Thu (1am PST) │
└─────────────────────┴───────────────────────┘