Commit 03c9d80
Decouple optimizer state and grad dtypes in distributed Adam optimizer (#1575)
* Decouple distopt dtypes for grads and optim state
* Automatically detect grad dtype for Transformer layer wgrad fusion
* Review suggestions from @crcrpar
1 parent 0c8400a commit 03c9d80
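To illustrate what decoupling gradient and optimizer-state dtypes means, here is a minimal scalar sketch using only the Python standard library. It is not the Apex implementation: the function names, the `grad_fmt` and `state_fmt` parameters, and the use of `struct` rounding to emulate fp16/fp32 storage are all assumptions made for illustration.

```python
import math
import struct


def round_to(value: float, fmt: str) -> float:
    """Round a float to a struct format's precision ('e' = fp16, 'f' = fp32)."""
    return struct.unpack(fmt, struct.pack(fmt, value))[0]


def adam_step(param, grad, exp_avg, exp_avg_sq, step, *,
              grad_fmt="e", state_fmt="f",
              lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update for a scalar with independent storage precisions.

    grad_fmt (hypothetical) sets the precision the gradient is stored in;
    state_fmt (hypothetical) sets the precision of the two moment estimates.
    """
    g = round_to(grad, grad_fmt)  # gradient as stored in its buffer
    exp_avg = round_to(beta1 * exp_avg + (1 - beta1) * g, state_fmt)
    exp_avg_sq = round_to(beta2 * exp_avg_sq + (1 - beta2) * g * g, state_fmt)
    # Bias correction and the parameter update use full Python precision.
    m_hat = exp_avg / (1 - beta1 ** step)
    v_hat = exp_avg_sq / (1 - beta2 ** step)
    param = param - lr * m_hat / (math.sqrt(v_hat) + eps)
    return param, exp_avg, exp_avg_sq


# fp16 gradient, fp32 optimizer state
p, m, v = adam_step(1.0, 0.1, 0.0, 0.0, step=1)
```

Because the two formats are separate arguments, the gradient can be held at lower precision than the moment estimates (or the reverse) without changing the update rule itself.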
5 files changed
Lines changed: 97 additions & 106 deletions
File tree
- apex
  - contrib
    - csrc/optimizers
    - optimizers
    - test/optimizers
  - transformer/tensor_parallel
- csrc
Lines changed: 43 additions & 41 deletions
| Original file lines | Diff lines | Diff line change |
|---|---|---|
| 200-205 | 200-206 | 1 added (new 203) |
| 230-236 | 231-237 | 1 replaced (old 233, new 234) |
| 256-262 | 257-263 | 1 replaced (old 259, new 260) |
| 294-300 | 295-301 | 1 replaced (old 297, new 298) |
| 349-360 | 350-358 | 6 removed (old 352-357), 3 added (new 353-355) |
| 363-385 | 361-384 | 17 removed (old 366-382), 18 added (new 364-381) |
| 402-428 | 401-430 | 1 added (new 404); 15 removed (old 412-426), 17 added (new 412-428) |

| Original file lines | Diff lines | Diff line change |
|---|---|---|
| 444-454 | 444-449 | 5 removed (old 447-451) |
| 488-500 | 483-489 | 7 removed (old 491-497), 1 added (new 486) |
| 515-528 | 504-515 | 2 removed (old 518, 525) |
| 565-571 | 552-558 | 1 replaced (old 568, new 555) |
| 603-609 | 590-596 | 1 replaced (old 606, new 593) |
| 829-835 | 816-822 | 1 replaced (old 832, new 819) |
| 1089-1095 | 1076-1082 | 1 replaced (old 1092, new 1079) |
| 1934-1940 | 1921-1926 | 1 removed (old 1937) |
| 2036-2049 | 2022-2035 | 2 replaced (old 2039, new 2025; old 2046, new 2032) |

| Original file lines | Diff lines | Diff line change |
|---|---|---|
| 33-38 | 33-39 | 1 added (new 36) |
| 79-84 | 80-86 | 1 added (new 83) |
| 117-122 | 119-125 | 1 added (new 122) |
| 133-138 | 136-142 | 1 added (new 139) |
| 239-244 | 243-258 | 10 added (new 246-255) |