Codestin Search App

yaoyu-33 · 2022-11-15T18:57:24Z

We updated megatron fused softmax in the following aspects:

We updated the limit of sequence length from 2048 to 4096 according to https://github.com/NVIDIA/Megatron-LM/blob/main/megatron/model/fused_softmax.py#L171
We also enabled mask=None support in scaled_masked_softmax according to https://github.com/NVIDIA/Megatron-LM/blob/main/megatron/model/fused_softmax.py#L84

Signed-off-by: Yu Yao <[email protected]>

crcrpar

how is the custom C++/CUDA extension of scaled_softmax_cuda built?
usually setup.py needs to be updated.
also, could you update https://github.com/NVIDIA/apex/blob/master/tests/L0/run_transformer/test_fused_softmax.py as well?

Signed-off-by: Yu Yao <[email protected]>

yaoyu-33 · 2022-11-15T19:40:42Z

@crcrpar comments addressed

crcrpar · 2022-11-16T07:17:24Z

+            expected.backward(g0)
+            actual.backward(g1)


feels like these lines are based off of some existing cases but how about either checking grads or remove these lines?

added grad check

I think the purpose in other tests was just to test backward can work correctly

Signed-off-by: Yu Yao <[email protected]>

crcrpar

generally okay but could you maybe tweak tests a bit?

Signed-off-by: Yu Yao <[email protected]>

* Update megatron fused softmax follow megatron-lm Signed-off-by: Yu Yao <[email protected]> * Add mask=None support in scaled_masked_softmax Signed-off-by: Yu Yao <[email protected]> * Update setup.py for scaled_softmax_cuda Signed-off-by: Yu Yao <[email protected]> * Add tests for fused_scale_softmax (mask=None) Signed-off-by: Yu Yao <[email protected]> * Assert grad equal in fused softmax test Signed-off-by: Yu Yao <[email protected]> * Revert "Assert grad equal in fused softmax test" Signed-off-by: Yu Yao <[email protected]> Signed-off-by: Yu Yao <[email protected]> Co-authored-by: Yu Yao <[email protected]>

yaoyu-33 added 2 commits November 15, 2022 10:49

Update megatron fused softmax follow megatron-lm

6f9bae6

Signed-off-by: Yu Yao <[email protected]>

Add mask=None support in scaled_masked_softmax

302f29d

Signed-off-by: Yu Yao <[email protected]>

crcrpar reviewed Nov 15, 2022

View reviewed changes

yaoyu-33 added 2 commits November 15, 2022 11:19

Update setup.py for scaled_softmax_cuda

2bcecc3

Signed-off-by: Yu Yao <[email protected]>

Add tests for fused_scale_softmax (mask=None)

a8d55a5

Signed-off-by: Yu Yao <[email protected]>

crcrpar reviewed Nov 16, 2022

View reviewed changes

yaoyu-33 and others added 2 commits November 16, 2022 09:10

Assert grad equal in fused softmax test

b14e66f

Signed-off-by: Yu Yao <[email protected]>

Merge branch 'NVIDIA:master' into master

b254a4a

crcrpar reviewed Nov 17, 2022

View reviewed changes

Comment thread tests/L0/run_transformer/test_fused_softmax.py Outdated

Comment thread tests/L0/run_transformer/test_fused_softmax.py Outdated

Revert "Assert grad equal in fused softmax test"

618d49c

Signed-off-by: Yu Yao <[email protected]>

crcrpar merged commit abeca58 into NVIDIA:master Nov 22, 2022

BugMaker-bot mentioned this pull request May 9, 2023

Error installing Apex #1645

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update megatron fused softmax follow megatron-lm#1539

Update megatron fused softmax follow megatron-lm#1539
crcrpar merged 7 commits into
NVIDIA:masterfrom
yaoyu-33:master

yaoyu-33 commented Nov 15, 2022

Uh oh!

crcrpar left a comment

Uh oh!

yaoyu-33 commented Nov 15, 2022

Uh oh!

crcrpar Nov 16, 2022

Uh oh!

yaoyu-33 Nov 16, 2022

Uh oh!

yaoyu-33 Nov 16, 2022

Uh oh!

crcrpar left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

yaoyu-33 commented Nov 15, 2022

Uh oh!

crcrpar left a comment

Choose a reason for hiding this comment

Uh oh!

yaoyu-33 commented Nov 15, 2022

Uh oh!

crcrpar Nov 16, 2022

Choose a reason for hiding this comment

Uh oh!

yaoyu-33 Nov 16, 2022

Choose a reason for hiding this comment

Uh oh!

yaoyu-33 Nov 16, 2022

Choose a reason for hiding this comment

Uh oh!

crcrpar left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants