
fix large loss during llama2 post-training#82

Open
sidhantls wants to merge 1 commit into horseee:main from sidhantls:fix_nan_llama2


@sidhantls sidhantls commented Oct 7, 2024

Fixes #81

When post_training loads the pruned model (the output of hf_prune.py), cast the model to fp32 if base_model is llama2.
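The diff itself is not reproduced on this page, so the following is only a minimal sketch of the idea, not the actual patch. The helper name `maybe_cast_to_fp32` and the name-matching heuristic are hypothetical; the sketch relies only on the model exposing a `.float()` method, as `torch.nn.Module` does.

```python
def is_llama2(base_model: str) -> bool:
    """Heuristic (an assumption, not taken from the PR): treat any base model
    whose name contains "llama2" or "llama-2" as a Llama 2 checkpoint."""
    normalized = base_model.lower().replace("-", "").replace("_", "")
    return "llama2" in normalized


def maybe_cast_to_fp32(model, base_model: str):
    """Return the model cast to fp32 when base_model looks like Llama 2.

    `model` can be any object with a `.float()` method that returns the
    fp32 model (torch.nn.Module.float() behaves this way).
    Other base models are returned unchanged.
    """
    if is_llama2(base_model):
        return model.float()
    return model
```

In the post-training script this would be called once, right after the pruned checkpoint is loaded and before the optimizer or trainer is built, for example `model = maybe_cast_to_fp32(model, base_model)`.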


Development

Successfully merging this pull request may close these issues.

Post training more than 1 epoch leads to performance degradation
