Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Conversation

@Fedzbar
Copy link
Contributor

@Fedzbar Fedzbar commented Dec 7, 2024

Hello,

Thanks for the great library!

I believe there is a bug with the conversion of Llama models when it comes to the RoPE wavelength.

This is quite important as the theta is set to 500k in the new Llama models, while this was hardcoded to 10k.

Thanks!
Federico

@danieldjohnson danieldjohnson merged commit e23bfed into google-deepmind:main Dec 15, 2024
2 checks passed
@danieldjohnson
Copy link
Collaborator

Good catch, thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants