Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Conversation

@afrittoli
Copy link
Contributor

@afrittoli afrittoli commented Mar 12, 2024

Description of your changes:

Spellcheck all markdown files (except REVIEWERS) to help us maintain how quality docs.

Code snippets are excluded from the check.
The job only runs when markdown files are changed to save CI resources.

Add a custom dictionary that includes words unknown by the default dict.

Copy link
Member

@markstur markstur left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I see a couple typos in your exceptions. Should fix those. I am in favor of a spellcheck even though sometimes it's a pain.

@afrittoli
Copy link
Contributor Author

Thanks @markstur for the review - I addressed your comments and reduced the custom dictionary even more with a couple more spell fixes.

It looks like both "pre-trained" and "pretrained" are acceptable, however the version with the hyphen seems more common, so I left it as it is.

@afrittoli
Copy link
Contributor Author

I see a couple typos in your exceptions. Should fix those. I am in favor of a spellcheck even though sometimes it's a pain.

Indeed - I tried to configure the spellcheck to give as little pain as possible.
One extra thing we could add to the config is to ignore words shorted than X characters (e.g. all 2 and 3-letter words).

Ultimately I think it's worth the little extra pain, rather than having spelling errors in the docs (which look rather unprofessional) and many small PRs fixing them.

Spellcheck all markdown files (expect REVIEWERS) to help us
maintain how quality docs.

Code snippets are excluded from the check.
The job only runs when markdown files are changed to save
CI resources.

Add a custom dictionary that includes words unknown by the
default dict.

Signed-off-by: Andrea Frittoli <[email protected]>
Copy link
Member

@hickeyma hickeyma left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, thanks for this handy tool @afrittoli

@hickeyma hickeyma merged commit f7435b9 into instructlab:main Mar 14, 2024
@hickeyma hickeyma deleted the spellcheck branch March 14, 2024 12:49
@markstur
Copy link
Member

Thanks @markstur for the review - I addressed your comments and reduced the custom dictionary even more with a couple more spell fixes.

It looks like both "pre-trained" and "pretrained" are acceptable, however the version with the hyphen seems more common, so I left it as it is.

Yeah a lot of one word /two words/hyphen decisions depend on before or after the noun.

A pre-trained thing vs the thing is pretrained.
An open-sourced thing vs the thing is open sourced.
(my examples might be correct (?))

I refer to grammarly (well basically I google is X one word or two or hyphenated)

@nathan-weinberg nathan-weinberg mentioned this pull request Feb 7, 2025
6 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants