ManyTheFish commented Jan 18, 2023

Pull Request

Use the compatibility decomposition normalizer in facet string extraction to get a more human-friendly sort order.

Now, é (U+00E9) is converted to e (U+0065) + ◌́ (U+0301). This way any word starting with an accented/diacritized version of a character is put just after the words starting with the unaccented version of the character.
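To make the decomposition concrete, here is a minimal Python sketch using the standard library's `unicodedata` module. It assumes that "compatibility decomposition" refers to Unicode NFKD; the helper names `decompose` and `code_points` are hypothetical and are not taken from this PR's code.

```python
import unicodedata


def decompose(text: str) -> str:
    """Apply Unicode compatibility decomposition (NFKD), assumed to match the PR's normalizer."""
    return unicodedata.normalize("NFKD", text)


def code_points(text: str) -> list[str]:
    """List the code points of a string in U+XXXX notation."""
    return [f"U+{ord(ch):04X}" for ch in text]


# "\u00e9" is the precomposed character é.
print(code_points("\u00e9"))             # ['U+00E9']
print(code_points(decompose("\u00e9")))  # ['U+0065', 'U+0301']
```

After decomposition, the string starts with a plain `e`, so a code-point comparison groups it with other words starting with `e`.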

Related issue

Fixes #3260

ManyTheFish added this to the v1.1.0 milestone Jan 18, 2023

Kerollmops left a comment

That is great. Thank you!

irevoire commented Jan 18, 2023

> Now, é (U+00E9) is converted to e (U+0065) + ◌́ (U+0301). This way any word starting with an accented/diacritized version of a character is put just after the words starting with the unaccented version of the character.

Is that true, though?

et
été
ez

The end of your message tells me these words would be sorted like that, but if I understood the first part correctly, they're actually going to be sorted like this:

et
ez
été

ACTUALLY: I just reread your message, and that's exactly what you said. I just missed the important "s" on "words" in "is put just after the words" 🤦

Nice improvement regardless 👍
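The ordering discussed in this comment can be checked with a short Python sketch. It assumes that facet strings are compared lexicographically by code point after NFKD normalization; `facet_sort_key` is a hypothetical helper, not the function used in the actual extraction code, which may apply further normalization steps.

```python
import unicodedata


def facet_sort_key(value: str) -> str:
    """Hypothetical sort key: the NFKD-decomposed form of the facet string."""
    return unicodedata.normalize("NFKD", value)


# "\u00e9t\u00e9" is the word été written with precomposed é characters.
words = ["ez", "\u00e9t\u00e9", "et", "fa"]

# Without normalization, é (U+00E9) sorts after every ASCII letter.
raw_order = sorted(words)

# With decomposition, été becomes e + U+0301 + t + e + U+0301, so it sorts
# after the unaccented "e" words but before words starting with "f".
normalized_order = sorted(words, key=facet_sort_key)

print(raw_order)         # ['et', 'ez', 'fa', 'été']
print(normalized_order)  # ['et', 'ez', 'été', 'fa']
```

The combining accent U+0301 has a higher code point than any ASCII letter, which is why "été" lands after both "et" and "ez" rather than between them.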

@ManyTheFish

Thanks, @Kerollmops and @irevoire!

bors merge

bors bot commented Jan 18, 2023

Build succeeded:

bors bot merged commit b3166df into main Jan 18, 2023
bors bot deleted the enhance-facet-string-normalization branch January 18, 2023 17:50
meili-bot added the v1.1.0 label (PRs/issues solved in v1.1.0, released on 2023-04-03) Apr 6, 2023
Successfully merging this pull request may close these issues.

sort by is incorrect for accented characters