Add proper grapheme cluster support for modern terminal UTF-8 handling #1583
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Summary
This PR addresses critical UTF-8 display and input issues that occur in modern terminals like Ghostty when handling emoji with variation selectors and other complex Unicode sequences.
The problem manifests as display corruption where emoji sequences like
ππππ₯°πβ£οΈπβ₯οΈβ₯οΈπβ£οΈπ₯°π₯°πππcause text positioning issues and garbled output in sidepanels and input fields.Key Changes
Technical Details
src/core/utf8.c: New grapheme cluster advancement functions with utf8proc integrationsrc/fe-text/gui-entry.c: Fixed cursor positioning and text measurement for complex Unicodesrc/fe-text/gui-readline.c: Enhanced paste processing to handle multi-codepoint sequencessrc/core/recode.c: Bypass TRANSLIT when both input and output are valid UTF-8Test Case
The implementation was tested with the emoji sequence that originally caused issues:
ππππ₯°πβ£οΈπβ₯οΈβ₯οΈπβ£οΈπ₯°π₯°πππCompatibility