Support for "confusables"? #25

Open
WardBrian opened this issue Mar 7, 2024 · 0 comments

UTS #39 (Unicode Security Mechanisms) specifies a list of "confusables" as well as "intentional confusables". These are characters, such as certain Greek and Cyrillic letters, that look identical to characters in other scripts but are not normalized to each other.

It would be very helpful to have some way to identify these characters with each other, particularly the intentional confusables, since many of them are valid XID characters.

As an alternative to adding an extra database of values here, the algorithm given in Section 4 ("Confusability Detection") could be placed in the uunf package.
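
For illustration, here is a rough Python sketch of the skeleton comparison that Section 4 describes: normalize to NFD, replace each character with its prototype, then normalize to NFD again. The `PROTOTYPES` table is a tiny hand-picked stand-in for the real `confusables.txt` data, and the names `skeleton` and `confusable` are only illustrative. The sketch also skips the removal of default-ignorable code points that recent versions of the standard include, so treat it as an outline rather than a conforming implementation.

```python
import unicodedata

# Assumption: a tiny hand-picked excerpt standing in for confusables.txt.
# A real implementation would load the full data file instead.
PROTOTYPES = {
    "\u0430": "a",  # CYRILLIC SMALL LETTER A
    "\u0435": "e",  # CYRILLIC SMALL LETTER IE
    "\u043e": "o",  # CYRILLIC SMALL LETTER O
    "\u0440": "p",  # CYRILLIC SMALL LETTER ER
    "\u0441": "c",  # CYRILLIC SMALL LETTER ES
    "\u03bf": "o",  # GREEK SMALL LETTER OMICRON
}


def skeleton(s: str) -> str:
    """Simplified skeleton: NFD, map each character to its prototype, NFD again."""
    nfd = unicodedata.normalize("NFD", s)
    mapped = "".join(PROTOTYPES.get(ch, ch) for ch in nfd)
    return unicodedata.normalize("NFD", mapped)


def confusable(a: str, b: str) -> bool:
    """Two strings are treated as confusable when their skeletons are equal."""
    return skeleton(a) == skeleton(b)
```

With this table, `confusable("scope", "s\u0441\u043e\u0440\u0435")` holds because the Cyrillic letters map to their Latin look-alikes, while `confusable("scope", "slope")` does not.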

WardBrian added a commit to stan-dev/stanc3 that referenced this issue Mar 7, 2024
WardBrian added a commit to stan-dev/stanc3 that referenced this issue Mar 19, 2024