-
-
Notifications
You must be signed in to change notification settings - Fork 136
Accents in non-latin symbols. #217
Copy link
Copy link
Open
Description
Hello.
I'm trying to use Inflector::unaccent to compare strings the same way as mysql with utf8_unicode_ci(_ai that can be omitted) collation compare.
I found out that some (or all) cyrillic letters are not unaccented. For example, russian ё (https://www.compart.com/en/unicode/U+0451) and ukrainian ї(https://www.compart.com/en/unicode/U+0457) which are widely used. The mysql compares them properly and removes accents.
How did your compose the Inflector::ACCENTED_CHARACTERS? Can we add non-latin symbols there?
I guess this can be done automatically using official unicode data: https://unicode.org/Public/UNIDATA/UnicodeData.txt
The 6th column shows what is the character composed of (or it's empty if the character is not accented).
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels