Zsh Mailing List Archive
Messages sorted by: Reverse Date, Date, Thread, Author

Re: utf-8



On 12/18/2014 12:52 PM, ZyX wrote:
See http://www.unicode.org/Public/UCD/latest/ucd/UnicodeData.txt, third column. Read http://www.unicode.org/reports/tr44/tr44-14.html#General_Category_Values for the explanation of the values; you need L* and N*. (Note: testing shows that not all N* are relevant: No is not (tests: CIRCLED DIGIT ONE, VULGAR FRACTION ONE QUARTER), while Nd (DIGIT ONE, FULLWIDTH DIGIT ONE) and Nl (RUNIC ARLAUG SYMBOL) are.) I highly suggest seeking the answer in the libc sources if you need better precision.
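
For illustration only, here is a rough sketch of reading the General_Category out of the third semicolon-separated field. It does not download the real file; the sample rows are abbreviated to their first three fields (code point, name, category), and `gc_of` and `sample_rows` are made-up names for this example, not part of zsh or any library.

```shell
# Sample rows in UnicodeData.txt layout, abbreviated to the first three
# semicolon-separated fields: code point; character name; General_Category.
sample_rows='0031;DIGIT ONE;Nd
00BC;VULGAR FRACTION ONE QUARTER;No
16EE;RUNIC ARLAUG SYMBOL;Nl
2460;CIRCLED DIGIT ONE;No
FF11;FULLWIDTH DIGIT ONE;Nd'

# gc_of NAME: print the General_Category (third field) of the row whose
# name (second field) matches NAME exactly. Hypothetical helper.
gc_of() {
    printf '%s\n' "$sample_rows" |
        awk -F';' -v name="$1" '$2 == name { print $3 }'
}

gc_of 'CIRCLED DIGIT ONE'      # prints: No
gc_of 'RUNIC ARLAUG SYMBOL'    # prints: Nl
```

Against a downloaded copy of the full file, the same awk filter should work directly, for example `awk -F';' '$2 == "DIGIT ONE" { print $3 }' UnicodeData.txt`.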

That is very generous. I can think of only one more question: what happens to things like escapes in a language 'above' normal ASCII? Say you were writing in Russian:

echo "\nRussian is a very expressive language.\n"

... if that were in Cyrillic characters, how would one indicate '\n'?


