[PATCH] [BZ 14094] Update locale data to Unicode 7.0.0
Pravin Satpute
psatpute@redhat.com
Sat Jun 21 19:28:00 GMT 2014
Hi All,
Attached patch to fix this long pending issue.
A. Process for updating locales/i18n ctype with new Unicode release is
documented @ [1], I think it should get added either in WIKI, or docs
folder of glibc.
B. Patch adds two scripts to scripts folder and updates locales/i18n file
gen-unicode-ctype-dcp.py - To generate upper, lower and alpha class
from DerivedCoreProperties.txt [2]
check-backcompatibility.py - to test whether updated locales/i18n
is backward compatible with older one.
C. Best way to check new updated i18n file is compatible with existing
i18n file is
a. copy new i18n file as a i18nnew
b. check-backcompatibility.py i18n i18nnew > Report
D. By using better file DerivedCoreProperties.txt from UCD for
generating CTYPE, we found number of characters were improperly mapped
to 'alpha' categories.
Report/Analysis for backward compatibility is available AT
backward-compatibility5_1-to-7_0 [3]
Please suggest me proper place for backward-compatibility5_1-to-7_0
Next i am looking for UTF-8 file.
Thanks,
Pravin Satpute
1. https://github.com/pravins/glibc-i18n/blob/master/steps-to-generate-ctype
2. https://sourceware.org/bugzilla/show_bug.cgi?id=14094#c9
3.
https://github.com/pravins/glibc-i18n/blob/master/unicode7-0/backward-compatibility5_1-to-7_0
-------------- next part --------------
A non-text attachment was scrubbed...
Name: bug-14094.patch
Type: text/x-patch
Size: 200442 bytes
Desc: not available
URL: <http://sourceware.org/pipermail/libc-alpha/attachments/20140621/428d57ba/attachment.bin>
More information about the Libc-alpha
mailing list