| 1 |
NOTE TO TRANSLATORS (2004-02-03):
|
| 2 |
|
| 3 |
The Debian web team has started removing translations that are more
|
| 4 |
than six months outdated from the English version. So if you want to
|
| 5 |
start translating a new page, first check the CVS and see if there is
|
| 6 |
an (perhaps extremely outdated) deleted version! You may save some
|
| 7 |
work that way.
|
| 8 |
|
| 9 |
Chuan-kai Lin <cklin@debian.org>
|
| 10 |
|
| 11 |
|
| 12 |
|
| 13 |
Some notes and ramblings on Chinese translations (and the fun of
|
| 14 |
maintaining both Big5 and GB pages and hope that all the characters
|
| 15 |
show up properly. :-)
|
| 16 |
|
| 17 |
Note: This document may contain Big5 code.
|
| 18 |
|
| 19 |
Content Negotiation:
|
| 20 |
~~~~~~~~~~~~~~~~~~~
|
| 21 |
lang charset
|
| 22 |
------ ------ -------
|
| 23 |
zh-CN .zh-cn Big5
|
| 24 |
zh-TW .zh-tw GB2312
|
| 25 |
|
| 26 |
|
| 27 |
Big5<->GB is quite a challenging problem.
|
| 28 |
|
| 29 |
Big5 is *bad*!! Its relationship to Unicode is _not_ one-to-one,
|
| 30 |
and is giving us a lot of headaches. Fortunately, in the upcoming
|
| 31 |
Unicode 3.0, much (if not all) of this issue would be resolved.
|
| 32 |
|
| 33 |
The following is from
|
| 34 |
ftp://ftp.unicode.org/Public/MAPPINGS/EASTASIA/OTHER/BIG5.TXT
|
| 35 |
|
| 36 |
# WARNING! It is currently impossible to provide round-trip compatibility
|
| 37 |
# between BIG5 and Unicode.
|
| 38 |
#
|
| 39 |
# A number of characters are not currently mapped because
|
| 40 |
# of conflicts with other mappings. They are as follows:
|
| 41 |
#
|
| 42 |
# BIG5 Description Comments
|
| 43 |
#
|
| 44 |
# 0xA15A SPACING UNDERSCORE duplicates A1C4
|
| 45 |
# 0xA1C3 SPACING HEAVY OVERSCORE not in Unicode
|
| 46 |
# 0xA1C5 SPACING HEAVY UNDERSCORE not in Unicode
|
| 47 |
# 0xA1FE LT DIAG UP RIGHT TO LOW LEFT duplicates A2AC
|
| 48 |
# 0xA240 LT DIAG UP LEFT TO LOW RIGHT duplicates A2AD
|
| 49 |
# 0xA2CC HANGZHOU NUMERAL TEN conflicts with A451 mapping
|
| 50 |
# 0xA2CE HANGZHOU NUMERAL THIRTY conflicts with A4CA mapping
|
| 51 |
#
|
| 52 |
# We currently map all of these characters to U+FFFD REPLACEMENT CHARACTER
|
| 53 |
|
| 54 |
Another reference is the Big5+ standard tables. At least it won't
|
| 55 |
leave any Big5+ codes dangling. :-) It does include a Big5+ to GBK
|
| 56 |
table, but then, we want GB, not GBK. Hmm...
|
| 57 |
|
| 58 |
|
| 59 |
Converter
|
| 60 |
~~~~~~~~~
|
| 61 |
* Don't bother with tcs. Due to the traditional/simplified character
|
| 62 |
issue, tcs simply doesn't work well at all.
|
| 63 |
|
| 64 |
* utf-converter works but need more tweaking to get everything translated
|
| 65 |
properly. (1999-12-28 update: Nah, let's not bother with utf-converter
|
| 66 |
when we have zh-AutoConvert. :-)
|
| 67 |
|
| 68 |
|
| 69 |
-- Anthony Fok <foka@debian.org>, Fri, 16 Apr 1999 05:11:03 -0600
|