Contents:	About HKU Big5 and ETen Big5
		If you choose to select ETen Big5 as your Big5 encoding


About HKU Big5 and ETen Big5
============================

The situation is really complicated with the Chinese encoding standard
used in Hong Kong and Taiwan.
In addition of Big5, which is widely used, there are other standards
and de facto standards.
Even different vendors may have slightly different versions of Big5.
HKU-Big5 is a version of Big5 defined by X window font hku-ch16,
which is the only freely available font known to us.

The major difference between the ETEN and HKU Big5 codings is
the offset of non-frequent Big5 Hanzi codes in the ETEN coding scheme:
(the legal 2nd byte is 0x40--0x7e and 0xa1--0xfe)

    portions of BIG5 codes    coding in ETEN      coding in HKU-Ch16
    ----------------------    ----------------    ----------------
    Non-Chinese symbols       0xa140 -- 0xa3bf    0xa140 -- 0xa3fe
    Frequent (5401)           0xa440 -- 0xc67e    0xa440 -- 0xc67e
    Non-frequent (7652)       0xc940 -- 0xf9d5    0xc6a1 -- 0xf755
    Non-BIG5 supplement       0xf9d6 -- ??          no

A program et2hku (or, sometime renamed as et2hkubig5, or et2b5) is
supplied in this directory to convert text file between this two
coding format.


If you choose to select ETen Big5 as your Big5 encoding
=======================================================

All Big-5 dictionaries and hku-ch16.bdf font distributed with
cxterm is in HKU Big5 format.  Thus the cxterm using these font
and dictionaries is in HKU Big5 encoding and only reads and writes
files in that format.  However, since cxterm is coding-independent,
you can choose to install ETen Big5 encoding fonts and dictionaries.

Following these steps:

1) convert hku-ch16.bdf font to ETen encoding.  (If you already have ETen
   encoding fonts, this step is optional.)  Assume your hku-ch16.bdf is
   put in ~/xfonts.

	% font-hku2et-ch16.sh ~/xfonts

2) redo all the font installation steps ("bdftosnf" or "bdftopcf",
   "mkfontdir", "xset", etc).
	
3) convert all cxterm dictionaries to ETen encoding.  Assume your .tit
   format dictionaries are in ~/src/cxterm/dict/tit/ (or, in X11R4,
   ~/src/cxterm/Dict/).

	% make					# produce et2hku first.
	% tit-hku2et.sh ~/src/cxterm/dict/tit

4) rebuild and reinstall the dictionaries.

	% (cd ~/src/cxterm/; make install)


NOTES:

Since hku-ch16 is contributed by HKU postgrad society.
The converted et-ch16.bdf is still property of the contributor.
I don't know whether the contributor will have any objection to the
redistribution of et-ch16.bdf, since the encoding scheme is changed.

-----
ID: cxterm 11.5.1 (p3), Oct/28/1991, ygz@cs.purdue.edu
