 |
Index for Section 5 |
|
 |
Alphabetical listing for C |
|
 |
Bottom of page |
|
Chinese(5)
NAME
Chinese, chinese - Introduction to Chinese language support
DESCRIPTION
There are two kinds of written Chinese characters, traditional Chinese and
simplified Chinese. The former is widely used in Hong Kong (HK) and Taiwan,
while the latter is more widely used in the People's Republic of China
(PRC) and Singapore.
Codesets
There are several coded character sets (codesets) available to support each
of the two written Chinese languages. For each Chinese language, the
following list notes the supported codesets and the strings that represent
those codesets in the names of locales, converters, and other kinds of
system files:
· Codesets for traditional Chinese:
DEC Hanyu, dechanyu
For more information, see dechanyu(5).
Taiwanese EUC (Extended UNIX Code), eucTW
For more information, see eucTW(5).
Big-5, big5
For more information, see big5(5).
Shift Big-5, sbig5
For more information, see sbig5(5).
The Shift Big-5 codeset is supported for codeset conversion only;
it is not used in locales nor is it used for input and output
operations.
TeleCode, telecode
For more information, see telecode(5).
The Telecode codeset is supported for codeset conversion only; it
is not used in locales nor used for input and output operations.
PC code page, cp950
The cp950 encoding format is identical to big5. For more
information, see code_page(5).
UTF-16, UCS-4, and UTF-8 (UTF-16, UCS-4, UTF-8)
For more information, see Unicode(5).
· Codesets for simplified Chinese:
DEC Hanzi, dechanzi
For more information, see dechanzi(5).
GB18030 (gb18030, GB18030)
For more information, see GB18030(5).
GBK (gbk, GBK)
For more information, see GBK(5).
PC code page, cp936
The cp936 encoding format is identical to dechanzi except that
the code page supports additional characters. For more
information, see code_page(5).
UTF-16, UCS-4, and UTF-8 (UTF-16, UCS-4, UTF-8)
For more information, see Unicode(5).
Locales
The following list groups supported locales according to language, country
or territory, and codeset. Each locale supports one collation sequence,
which is specified following the locale name.
· For traditional Chinese, Taiwan:
Locales using DEC Hanyu:
zh_TW.dechanyu
Collation using internal code order
zh_TW.dechanyu@radical
Collation by radical
zh_TW.dechanyu@stroke
Collation by stroke
zh_TW.dechanyu@chuyin
Chuyin (phonetic) collation
Locales using Taiwanese EUC:
zh_TW.eucTW
Collation using internal code order
zh_TW.eucTW@radical
Collation by radical
zh_TW.eucTW@stroke
Collation by stroke
zh_TW.eucTW@chuyin
Chuyin (phonetic) collation
Locales using Big-5:
zh_TW.big5
Collation using internal code order
zh_TW.big5@radical
Collation by radical
zh_TW.big5@stroke
Collation by stroke
zh_TW.big5@chuyin
Chuyin (phonetic) collation
Locale using UTF-8:
zh_TW.UTF-8
Collation is the same as for zh_TW.dechanyu
· For traditional Chinese, Hong Kong:
Locales using DEC Hanyu:
zh_HK.dechanyu
Collation using internal code order
Locales using Taiwanese EUC:
zh_HK.eucTW
Collation using internal code order
Locales using Big-5:
zh_HK.big5
Collation using internal code order
Locale using UTF-8:
zh_HK.UTF-8
Collation is the same as for zh_HK.dechanyu
· For simplified Chinese, Hong Kong:
Locale using DEC Hanzi:
zh_HK.dechanzi
Collation using internal code order
· For simplified Chinese, PRC:
Locales using DEC Hanzi:
zh_CN.dechanzi
Collation using internal code order
zh_CN.dechanzi@radical
Collation by radical
zh_CN.dechanzi@stroke
Collation by stroke
zh_CN.dechanzi@pinyin
Pinyin (Phonetic) collation
Locale using GBK:
zh_CN.GBK
Collation using internal code order (however, all
characters in the GB 2312 character set are ordered first,
followed by the characters in the GBK extension)
Locale using GB18030
zh_CN.GB18030
Collation using internal code order (collation sequence
reflects the subset to superset relationship; thus all
characters in GB 2312 character set are ordered first,
followed by the characters in GBK not already ordered,
followed by the characters in GB18030 not already ordered)
Locale using UTF-8:
zh_CN.UTF-8
Collation is the same as for zh_CN.GB18030
The asort(1) reference page contains more information on the collation
sequences used for Asian languages.
You can use the locale command (see locale(1)) to display the names of
locales installed on your system. See i18n_intro(5) for information on
setting locale from the operating system command line.
For the Common Desktop environment (CDE), you also need to set your session
language to an appropriate value by using the Language menu, which you
access by clicking on the Options button of the Login window.
Keyboards, Input Servers, and Input Methods
You can use any standard English PC style keyboard to enter Chinese. See
the keyboard(5) reference page for information about loading keyboard
mapping tables (keymaps) for keyboards.
To support Chinese input in a Motif application environment such as CDE,
the operating system provides the following input servers:
· For traditional Chinese:
dxhanyuim
See dxhanyuim(1X) for more information.
· For simplified Chinese:
dxhanziim
See dxhanziim(1X) for more information.
· For both traditional and simplified Chinese:
dxim
See dxim(1X) for more information.
The dxhanziim and dxhanyuim input servers are started automatically for a
CDE session based on the locale setting chosen through the login window.
See the appropriate input server reference page for information about
starting an input server from the command line.
The operating system supports the following Chinese input methods, listed
by name under the written language with which the methods are used. Next to
the name is the function key used to switch to that input method when the
window for the input server is active.
· For traditional Chinese:
(These input methods are enabled when the dxhanyuim server is
running.)
Internal code, F8
Phonetic, F10
Phrase, F9
Quick Tsang-Chi, F7
Symbol
Tsang-Chi, F6
Note that there are no locales to support the Shift Big-5 and Telecode
character sets, so these characters cannot be input directly.
· For simplified Chinese:
(These input methods are enabled when the dxhanziim server is
running.)
5-Stroke, F6
5-Shape, F10
Pin-Yin, or Phonetic, F8
Qu-Wei or Row-Column in GB2312-80, F7
Telex Code, F9
Phrase Input, F5
Intelligent ABC, F4
Of these input methods, the Intelligent ABC input method is the most
recent implementation. It is also the input method used on PCs running
Microsoft Windows.
· For traditional and simplified Chinese (use the function key to invoke
either traditional or simplified Chinese in combination with the
function key for the input method):
(These input methods are enabled when the dxim server is running.)
Traditional Chinese, F2:
Internal code, F9
Phonetic, F7
Quick Tsang-Chi, F6
Symbol, F8
Tsang-Chi, F5
Simplified Chinese, F1:
5-Stroke, F6
5-Shape, F8
Qu-Wei or Row-Column in GB2312-80, F7
Telex Code, F10
Symbol Input, F9
Intelligent ABC, F5
For the following terminals or keyboard settings, you can toggle between
the English input mode and Chinese input mode by using a particular key or
key sequence:
· For either traditional Chinese on a VT382-D terminal keyboard or
simplified Chinese on a VT382-C terminal keyboard, press Compose.
· In CDE for PC-style keyboard settings, use Shift+Space (for Hanzi) or
Alt+Space (for Hanyu)
· The dxim default key sequence is Ctrl+Space
The preceding key sequences are defaults. You can override them by setting
other keys to perform the same function.
Chinese Terminals
The operating system supports the VT382-D and the VT382-C terminals for
traditional Chinese and simplified Chinese, respectively.
Running X and Motif Applications
X or Motif applications require non-ASCII fonts to display Chinese
characters. Therefore, the font path must be set appropriately before
starting an application that displays Chinese characters. An application
can find Chinese fonts for codesets other than GBK or GB18030 in either of
the following directories:
· /usr/i18n/lib/X11/fonts/decwin/75dpi, for low resolution display
· /usr/i18n/lib/X11/fonts/decwin/100dpi, for high resolution display
Chinese fonts for the GBK and GB18030 codesets are installed in
/usr/i18n/lib/X11/fonts/SChineseTT.
For applications running under CDE, you should not need to enter manual
commands to ensure that the appropriate fonts are found as long as those
fonts are installed on the system or available through a font server. For
other applications, you can use the following command to check the font
path:
% xset q
If one of the directories in the preceding list is not in the font path,
the following example shows how to add the directory. You can substitute
100dp for 75dpi if you want high resolution display.
% xset +fp /usr/i18n/lib/X11/decwin/75dpi/
% xset fp rehash
Printers
The operating system supports the following Chinese printers, grouped by
language. The associated print filter is noted in parentheses following the
printer name.
· For traditional Chinese:
CP382-D (cp382dof)
The CP382-D is a Chinese dot matrix printer.
Others
The operating system also supports text printers that have built-
in traditional Chinese fonts and text printers to which
traditional Chinese fonts can be downloaded.
· For simplified Chinese:
LA88-C (la88cof)
The LA88-C is a Chinese dot matrix printer.
LA380-CB (la380cbof)
The LA380-CB is a Chinese graphic line printer.
In addition, the pcfof and wwpsof generic print filters are available for
use with many other printers to support Chinese printing. For more
information on setting up and configuring printers, refer to
i18n_printing(5) and lprsetup(8).
SEE ALSO
Commands: asort(1), locale(1), lp(1), lpr(1), dxhanyuim(1X), dxhanziim(1X),
dxim(1X), xset(1X), lpd(8), lprsetup(8)
Files: printcap(4)
Others: big5(5), code_page(5), dechanyu(5), dechanzi(5), eucTW(5),
GB18030(5), GBK(5), i18n_intro(5), i18n_printing(5), iconv_intro(5),
keyboard(5), l10n_intro(5), sbig5(5), telecode(5), Unicode(5)
Writing Software for the International Market
Using International Software
 |
Index for Section 5 |
|
 |
Alphabetical listing for C |
|
 |
Top of page |
|