Index Index for
Section 5
Index Alphabetical
listing for C
Bottom of page Bottom of
page

Chinese(5)

NAME

Chinese, chinese - Introduction to Chinese language support

DESCRIPTION

There are two kinds of written Chinese characters, traditional Chinese and simplified Chinese. The former is widely used in Hong Kong (HK) and Taiwan, while the latter is more widely used in the People's Republic of China (PRC) and Singapore. Codesets There are several coded character sets (codesets) available to support each of the two written Chinese languages. For each Chinese language, the following list notes the supported codesets and the strings that represent those codesets in the names of locales, converters, and other kinds of system files: · Codesets for traditional Chinese: DEC Hanyu, dechanyu For more information, see dechanyu(5). Taiwanese EUC (Extended UNIX Code), eucTW For more information, see eucTW(5). Big-5, big5 For more information, see big5(5). Shift Big-5, sbig5 For more information, see sbig5(5). The Shift Big-5 codeset is supported for codeset conversion only; it is not used in locales nor is it used for input and output operations. TeleCode, telecode For more information, see telecode(5). The Telecode codeset is supported for codeset conversion only; it is not used in locales nor used for input and output operations. PC code page, cp950 The cp950 encoding format is identical to big5. For more information, see code_page(5). UTF-16, UCS-4, and UTF-8 (UTF-16, UCS-4, UTF-8) For more information, see Unicode(5). · Codesets for simplified Chinese: DEC Hanzi, dechanzi For more information, see dechanzi(5). GB18030 (gb18030, GB18030) For more information, see GB18030(5). GBK (gbk, GBK) For more information, see GBK(5). PC code page, cp936 The cp936 encoding format is identical to dechanzi except that the code page supports additional characters. For more information, see code_page(5). UTF-16, UCS-4, and UTF-8 (UTF-16, UCS-4, UTF-8) For more information, see Unicode(5). Locales The following list groups supported locales according to language, country or territory, and codeset. Each locale supports one collation sequence, which is specified following the locale name. · For traditional Chinese, Taiwan: Locales using DEC Hanyu: zh_TW.dechanyu Collation using internal code order zh_TW.dechanyu@radical Collation by radical zh_TW.dechanyu@stroke Collation by stroke zh_TW.dechanyu@chuyin Chuyin (phonetic) collation Locales using Taiwanese EUC: zh_TW.eucTW Collation using internal code order zh_TW.eucTW@radical Collation by radical zh_TW.eucTW@stroke Collation by stroke zh_TW.eucTW@chuyin Chuyin (phonetic) collation Locales using Big-5: zh_TW.big5 Collation using internal code order zh_TW.big5@radical Collation by radical zh_TW.big5@stroke Collation by stroke zh_TW.big5@chuyin Chuyin (phonetic) collation Locale using UTF-8: zh_TW.UTF-8 Collation is the same as for zh_TW.dechanyu · For traditional Chinese, Hong Kong: Locales using DEC Hanyu: zh_HK.dechanyu Collation using internal code order Locales using Taiwanese EUC: zh_HK.eucTW Collation using internal code order Locales using Big-5: zh_HK.big5 Collation using internal code order Locale using UTF-8: zh_HK.UTF-8 Collation is the same as for zh_HK.dechanyu · For simplified Chinese, Hong Kong: Locale using DEC Hanzi: zh_HK.dechanzi Collation using internal code order · For simplified Chinese, PRC: Locales using DEC Hanzi: zh_CN.dechanzi Collation using internal code order zh_CN.dechanzi@radical Collation by radical zh_CN.dechanzi@stroke Collation by stroke zh_CN.dechanzi@pinyin Pinyin (Phonetic) collation Locale using GBK: zh_CN.GBK Collation using internal code order (however, all characters in the GB 2312 character set are ordered first, followed by the characters in the GBK extension) Locale using GB18030 zh_CN.GB18030 Collation using internal code order (collation sequence reflects the subset to superset relationship; thus all characters in GB 2312 character set are ordered first, followed by the characters in GBK not already ordered, followed by the characters in GB18030 not already ordered) Locale using UTF-8: zh_CN.UTF-8 Collation is the same as for zh_CN.GB18030 The asort(1) reference page contains more information on the collation sequences used for Asian languages. You can use the locale command (see locale(1)) to display the names of locales installed on your system. See i18n_intro(5) for information on setting locale from the operating system command line. For the Common Desktop environment (CDE), you also need to set your session language to an appropriate value by using the Language menu, which you access by clicking on the Options button of the Login window. Keyboards, Input Servers, and Input Methods You can use any standard English PC style keyboard to enter Chinese. See the keyboard(5) reference page for information about loading keyboard mapping tables (keymaps) for keyboards. To support Chinese input in a Motif application environment such as CDE, the operating system provides the following input servers: · For traditional Chinese: dxhanyuim See dxhanyuim(1X) for more information. · For simplified Chinese: dxhanziim See dxhanziim(1X) for more information. · For both traditional and simplified Chinese: dxim See dxim(1X) for more information. The dxhanziim and dxhanyuim input servers are started automatically for a CDE session based on the locale setting chosen through the login window. See the appropriate input server reference page for information about starting an input server from the command line. The operating system supports the following Chinese input methods, listed by name under the written language with which the methods are used. Next to the name is the function key used to switch to that input method when the window for the input server is active. · For traditional Chinese: (These input methods are enabled when the dxhanyuim server is running.) Internal code, F8 Phonetic, F10 Phrase, F9 Quick Tsang-Chi, F7 Symbol Tsang-Chi, F6 Note that there are no locales to support the Shift Big-5 and Telecode character sets, so these characters cannot be input directly. · For simplified Chinese: (These input methods are enabled when the dxhanziim server is running.) 5-Stroke, F6 5-Shape, F10 Pin-Yin, or Phonetic, F8 Qu-Wei or Row-Column in GB2312-80, F7 Telex Code, F9 Phrase Input, F5 Intelligent ABC, F4 Of these input methods, the Intelligent ABC input method is the most recent implementation. It is also the input method used on PCs running Microsoft Windows. · For traditional and simplified Chinese (use the function key to invoke either traditional or simplified Chinese in combination with the function key for the input method): (These input methods are enabled when the dxim server is running.) Traditional Chinese, F2: Internal code, F9 Phonetic, F7 Quick Tsang-Chi, F6 Symbol, F8 Tsang-Chi, F5 Simplified Chinese, F1: 5-Stroke, F6 5-Shape, F8 Qu-Wei or Row-Column in GB2312-80, F7 Telex Code, F10 Symbol Input, F9 Intelligent ABC, F5 For the following terminals or keyboard settings, you can toggle between the English input mode and Chinese input mode by using a particular key or key sequence: · For either traditional Chinese on a VT382-D terminal keyboard or simplified Chinese on a VT382-C terminal keyboard, press Compose. · In CDE for PC-style keyboard settings, use Shift+Space (for Hanzi) or Alt+Space (for Hanyu) · The dxim default key sequence is Ctrl+Space The preceding key sequences are defaults. You can override them by setting other keys to perform the same function. Chinese Terminals The operating system supports the VT382-D and the VT382-C terminals for traditional Chinese and simplified Chinese, respectively. Running X and Motif Applications X or Motif applications require non-ASCII fonts to display Chinese characters. Therefore, the font path must be set appropriately before starting an application that displays Chinese characters. An application can find Chinese fonts for codesets other than GBK or GB18030 in either of the following directories: · /usr/i18n/lib/X11/fonts/decwin/75dpi, for low resolution display · /usr/i18n/lib/X11/fonts/decwin/100dpi, for high resolution display Chinese fonts for the GBK and GB18030 codesets are installed in /usr/i18n/lib/X11/fonts/SChineseTT. For applications running under CDE, you should not need to enter manual commands to ensure that the appropriate fonts are found as long as those fonts are installed on the system or available through a font server. For other applications, you can use the following command to check the font path: % xset q If one of the directories in the preceding list is not in the font path, the following example shows how to add the directory. You can substitute 100dp for 75dpi if you want high resolution display. % xset +fp /usr/i18n/lib/X11/decwin/75dpi/ % xset fp rehash Printers The operating system supports the following Chinese printers, grouped by language. The associated print filter is noted in parentheses following the printer name. · For traditional Chinese: CP382-D (cp382dof) The CP382-D is a Chinese dot matrix printer. Others The operating system also supports text printers that have built- in traditional Chinese fonts and text printers to which traditional Chinese fonts can be downloaded. · For simplified Chinese: LA88-C (la88cof) The LA88-C is a Chinese dot matrix printer. LA380-CB (la380cbof) The LA380-CB is a Chinese graphic line printer. In addition, the pcfof and wwpsof generic print filters are available for use with many other printers to support Chinese printing. For more information on setting up and configuring printers, refer to i18n_printing(5) and lprsetup(8).

SEE ALSO

Commands: asort(1), locale(1), lp(1), lpr(1), dxhanyuim(1X), dxhanziim(1X), dxim(1X), xset(1X), lpd(8), lprsetup(8) Files: printcap(4) Others: big5(5), code_page(5), dechanyu(5), dechanzi(5), eucTW(5), GB18030(5), GBK(5), i18n_intro(5), i18n_printing(5), iconv_intro(5), keyboard(5), l10n_intro(5), sbig5(5), telecode(5), Unicode(5) Writing Software for the International Market Using International Software

Index Index for
Section 5
Index Alphabetical
listing for C
Top of page Top of
page