comparison runtime/doc/mbyte.txt @ 236:4707450c2b33

updated for version 7.0066
author vimboss
date Fri, 15 Apr 2005 21:00:38 +0000
parents c93c9cad9618
children 0f6afaf1b8d1
comparison
235:23d380e32c95 236:4707450c2b33
1 *mbyte.txt* For Vim version 7.0aa. Last change: 2005 Feb 13 1 *mbyte.txt* For Vim version 7.0aa. Last change: 2005 Mar 29
2 2
3 3
4 VIM REFERENCE MANUAL by Bram Moolenaar et al. 4 VIM REFERENCE MANUAL by Bram Moolenaar et al.
5 5
6 6
434 Hc is "Hanzi Converter". Hc convert a GB file to a Big5 file, or Big5 434 Hc is "Hanzi Converter". Hc convert a GB file to a Big5 file, or Big5
435 file to GB file. Hc can be found at: 435 file to GB file. Hc can be found at:
436 ftp://ftp.cuhk.hk/pub/chinese/ifcss/software/unix/convert/hc-30.tar.gz 436 ftp://ftp.cuhk.hk/pub/chinese/ifcss/software/unix/convert/hc-30.tar.gz
437 437
438 Korean: hmconv 438 Korean: hmconv
439 Hmconv is Korean code conversion utility especially for E-mail. It can 439 Hmconv is Korean code conversion utility especially for E-mail. It can
440 convert between EUC-KR and ISO-2022-KR. Hmconv can be found at: 440 convert between EUC-KR and ISO-2022-KR. Hmconv can be found at:
441 ftp://ftp.kaist.ac.kr/pub/hangul/code/hmconv/ 441 ftp://ftp.kaist.ac.kr/pub/hangul/code/hmconv/
442 442
443 Multilingual: lv 443 Multilingual: lv
444 Lv is a Powerful Multilingual File Viewer. And it can be worked as 444 Lv is a Powerful Multilingual File Viewer. And it can be worked as
445 |charset| converter. Supported |charset|: ISO-2022-CN, ISO-2022-JP, 445 |charset| converter. Supported |charset|: ISO-2022-CN, ISO-2022-JP,
446 ISO-2022-KR, EUC-CN, EUC-JP, EUC-KR, EUC-TW, UTF-7, UTF-8, ISO-8859 446 ISO-2022-KR, EUC-CN, EUC-JP, EUC-KR, EUC-TW, UTF-7, UTF-8, ISO-8859
447 series, Shift_JIS, Big5 and HZ. Lv can be found at: 447 series, Shift_JIS, Big5 and HZ. Lv can be found at:
448 http://www.ff.iij4u.or.jp/~nrt/freeware/lv4495.tar.gz 448 http://www.ff.iij4u.or.jp/~nrt/freeware/lv4495.tar.gz
449 449
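The kind of conversion these tools perform can be sketched with Python's standard codecs. This is only an illustration, not how hmconv or lv work internally; the helper name `convert` and the sample text are assumptions made for the example. It re-encodes EUC-KR bytes as ISO-2022-KR, the pair hmconv is described as handling.

```python
def convert(data: bytes, src: str, dst: str) -> bytes:
    """Re-encode bytes from the src charset to the dst charset via Unicode."""
    return data.decode(src).encode(dst)

# Two Hangul syllables (U+D55C U+AE00) used as sample input.
euc = "\ud55c\uae00".encode("euc_kr")

# EUC-KR is an 8-bit encoding; ISO-2022-KR is a 7-bit encoding that uses
# escape and shift sequences, which is why it was common for e-mail.
iso = convert(euc, "euc_kr", "iso2022_kr")

# Converting back yields the original EUC-KR bytes.
round_trip = convert(iso, "iso2022_kr", "euc_kr")
```

Note that a GB to Big5 conversion such as the one Hc performs also involves mapping between simplified and traditional characters, which a plain re-encoding like this does not do.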
450 450
451 *mbyte-conversion* 451 *mbyte-conversion*
452 When reading and writing files in an encoding different from 'encoding', 452 When reading and writing files in an encoding different from 'encoding',
504 504
505 xterm -u8 -fn -misc-fixed-medium-r-semicondensed--13-120-75-75-c-60-iso10646-1 505 xterm -u8 -fn -misc-fixed-medium-r-semicondensed--13-120-75-75-c-60-iso10646-1
506 or, for bigger character: > 506 or, for bigger character: >
507 xterm -u8 -fn -misc-fixed-medium-r-normal--15-140-75-75-c-90-iso10646-1 507 xterm -u8 -fn -misc-fixed-medium-r-normal--15-140-75-75-c-90-iso10646-1
508 508
509 and you will have a working UTF-8 terminal emulator. Try both > 509 and you will have a working UTF-8 terminal emulator. Try both >
510 510
511 cat utf-8-demo.txt 511 cat utf-8-demo.txt
512 vim utf-8-demo.txt 512 vim utf-8-demo.txt
513 513
514 with the demo text that comes with ucs-fonts.tar.gz in order to see 514 with the demo text that comes with ucs-fonts.tar.gz in order to see
759 we pre-input Hira-gana, second Kana-Kanji conversion. There are so many 759 we pre-input Hira-gana, second Kana-Kanji conversion. There are so many
760 Kanji characters (6349 Kanji characters are defined in JIS X 0208) and the 760 Kanji characters (6349 Kanji characters are defined in JIS X 0208) and the
761 number of Hira-gana characters are 76. So, first, we pre-input text as 761 number of Hira-gana characters are 76. So, first, we pre-input text as
762 pronounced in Hira-gana, second, we convert Hira-gana to Kanji or Kata-Kana, 762 pronounced in Hira-gana, second, we convert Hira-gana to Kanji or Kata-Kana,
763 if needed. There are some Kana-Kanji conversion server: jserver 763 if needed. There are some Kana-Kanji conversion server: jserver
764 (distributed with Wnn, see below) and canna. Canna could be found at: 764 (distributed with Wnn, see below) and canna. Canna could be found at:
765 ftp://ftp.nec.co.jp/pub/Canna/ (no longer works). 765 ftp://ftp.nec.co.jp/pub/Canna/ (no longer works).
766 766
767 There is a good input system: Wnn4.2. Wnn 4.2 contains, 767 There is a good input system: Wnn4.2. Wnn 4.2 contains,
768 xwnmo (|IM-server|) 768 xwnmo (|IM-server|)
769 jserver (Japanese Kana-Kanji conversion server) 769 jserver (Japanese Kana-Kanji conversion server)
799 799
800 *. on-the-spot *OnTheSpot* 800 *. on-the-spot *OnTheSpot*
801 Preedit Area and Status Area are performed by the client application in 801 Preedit Area and Status Area are performed by the client application in
802 the area of application. The client application is directed by the 802 the area of application. The client application is directed by the
803 |IM-server| to display all pre-edit data at the location of text 803 |IM-server| to display all pre-edit data at the location of text
804 insertion. The client registers callbacks invoked by the input method 804 insertion. The client registers callbacks invoked by the input method
805 during pre-editing. 805 during pre-editing.
806 *. over-the-spot *OverTheSpot* 806 *. over-the-spot *OverTheSpot*
807 Status Area is created in a fixed position within the area of application, 807 Status Area is created in a fixed position within the area of application,
808 in case of Vim, the position is the additional status line. Preedit Area 808 in case of Vim, the position is the additional status line. Preedit Area
809 is made at present input position of application. The input method 809 is made at present input position of application. The input method
1229 is no longer relevant in the GTK+ 2 GUI. 1229 is no longer relevant in the GTK+ 2 GUI.
1230 1230
1231 Up to two combining characters can be used. The combining character is drawn 1231 Up to two combining characters can be used. The combining character is drawn
1232 on top of the preceding character. When editing text a composing character is 1232 on top of the preceding character. When editing text a composing character is
1233 mostly considered part of the preceding character. For example "x" will 1233 mostly considered part of the preceding character. For example "x" will
1234 delete a character and its following composing characters by default. If the 1234 delete a character and its following composing characters by default. If the
1235 'delcombine' option is on, then pressing 'x' will delete the combining 1235 'delcombine' option is on, then pressing 'x' will delete the combining
1236 characters, one at a time, then the base character. But when inserting, you 1236 characters, one at a time, then the base character. But when inserting, you
1237 type the first character and the following composing characters separately, 1237 type the first character and the following composing characters separately,
1238 after which they will be joined. The "r" command will not allow you to type a 1238 after which they will be joined. The "r" command will not allow you to type a
1239 combining character, because it doesn't know one is coming. Use "R" instead. 1239 combining character, because it doesn't know one is coming. Use "R" instead.
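The deletion behavior described above can be modeled roughly in Python. This is a sketch under stated assumptions, not Vim's implementation: the names `split_cells` and `delete_under_cursor` are hypothetical, a nonzero `unicodedata.combining()` class is used as a simplified test for "combining character", and the choice to remove the last combining character first is an assumption the text does not confirm.

```python
import unicodedata

def split_cells(text):
    """Group each base character with the combining characters that follow it."""
    cells = []
    for ch in text:
        if unicodedata.combining(ch) and cells:
            cells[-1] += ch      # attach to the preceding base character
        else:
            cells.append(ch)     # start a new displayed character
    return cells

def delete_under_cursor(text, index, delcombine=False):
    """Mimic "x" on the displayed character at position index."""
    cells = split_cells(text)
    cell = cells[index]
    if delcombine and len(cell) > 1:
        # Assumption: remove one combining character (the last one) per press.
        cells[index] = cell[:-1]
    else:
        # Default: the base character and all its combining characters go.
        del cells[index]
    return "".join(cells)
```

For example, with the text `"e\u0301\u0308x"` (an "e" carrying two combining characters, followed by "x"), the default call removes the whole first cell, while repeated calls with `delcombine=True` first strip the combining characters one at a time and only then remove the base "e".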
1251 bits). However, displaying only works for 16 bit characters, and only for the 1251 bits). However, displaying only works for 16 bit characters, and only for the
1252 characters present in the selected font. 1252 characters present in the selected font.
1253 1253
1254 Useful commands: 1254 Useful commands:
1255 - "ga" shows the decimal, hexadecimal and octal value of the character under 1255 - "ga" shows the decimal, hexadecimal and octal value of the character under
1256 the cursor. If there are composing characters these are shown too. (if the 1256 the cursor. If there are composing characters these are shown too. (If the
1257 message is truncated, use ":messages"). 1257 message is truncated, use ":messages").
1258 - "g8" shows the bytes used in a UTF-8 character, also the composing 1258 - "g8" shows the bytes used in a UTF-8 character, also the composing
1259 characters, as hex numbers. 1259 characters, as hex numbers.
1260 - ":set encoding=utf-8 fileencodings=" forces using UTF-8 for all files. The 1260 - ":set encoding=utf-8 fileencodings=" forces using UTF-8 for all files. The
1261 default is to use the current locale for 'encoding' and set 'fileencodings' 1261 default is to use the current locale for 'encoding' and set 'fileencodings'