Mercurial > vim
comparison runtime/doc/mbyte.txt @ 236:4707450c2b33
updated for version 7.0066
author | vimboss |
---|---|
date | Fri, 15 Apr 2005 21:00:38 +0000 |
parents | c93c9cad9618 |
children | 0f6afaf1b8d1 |
comparison
equal
deleted
inserted
replaced
235:23d380e32c95 | 236:4707450c2b33 |
---|---|
1 *mbyte.txt* For Vim version 7.0aa. Last change: 2005 Feb 13 | 1 *mbyte.txt* For Vim version 7.0aa. Last change: 2005 Mar 29 |
2 | 2 |
3 | 3 |
4 VIM REFERENCE MANUAL by Bram Moolenaar et al. | 4 VIM REFERENCE MANUAL by Bram Moolenaar et al. |
5 | 5 |
6 | 6 |
434 Hc is "Hanzi Converter". Hc convert a GB file to a Big5 file, or Big5 | 434 Hc is "Hanzi Converter". Hc convert a GB file to a Big5 file, or Big5 |
435 file to GB file. Hc can be found at: | 435 file to GB file. Hc can be found at: |
436 ftp://ftp.cuhk.hk/pub/chinese/ifcss/software/unix/convert/hc-30.tar.gz | 436 ftp://ftp.cuhk.hk/pub/chinese/ifcss/software/unix/convert/hc-30.tar.gz |
437 | 437 |
438 Korean: hmconv | 438 Korean: hmconv |
439 Hmconv is Korean code conversion utility especially for E-mail. It can | 439 Hmconv is Korean code conversion utility especially for E-mail. It can |
440 convert between EUC-KR and ISO-2022-KR. Hmconv can be found at: | 440 convert between EUC-KR and ISO-2022-KR. Hmconv can be found at: |
441 ftp://ftp.kaist.ac.kr/pub/hangul/code/hmconv/ | 441 ftp://ftp.kaist.ac.kr/pub/hangul/code/hmconv/ |
442 | 442 |
443 Multilingual: lv | 443 Multilingual: lv |
444 Lv is a Powerful Multilingual File Viewer. And it can be worked as | 444 Lv is a Powerful Multilingual File Viewer. And it can be worked as |
445 |charset| converter. Supported |charset|: ISO-2022-CN, ISO-2022-JP, | 445 |charset| converter. Supported |charset|: ISO-2022-CN, ISO-2022-JP, |
446 ISO-2022-KR, EUC-CN, EUC-JP, EUC-KR, EUC-TW, UTF-7, UTF-8, ISO-8859 | 446 ISO-2022-KR, EUC-CN, EUC-JP, EUC-KR, EUC-TW, UTF-7, UTF-8, ISO-8859 |
447 series, Shift_JIS, Big5 and HZ. Lv can be found at: | 447 series, Shift_JIS, Big5 and HZ. Lv can be found at: |
448 http://www.ff.iij4u.or.jp/~nrt/freeware/lv4495.tar.gz | 448 http://www.ff.iij4u.or.jp/~nrt/freeware/lv4495.tar.gz |
449 | 449 |
450 | 450 |
451 *mbyte-conversion* | 451 *mbyte-conversion* |
452 When reading and writing files in an encoding different from 'encoding', | 452 When reading and writing files in an encoding different from 'encoding', |
504 | 504 |
505 xterm -u8 -fn -misc-fixed-medium-r-semicondensed--13-120-75-75-c-60-iso10646-1 | 505 xterm -u8 -fn -misc-fixed-medium-r-semicondensed--13-120-75-75-c-60-iso10646-1 |
506 or, for bigger character: > | 506 or, for bigger character: > |
507 xterm -u8 -fn -misc-fixed-medium-r-normal--15-140-75-75-c-90-iso10646-1 | 507 xterm -u8 -fn -misc-fixed-medium-r-normal--15-140-75-75-c-90-iso10646-1 |
508 | 508 |
509 and you will have a working UTF-8 terminal emulator. Try both > | 509 and you will have a working UTF-8 terminal emulator. Try both > |
510 | 510 |
511 cat utf-8-demo.txt | 511 cat utf-8-demo.txt |
512 vim utf-8-demo.txt | 512 vim utf-8-demo.txt |
513 | 513 |
514 with the demo text that comes with ucs-fonts.tar.gz in order to see | 514 with the demo text that comes with ucs-fonts.tar.gz in order to see |
759 we pre-input Hira-gana, second Kana-Kanji conversion. There are so many | 759 we pre-input Hira-gana, second Kana-Kanji conversion. There are so many |
760 Kanji characters (6349 Kanji characters are defined in JIS X 0208) and the | 760 Kanji characters (6349 Kanji characters are defined in JIS X 0208) and the |
761 number of Hira-gana characters are 76. So, first, we pre-input text as | 761 number of Hira-gana characters are 76. So, first, we pre-input text as |
762 pronounced in Hira-gana, second, we convert Hira-gana to Kanji or Kata-Kana, | 762 pronounced in Hira-gana, second, we convert Hira-gana to Kanji or Kata-Kana, |
763 if needed. There are some Kana-Kanji conversion server: jserver | 763 if needed. There are some Kana-Kanji conversion server: jserver |
764 (distributed with Wnn, see below) and canna. Canna could be found at: | 764 (distributed with Wnn, see below) and canna. Canna could be found at: |
765 ftp://ftp.nec.co.jp/pub/Canna/ (no longer works). | 765 ftp://ftp.nec.co.jp/pub/Canna/ (no longer works). |
766 | 766 |
767 There is a good input system: Wnn4.2. Wnn 4.2 contains, | 767 There is a good input system: Wnn4.2. Wnn 4.2 contains, |
768 xwnmo (|IM-server|) | 768 xwnmo (|IM-server|) |
769 jserver (Japanese Kana-Kanji conversion server) | 769 jserver (Japanese Kana-Kanji conversion server) |
799 | 799 |
800 *. on-the-spot *OnTheSpot* | 800 *. on-the-spot *OnTheSpot* |
801 Preedit Area and Status Area are performed by the client application in | 801 Preedit Area and Status Area are performed by the client application in |
802 the area of application. The client application is directed by the | 802 the area of application. The client application is directed by the |
803 |IM-server| to display all pre-edit data at the location of text | 803 |IM-server| to display all pre-edit data at the location of text |
804 insertion. The client registers callbacks invoked by the input method | 804 insertion. The client registers callbacks invoked by the input method |
805 during pre-editing. | 805 during pre-editing. |
806 *. over-the-spot *OverTheSpot* | 806 *. over-the-spot *OverTheSpot* |
807 Status Area is created in a fixed position within the area of application, | 807 Status Area is created in a fixed position within the area of application, |
808 in case of Vim, the position is the additional status line. Preedit Area | 808 in case of Vim, the position is the additional status line. Preedit Area |
809 is made at present input position of application. The input method | 809 is made at present input position of application. The input method |
1229 is no longer relevant in the GTK+ 2 GUI. | 1229 is no longer relevant in the GTK+ 2 GUI. |
1230 | 1230 |
1231 Up to two combining characters can be used. The combining character is drawn | 1231 Up to two combining characters can be used. The combining character is drawn |
1232 on top of the preceding character. When editing text a composing character is | 1232 on top of the preceding character. When editing text a composing character is |
1233 mostly considered part of the preceding character. For example "x" will | 1233 mostly considered part of the preceding character. For example "x" will |
1234 delete a character and its following composing characters by default. If the | 1234 delete a character and its following composing characters by default. If the |
1235 'delcombine' option is on, then pressing 'x' will delete the combining | 1235 'delcombine' option is on, then pressing 'x' will delete the combining |
1236 characters, one at a time, then the base character. But when inserting, you | 1236 characters, one at a time, then the base character. But when inserting, you |
1237 type the first character and the following composing characters separately, | 1237 type the first character and the following composing characters separately, |
1238 after which they will be joined. The "r" command will not allow you to type a | 1238 after which they will be joined. The "r" command will not allow you to type a |
1239 combining character, because it doesn't know one is coming. Use "R" instead. | 1239 combining character, because it doesn't know one is coming. Use "R" instead. |
1251 bits). However, displaying only works for 16 bit characters, and only for the | 1251 bits). However, displaying only works for 16 bit characters, and only for the |
1252 characters present in the selected font. | 1252 characters present in the selected font. |
1253 | 1253 |
1254 Useful commands: | 1254 Useful commands: |
1255 - "ga" shows the decimal, hexadecimal and octal value of the character under | 1255 - "ga" shows the decimal, hexadecimal and octal value of the character under |
1256 the cursor. If there are composing characters these are shown too. (if the | 1256 the cursor. If there are composing characters these are shown too. (If the |
1257 message is truncated, use ":messages"). | 1257 message is truncated, use ":messages"). |
1258 - "g8" shows the bytes used in a UTF-8 character, also the composing | 1258 - "g8" shows the bytes used in a UTF-8 character, also the composing |
1259 characters, as hex numbers. | 1259 characters, as hex numbers. |
1260 - ":set encoding=utf-8 fileencodings=" forces using UTF-8 for all files. The | 1260 - ":set encoding=utf-8 fileencodings=" forces using UTF-8 for all files. The |
1261 default is to use the current locale for 'encoding' and set 'fileencodings' | 1261 default is to use the current locale for 'encoding' and set 'fileencodings' |