comparison src/mbyte.c @ 34336:d2ad8733db75 v9.1.0101

patch 9.1.0101: upper-case of German sharp s should be U+1E9E Commit: https://github.com/vim/vim/commit/bd1232a1faf56b614a1e74c4ce51bc6e0650ae00 Author: glepnir <glephunter@gmail.com> Date: Mon Feb 12 22:14:53 2024 +0100 patch 9.1.0101: upper-case of German sharp s should be U+1E9E Problem: upper-case of ? should be U+1E9E (CAPITAL LETTER SHARP S) (fenuks) Solution: Make gU, ~ and g~ convert the U+00DF LATIN SMALL LETTER SHARP S (?) to U+1E9E LATIN CAPITAL LETTER SHARP S (?), update tests (glepnir) This is part of Unicode 5.1.0 from April 2008, so should be fairly safe to use now and since 2017 is part of the German standard orthography, according to Wikipedia: https://en.wikipedia.org/wiki/Capital_%E1%BA%9E#cite_note-auto-12 There is however one exception: UnicodeData.txt for U+00DF LATIN SMALL LETTER SHARP S does NOT define U+1E9E LATIN CAPITAL LETTER SHARP S as its upper case version. Therefore, toupper() won't be able to convert from lower sharp s to upper case sharp s (the other way around however works, since U+00DF is considered the lower case character of U+1E9E and therefore tolower() works correctly for the upper case version). fixes: #5573 closes: #14018 Signed-off-by: glepnir <glephunter@gmail.com> Signed-off-by: Christian Brabandt <cb@256bit.org>
author Christian Brabandt <cb@256bit.org>
date Mon, 12 Feb 2024 22:45:02 +0100
parents d7cfd8fb1d75
children cffcacc1502a
comparison
equal deleted inserted replaced
34335:d1c84a2d538d 34336:d2ad8733db75
3452 {0x118a0,0x118bf,1,32}, 3452 {0x118a0,0x118bf,1,32},
3453 {0x16e40,0x16e5f,1,32}, 3453 {0x16e40,0x16e5f,1,32},
3454 {0x1e900,0x1e921,1,34} 3454 {0x1e900,0x1e921,1,34}
3455 }; 3455 };
3456 3456
3457 // Note: UnicodeData.txt does not define U+1E9E as being the corresponding upper
3458 // case letter for U+00DF (ß), however it is part of the toLower table
3457 static convertStruct toUpper[] = 3459 static convertStruct toUpper[] =
3458 { 3460 {
3459 {0x61,0x7a,1,-32}, 3461 {0x61,0x7a,1,-32},
3460 {0xb5,0xb5,-1,743}, 3462 {0xb5,0xb5,-1,743},
3461 {0xe0,0xf6,1,-32}, 3463 {0xe0,0xf6,1,-32},