Mailing List Archive: 49091 messages

[REBOL] Re: Czech Sort (was Re: I miss ..., bug in to-local-file ...)

From: petr:krenzelok:trz:cz at: 10-Jun-2001 1:19

----- Original Message -----
From: "GS Jones" <[gjones05--mail--orion--org]>
To: <[rebol-list--rebol--com]>
Sent: Saturday, June 09, 2001 2:23 PM
Subject: [REBOL] Czech Sort (was Re: I miss ..., bug in to-local-file ...)

> Hi, PeKr, Cyphre, (Ladislav, others?), Volker,
>
> I was distracted last night trying to figure out the plugin problem for IE.
> Many hours spent; nothing to show for it.
>
> Anyway, this morning I completed my reference table. Some of the letters were
> very tricky to find. I basically scanned PeKr's companies pages for the
> letters, then looked at the source to determine the single byte code/char
> representation. I think I can read Czech now .... not! ;-)
>
> I used a language list reference to get all the alphabet. This list did not
> list "ch" as a separate character. I tried to put the letters in order based on
> the references I found and PeKr's instructions. This list follows the
> questions. Please look over the list to check for completeness and accuracy of
> order (hopefully I got the byte codes correct).

I am sorry I did not find the free time to respond earlier!

"Ch" is not a single character! It is still TWO characters. The sort sequence, however, is different from the English one:

a b c d e f g h "ch" i j k l m n .....

So the above is a bit of a Czech special case ....

a b c c^ d d^ .....

I don't know how to express it in English, but imagine the ^ sitting above the "c" and rotated by 180 degrees :-) .. maybe a little composed image could help?

http://www.rebol.cz/~can/rebol-view/czech-alphabet.png

> First, my questions:
> 1) Does the uppercase version of "ch" look like "Ch" or "CH"?

Ch - still two letters ... just sorted as being one ...

> 2) A sort/case in REBOL normally puts the uppercase versions before the lower
> case versions. Would this be acceptable? PeKr mentioned that "lowercase
> letters first, uppercase follows". Does this mean that a sort/case should
> ideally put the lower case version before an upper case version?

I am not sure about that one ....
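
To make the two options in the question concrete, here is a small Python sketch (not REBOL). Plain code-point ordering puts uppercase ASCII letters before lowercase ones, which resembles what the question describes for sort/case. The helper `lower_first_key` is a hypothetical name for one way to get "lowercase first, uppercase follows" as a tie-breaker between otherwise equal words; whether that is the desired Czech behaviour is left open in this thread.

```python
def lower_first_key(word):
    """Compare case-insensitively first; on a tie, lowercase sorts before uppercase."""
    # False sorts before True, so a lowercase letter wins the tie-break.
    return (word.lower(), [ch.isupper() for ch in word])


words = ["Brno", "brno", "Adam", "adam"]

# Code-point order: all uppercase-initial words come first.
print(sorted(words))
# -> ['Adam', 'Brno', 'adam', 'brno']

# Case-insensitive order with the lowercase variant listed first.
print(sorted(words, key=lower_first_key))
# -> ['adam', 'Adam', 'brno', 'Brno']
```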

> Now, for my reference table. The column abbreviations may be obvious (-:, but
> here is the translation:
> Dec: Decimal representation of the single computer byte representation of the
> letter
> UnD: The decimal representation of the Unicode word (meaning 2 bytes) for the
> letter (here for future use, if needed)
> L: ASCII letters linguist use to refer to letters
> UnHx: Unicode Hex for the letter
> Description: Linguistic description of letters
>
> Reference table version 0.1 (I have a lot of confidence, don't I? ;-)
> (best viewed with fixed width font)
>
> Dec UnD L  UnHx Description
>  97  97 a  0061 LATIN SMALL LETTER A
> 225 225 a' 00e1 LATIN SMALL LETTER A WITH ACUTE
>  98  98 b  0062 LATIN SMALL LETTER B
>  99  99 c  0063 LATIN SMALL LETTER C
> 232 269 c< 010d LATIN SMALL LETTER C WITH CARON
> 100 100 d  0064 LATIN SMALL LETTER D
> 239 271 d< 010f LATIN SMALL LETTER D WITH CARON
> 101 101 e  0065 LATIN SMALL LETTER E
> 233 233 e' 00e9 LATIN SMALL LETTER E WITH ACUTE
> 236 283 e< 011b LATIN SMALL LETTER E WITH CARON
> 102 102 f  0066 LATIN SMALL LETTER F
> 103 103 g  0067 LATIN SMALL LETTER G
> 104 104 h  0068 LATIN SMALL LETTER H
>         ch      special character combination
> 105 105 i  0069 LATIN SMALL LETTER I
> 237 237 i' 00ed LATIN SMALL LETTER I WITH ACUTE
> 106 106 j  006a LATIN SMALL LETTER J
> 107 107 k  006b LATIN SMALL LETTER K
> 108 108 l  006c LATIN SMALL LETTER L
> 109 109 m  006d LATIN SMALL LETTER M
> 110 110 n  006e LATIN SMALL LETTER N
> 242 328 n< 0148 LATIN SMALL LETTER N WITH CARON
> 111 111 o  006f LATIN SMALL LETTER O
> 243 243 o' 00f3 LATIN SMALL LETTER O WITH ACUTE
> 112 112 p  0070 LATIN SMALL LETTER P
> 113 113 q  0071 LATIN SMALL LETTER Q
> 114 114 r  0072 LATIN SMALL LETTER R
> 248 345 r< 0159 LATIN SMALL LETTER R WITH CARON
> 115 115 s  0073 LATIN SMALL LETTER S
> 185 353 s< 0161 LATIN SMALL LETTER S WITH CARON
> 116 116 t  0074 LATIN SMALL LETTER T
> 187 357 t< 0165 LATIN SMALL LETTER T WITH CARON
> 117 117 u  0075 LATIN SMALL LETTER U
> 249 367 u0 016f LATIN SMALL LETTER U WITH RING ABOVE
> 250 250 u' 00fa LATIN SMALL LETTER U WITH ACUTE
> 118 118 v  0076 LATIN SMALL LETTER V
> 119 119 w  0077 LATIN SMALL LETTER W
> 120 120 x  0078 LATIN SMALL LETTER X
> 121 121 y  0079 LATIN SMALL LETTER Y
> 253 253 y' 00fd LATIN SMALL LETTER Y WITH ACUTE
> 122 122 z  007a LATIN SMALL LETTER Z
> 190 382 z< 017e LATIN SMALL LETTER Z WITH CARON
>  65  65 A  0041 LATIN CAPITAL LETTER A
> 193 193 A' 00c1 LATIN CAPITAL LETTER A WITH ACUTE
>  66  66 B  0042 LATIN CAPITAL LETTER B
>  67  67 C  0043 LATIN CAPITAL LETTER C
> 200 268 C< 010c LATIN CAPITAL LETTER C WITH CARON
>  68  68 D  0044 LATIN CAPITAL LETTER D
> 207 270 D< 010e LATIN CAPITAL LETTER D WITH CARON
>  69  69 E  0045 LATIN CAPITAL LETTER E
> 201 201 E' 00c9 LATIN CAPITAL LETTER E WITH ACUTE
> 204 282 E< 011a LATIN CAPITAL LETTER E WITH CARON
>  70  70 F  0046 LATIN CAPITAL LETTER F
>  71  71 G  0047 LATIN CAPITAL LETTER G
>  72  72 H  0048 LATIN CAPITAL LETTER H
>         Ch      special character combination
>  73  73 I  0049 LATIN CAPITAL LETTER I
> 205 205 I' 00cd LATIN CAPITAL LETTER I WITH ACUTE
>  74  74 J  004a LATIN CAPITAL LETTER J
>  75  75 K  004b LATIN CAPITAL LETTER K
>  76  76 L  004c LATIN CAPITAL LETTER L
>  77  77 M  004d LATIN CAPITAL LETTER M
>  78  78 N  004e LATIN CAPITAL LETTER N
> 210 327 N< 0147 LATIN CAPITAL LETTER N WITH CARON
>  79  79 O  004f LATIN CAPITAL LETTER O
> 211 211 O' 00d3 LATIN CAPITAL LETTER O WITH ACUTE
>  80  80 P  0050 LATIN CAPITAL LETTER P
>  81  81 Q  0051 LATIN CAPITAL LETTER Q
>  82  82 R  0052 LATIN CAPITAL LETTER R
> 216 344 R< 0158 LATIN CAPITAL LETTER R WITH CARON
>  83  83 S  0053 LATIN CAPITAL LETTER S
> 169 352 S< 0160 LATIN CAPITAL LETTER S WITH CARON
>  84  84 T  0054 LATIN CAPITAL LETTER T
> 171 356 T< 0164 LATIN CAPITAL LETTER T WITH CARON
>  85  85 U  0055 LATIN CAPITAL LETTER U
> 217 366 U0 016e LATIN CAPITAL LETTER U WITH RING ABOVE
> 218 218 U' 00da LATIN CAPITAL LETTER U WITH ACUTE
>  86  86 V  0056 LATIN CAPITAL LETTER V
>  87  87 W  0057 LATIN CAPITAL LETTER W
>  88  88 X  0058 LATIN CAPITAL LETTER X
>  89  89 Y  0059 LATIN CAPITAL LETTER Y
> 221 221 Y' 00dd LATIN CAPITAL LETTER Y WITH ACUTE
>  90  90 Z  005a LATIN CAPITAL LETTER Z
> 174 381 Z< 017d LATIN CAPITAL LETTER Z WITH CARON
>
> I look forward to hearing from you and the next step (by the way, Volker, I only
> briefly looked through your code last night for the reason I've already
> explained; I will look more carefully when I go to the next step.
> Thanks!)

I will check once I get to work and can look at some of our dbase char sorting. However, I only asked about sorting. I did not necessarily need you to spend all of your free time solving my problems :-) Anyway, thanks a lot for taking care!

Cheers,
-pekr-
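
One way to cross-check the Dec, UnD and UnHx columns of the quoted reference table is to recompute them, sketched here in Python. The thread does not name the single-byte code page. This sketch assumes ISO-8859-2 (Latin-2), because the values quoted for letters such as s-caron (185) and z-caron (190) match that encoding. `CODEC` and `table_row` are names invented for this example.

```python
import unicodedata

# Assumption: the "Dec" column holds ISO-8859-2 (Latin-2) byte values.
CODEC = "iso-8859-2"


def table_row(letter):
    """Return (Dec, UnD, UnHx, Description) for one letter."""
    dec = letter.encode(CODEC)[0]   # single-byte code in the assumed code page
    und = ord(letter)               # Unicode code point as a decimal number
    unhx = format(und, "04x")       # the same code point as four hex digits
    return dec, und, unhx, unicodedata.name(letter)


# Recompute the rows for the accented lowercase letters.
for letter in "áčďéěíňóřšťůúýž":
    dec, und, unhx, name = table_row(letter)
    print(f"{dec:3d} {und:3d} {unhx} {name}")
```

Under that assumption the recomputed values agree with the table, for example 232 / 269 / 010d for c with caron.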