[REBOL] Re: I miss ..., bug in to-local-file ...
From: agem:crosswinds at: 8-Jun-2001 16:28
>>>>>>>>>>>>>>>>>> Ursprüngliche Nachricht <<<<<<<<<<<<<<<<<<
Am 08.06.01, 14:28:19, schrieb Petr Krenzelok <[Petr--Krenzelok--trz--cz]>
zum Thema [REBOL] Re: I miss ..., bug in to-local-file ...:
> GS Jones wrote:
> > From: "Petr Krenzelok"
> > > 1) today I worked on some short virus user log analysing script
> > > producing .html file output, and while I was able to use Czech
langueage
> > > characters, I have two following observations:
> > >
> > > a) Special characters are not correctly sorted, e.g. S^ is not
following
> > > "S", but is placed right after "Z" ... not sure if it is currently
> > > solvable ...
> > ...
> >
> > Hi, Petr,
> >
> > At least in the interim, a proper sort may be able to be accomplished
with a
> > custom sort. I'm trying to find the right code page that covers
Czech. The
> > following looks to be it.
> >
> > http://czyborra.com/charsets/iso8859.html#ISO-8859-2
> >
> > Given that this is correct, next I need to confirm that the ordering
is correct.
> > For example, there are a number of what we call capital U's with the
various
> > diacritical marks. Does this table show them to be in the correct
sort order?
> > Does a "C" come before a "C" with the diacritical marks?
> Yes, it does ..., what is more - lowercase letters first, uppercase
follows ...
> One special case however :-) "ch" is regarded being special char
combination, and it
> follows "h" :-)
just a suggestion: instead of using a customized sort
it may be usefull to use customized encoding.
then you would translate the original strings to a »sort-form«,
and sort with this.
of course keep the original!
which could be like
parse string[any[
»a« (append sort-form to char! »a«)
| »ch« (append sort-form to char! 1 + #«h«)
]
]
with some (a lot?) clever organisation.
> Maybe Ladislav or Richard (aka Cyphre :-) could comment?
> -pekr-
-Volker