[REBOL] Re: Frequency of phrases
From: louisaturk:coxinet at: 23-Aug-2002 1:20
Hi Charles,
At 12:37 AM 8/23/2002 -0400, you wrote:
> > Phrase: A phrase will be defined as a string len characters long and with a
> > space at each end. All phrases 100 characters long are to be processed
> > first, then all phrases of length len - 1 and so on until len = 5.
> My apologies, but what about a phrase such as:
>This is a phrase.<cr>
> There is no whitespace at beginning or end.
This really is what I need, strange as it may sound.
> > The text file: To simplify things, manually place a space at the beginning
> > and at the end of the file to be processed. To further simplify things,
> > place a space before all punctuation marks.
My source file really is like this also.
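For anyone whose file is not already in this form, the preparation could be automated. Here is a rough sketch in Python rather than REBOL; the function name `prepare`, the set of punctuation marks, and the collapsing of line breaks into single spaces are all my assumptions, not part of the original description. Note that this naive rule would also split something like index.html into "index .html".

```python
import re

def prepare(text):
    """Return text in the assumed input format (hypothetical helper)."""
    # Put exactly one space before each punctuation mark.
    # Assumption: only these marks count as punctuation.
    text = re.sub(r"\s*([.,;:!?])", r" \1", text)
    # Assumption: runs of whitespace, including line breaks, become one space.
    text = " ".join(text.split())
    # Place a space at the beginning and at the end of the text.
    return " " + text + " "
```

For example, `prepare("This is a phrase. This too.")` gives `" This is a phrase . This too . "`.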
> Eek. Or possibly allow for cases to accept punctuation, so long as there is
> no other printable non-whitespace character afterwards? For instance, accept:
>This is a phrase. This too is a phrase.<cr>
> ... As two phrases. However, do not end with the dot here as a phrase ending:
>This phrase mentions index.html.<cr>
>
> Yes?
Not necessary for my needs right now. I am only going to need this script
for one use, but the resulting data is going to be very helpful. The
script would certainly be more generally useful if it could process more
normal text (and that would still work for me), but it should be much
simpler to write the script the way I need it.
Correction (in all caps): A phrase will be defined as a string len
characters long and with a space at each end AND CONTAINING AT LEAST THREE
WORDS.
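To make the corrected definition concrete, here is a minimal sketch in Python rather than REBOL. It rests on several assumptions of mine: len counts the space at each end, any whitespace-separated token counts as a word (so a detached punctuation mark counts too), and overlapping phrases are all counted. The names `phrase_counts` and `all_phrase_counts` are made up for illustration.

```python
from collections import Counter

def phrase_counts(text, length, min_words=3):
    """Count substrings of exactly `length` characters that begin and end
    with a space and contain at least `min_words` tokens."""
    counts = Counter()
    for start in range(len(text) - length + 1):
        candidate = text[start:start + length]
        # A phrase must have a space at each end.
        if candidate[0] != " " or candidate[-1] != " ":
            continue
        # Assumption: a "word" is any whitespace-separated token.
        if len(candidate.split()) >= min_words:
            counts[candidate] += 1
    return counts

def all_phrase_counts(text, longest=100, shortest=5):
    """Process the longest phrases first, then length - 1, down to `shortest`."""
    return {n: phrase_counts(text, n) for n in range(longest, shortest - 1, -1)}
```

Calling `phrase_counts(text, 100).most_common(10)` would then list the ten most frequent 100-character phrases. This scans the whole text once per length, which is slow but simple, and that should be acceptable for a one-off run.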
Thanks,
Louis