
[REBOL] Re: Frequency of phrases

From: louisaturk:coxinet at: 23-Aug-2002 1:20

Hi Charles,

At 12:37 AM 8/23/2002 -0400, you wrote:
> > Phrase: A phrase will be defined as a string len characters long and with a
> > space at each end. All phrases 100 characters long are to be processed
> > first, then all phrases of length len - 1 and so on until len = 5.
> My apologies, but what about a phrase such as:
>This is a phrase.<cr>
> There is no whitespace at beginning or end.
This really is what I need, strange as it may sound.
> > The text file: To simplify things, manually place a space at the beginning
> > and at the end of the file to be processed. To further simplify things,
> > place a space before all punctuation marks.
My source file really is like this also.
> Eek. Or possibly allow for cases to accept punctuation, so long as there is
> no other printable non-whitespace character afterwards? For instance, accept:
>This is a phrase. This too is a phrase.<cr>
> ... As two phrases. However, do not end with the dot here as a phrase ending:
>This phrase mentions index.html.<cr>
>
> Yes?
Not necessary for my needs right now. I am just going to need this script for one use, but the resulting data is going to be very helpful. The script would certainly be more generally useful if it can process more normal text (and would work for me), but it should be much simpler to make the script as I need it.

Correction (in all caps): A phrase will be defined as a string len characters long and with a space at each end AND CONTAINING AT LEAST THREE WORDS.

Thanks,
Louis
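
The corrected definition can be sketched roughly as follows. This is only an illustration in Python, not the REBOL script being requested, and it rests on several assumptions that the thread does not settle: the two bounding spaces count toward len, a "word" is any whitespace-separated token (so a punctuation mark set off by spaces counts as a word), and overlapping phrases are all counted. The name `phrase_counts` and its parameters are hypothetical.

```python
from collections import Counter


def phrase_counts(text, max_len=100, min_len=5, min_words=3):
    """Count candidate phrases for each length, longest first.

    Assumptions (not confirmed by the thread):
    - the leading and trailing space are part of the `length` characters;
    - a word is any whitespace-separated token;
    - overlapping phrases are each counted.
    """
    results = {}
    # Process len = max_len first, then len - 1, down to min_len.
    for length in range(max_len, min_len - 1, -1):
        counts = Counter()
        # Slide a window of `length` characters across the text.
        for start in range(len(text) - length + 1):
            chunk = text[start:start + length]
            if (chunk[0] == " " and chunk[-1] == " "
                    and len(chunk.split()) >= min_words):
                counts[chunk] += 1
        if counts:
            results[length] = counts
    return results


if __name__ == "__main__":
    # The source text is assumed to start and end with a space already.
    sample = " a b c a b c "
    for length, counts in phrase_counts(sample).items():
        print(length, counts.most_common(3))
```

Testing every window at every length is slow on a large file, but for a script meant to be run once that may be acceptable.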