[REBOL] Parse doing weird things...
From: mat:eurogamer at: 19-Mar-2001 15:20
Heya,
While I'm at it;
I've noticed this issue repeatedly in my data mining scripts. The
parse command sometimes just will not work with the following format;
webpage: read http://www.yaddayadda.com
datamine: make string! ""
parse webpage [thru "something" copy datamine to "somethingelse"]
In the vast bulk of cases it works but right now I'm sitting in my
Rebol console with a bag load of HTML fragment in a variable and parse
turns up false every time. Now the rules DO match the content in the
html!
Specifically;
parse tmp/content [thru "Etymology:" copy DictEnty to "<br>"]
Here's everything I've tried that I can remember.
tmp/content comes from HTTP-TOOLS and I've tried loading it into
another variable, no luck.
Etymology: *does* exist. If I put 'to end' instead of the <br> tag
then it parses true. Find also turns up Etymology as well.
It doesn't appear to matter WHAT I put in the quotes at the end. IE I
can hunt for a couple of plain letters that are in the string directly
following Etymology. It doesn't appear to make any difference that
this is a tag.
Finally, all of this worked before and now it doesn't. In the past to
cure this random parse blowout, I've had to cut down on the data I'm
trying to parse.
All of the above leads me to think there is some sort of restriction
to how much data can be parsed before some sort of
feature/limitation/bug is run into.
Anyone seen this before? Comments?
--
Mat Bettinson - EuroGamer's Gaming Evangelist with a Goatee
http://www.eurogamer.net | http://www.eurogamer-network.com