Mailing List Archive: 49091 messages
  • Home
  • Script library
  • AltME Archive
  • Mailing list
  • Articles Index
  • Site search
 

[REBOL] Parse doing weird things...

From: mat:eurogamer at: 19-Mar-2001 15:20

Heya, While I'm at it; I've noticed this issue repeatedly in my data mining scripts. The parse command sometimes just will not work with the following format; webpage: read http://www.yaddayadda.com datamine: make string! "" parse webpage [thru "something" copy datamine to "somethingelse"] In the vast bulk of cases it works but right now I'm sitting in my Rebol console with a bag load of HTML fragment in a variable and parse turns up false every time. Now the rules DO match the content in the html! Specifically; parse tmp/content [thru "Etymology:" copy DictEnty to "<br>"] Here's everything I've tried that I can remember. tmp/content comes from HTTP-TOOLS and I've tried loading it into another variable, no luck. Etymology: *does* exist. If I put 'to end' instead of the <br> tag then it parses true. Find also turns up Etymology as well. It doesn't appear to matter WHAT I put in the quotes at the end. IE I can hunt for a couple of plain letters that are in the string directly following Etymology. It doesn't appear to make any difference that this is a tag. Finally, all of this worked before and now it doesn't. In the past to cure this random parse blowout, I've had to cut down on the data I'm trying to parse. All of the above leads me to think there is some sort of restriction to how much data can be parsed before some sort of feature/limitation/bug is run into. Anyone seen this before? Comments? -- Mat Bettinson - EuroGamer's Gaming Evangelist with a Goatee http://www.eurogamer.net | http://www.eurogamer-network.com