Mailing List Archive: 49091 messages
  • Home
  • Script library
  • AltME Archive
  • Mailing list
  • Articles Index
  • Site search
 

[REBOL] comparing two URLs

From: hallvard:ystad:helpinhand at: 22-Oct-2003 12:05

Hi list My rebol stuff search engine now has more than 10000 entries, and works pretty fast thanks to DocKimbels mysql protocol. Here's a problem: Some websites work both with and without the www prefix (ex. www.rebol.com and just plain and simple rebol.com). Sometimes this gives double records in my DB (ex. http://www.oops-as.no/cgi-bin/rebsearch.r?q=mysql : you'll see that both http://www.softinnov.com/bdd.html and http://softinnov.com/bdd.html appears). Is there a way to detect such behaviour on a server? Or do I have to compare my incoming document to whatever documents I already have in the DB that _might_ be the same document? Thnaks, Hallvard