Mailing List Archive: 49091 messages
  • Home
  • Script library
  • AltME Archive
  • Mailing list
  • Articles Index
  • Site search
 

[REBOL] Rebol web presence statistics

From: hallvard::ystad::oops-as::no at: 19-Mar-2004 1:16

Hi list A quick tour around the search engines reveals they find this many documents about "rebol": Google: "about 188,000" Altavista: 16,721 Alltheweb: 51,983 Hotbot: 14,131 Teoma: "about 19,100" msn: "about 13635" Yahoo: "about 78,300" As I write, the RIXbot has 45621 documents in its index. These are documents that contain "rebol" in ANY way, so putting <rebol> (as an html tag) in a web page, or linking to rebol.com, will cause the page to be included in the index. It is intended to work this way. This means some pages will not have the word "rebol" on them (visibly), but still be indexed. The last time I spoke about the RIX on this list, someone suggested I make it possible to search through rebol headers. This is now done. The bot has several indexes, both in full text and in rebol headers. E.g., you can see some of Carl Sassenrath's and Carl Read's scripts here: http://www.oops-as.no/rix?q=carl&st=sauthor Do we need this? I think maybe not. Then why make it? Because rebol is fun and a bit too addictive. I really hope I will reach some stage that I find satisfactory with this, so I can leave it behind and get some sleep... There are duplicates in the database: http://rebol.com/, http://www.rebol.com/, http://rebol.com/index.html and http://www.rebol.com/index.html are all registered. I'm working on a filter to get them out. Rebol scripts are detected with 'load. Web pages with more than one script are currently registered with the first script on the page only. This too will be changed if/when I find the time. If you're curious about whether or not some page is in the index, please use http://www.oops-as.no/rixaddurl to check. I hope this index can be more or less exhaustive, so I'm grateful to all who tell the bot where to go. So Google reports 188000 pages... But clicking "next" repeatedly never gets you to the end. I wonder if this figure is really real... Thanks for all the help I have gotten from this list, and thanks to Nenad for the mysql protocol in particular. HY