[REBOL] Rebol web presence statistics
From: hallvard::ystad::oops-as::no at: 19-Mar-2004 1:16
Hi list
A quick tour around the search engines reveals they find this many documents about "rebol":
Google: "about 188,000"
Altavista: 16,721
Alltheweb: 51,983
Hotbot: 14,131
Teoma: "about 19,100"
msn: "about 13635"
Yahoo: "about 78,300"
As I write, the RIXbot has 45621 documents in its index. These are documents that contain
"rebol" in ANY way, so putting <rebol> (as an html tag) in a web page, or linking to
rebol.com, will cause the page to be included in the index. It is intended to work this
way. This means some pages will not have the word "rebol" on them (visibly), but still
be indexed.
The last time I spoke about the RIX on this list, someone suggested I make it possible
to search through rebol headers. This is now done. The bot has several indexes, both
in full text and in rebol headers. E.g., you can see some of Carl Sassenrath's and Carl
Read's scripts here: http://www.oops-as.no/rix?q=carl&st=sauthor
Do we need this? I think maybe not. Then why make it? Because rebol is fun and a bit
too addictive. I really hope I will reach some stage that I find satisfactory with this,
so I can leave it behind and get some sleep...
There are duplicates in the database:
http://rebol.com/, http://www.rebol.com/, http://rebol.com/index.html and http://www.rebol.com/index.html
are all registered. I'm working on a filter to get them out.
Rebol scripts are detected with 'load. Web pages with more than one script are currently
registered with the first script on the page only. This too will be changed if/when I
find the time.
If you're curious about whether or not some page is in the index, please use http://www.oops-as.no/rixaddurl
to check. I hope this index can be more or less exhaustive, so I'm grateful to all who
tell the bot where to go.
So Google reports 188000 pages... But clicking "next" repeatedly never gets you to the
end. I wonder if this figure is really real...
Thanks for all the help I have gotten from this list, and thanks to Nenad for the mysql
protocol in particular.
HY