Mailing List Archive: Re: Library & smart docs => was:{Re: Re: Parsing comment}

[REBOL] Re: Library & smart docs => was:{Re: Re: Parsing comment}

From: gchiu:compkarori at: 27-Sep-2002 11:34

On Thu, 26 Sep 2002 23:21:54 +0200
  Christian Langreiter <[chris--langreiter--com]> wrote:

>> Hmm.  I thought that there was not supposed to be a direct
>> link anywhere to that archive to prevent spiders getting
>> to it ... as email addresses are visible on those pages.
>
>Well, spiders don't get to sites if they observe robots.txt,
>which spam address harvesters most certainly don't.

We had a discussion before about the rebol.org archive,
and Jeff removed all links to the archive.  To get the
address, you have to use the link on the RT rebsite. So
even a spam address harvesting robot should not get there.
But I guess Google had it in its database prior to that
happening.

>> I think that all dynamically created sites tend to be
>> invisible to search engines.  Zope is an example.
>
>Not exactly. Search engines have avoided URLs with query
>strings for a long time, but how would they distinguish
>static from dynamic content? They cannot, it's all bits to
>them (and us, for that matter ;-).

In practice they tend to be invisible.
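
To make the quoted point about query strings concrete, here is a
rough sketch of the kind of check a crawler could apply. The
function name looks_dynamic and the example.com URLs are made up
for illustration; this is not how any particular search engine
actually works, just the simplest heuristic I can think of.

```python
from urllib.parse import urlsplit


def looks_dynamic(url):
    """Hypothetical crawler heuristic: treat a URL as dynamic
    only if it carries a query string."""
    return bool(urlsplit(url).query)


# A classic CGI-style URL is flagged by the heuristic ...
cgi_style = looks_dynamic("http://example.com/page?id=3")

# ... but a path-only URL is not, even if the server generates
# the page on the fly.
path_style = looks_dynamic("http://example.com/Shopping/cart/1")
```

A check like this only looks at the shape of the URL, so it says
nothing about whether the content behind a path-only URL is
static or generated.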

Here's an article from Paul Graham making that point:

http://www.paulgraham.com/mistakes.html

and on my Zope site, I have this in my robots.txt to make
me invisible!

User-agent: *
Disallow: /Shopping/ # This is an infinite virtual URL space
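
For anyone who wants to check what a rule like that does to a
well-behaved spider, here is a small sketch using Python's
standard urllib.robotparser module. The bot name "ExampleBot" and
the example.com URLs are placeholders, not anything from my site,
and of course this only models crawlers that choose to honour
robots.txt.

```python
from urllib.robotparser import RobotFileParser

# The robots.txt rules from above, with the comment on one line.
ROBOTS_TXT = """\
User-agent: *
Disallow: /Shopping/ # This is an infinite virtual URL space
"""


def build_parser(text):
    """Parse robots.txt text directly, without fetching anything."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# Anything under /Shopping/ is off limits to compliant robots.
shop_allowed = parser.can_fetch(
    "ExampleBot", "http://example.com/Shopping/cart/1")

# Everything else remains fetchable.
home_allowed = parser.can_fetch(
    "ExampleBot", "http://example.com/index_html")

print("Shopping allowed:", shop_allowed)
print("Home allowed:", home_allowed)
```

The rule blocks only the /Shopping/ subtree; the rest of the site
stays open to spiders that read the file.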

--
Graham Chiu