Re: robots.txt (was Re: Download a whole gopherhole using wget/curl?)
Greetings.
On Fri, 29 Nov 2019 06:19:46 +0100 Sean Conner <sean@conman.org> wrote:
> I have a question about this, and it relates to the section I added last
> night to my gopherhole. I read the document given above, and in there, I
> read:
>
> Now put into this file:
>
> User-agent: eomyidae/0.3
> Disallow: /
>
> Or to disallow all crawlers:
>
> User-agent: *
> Disallow: /
>
> That follows directly from the standard for HTTP, but gopher isn't HTTP.
> I'm asking because very few selectors in my gopherhole start with a '/' [1],
> so this doesn't really work for me if I wanted to block all crawlers from my
> site (which I don't do [2]).
>
> But if I wanted to block robots from crawling the black hole I created,
> would the following actually work?
>
> User-agent: *
> Disallow: BlackHole:
Good point. In eomyidae you have two possibilities:
User-Agent: *
Disallow: *
and
User-Agent: *
Disallow:
I have changed the eomyidae hole to clarify this.
But eomyidae already special-cases »/« to mean: do not crawl anything.
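As a minimal sketch (not eomyidae's actual code), this is how a gopher
crawler might interpret Disallow rules as plain prefix matches on the
selector, with »*« and »/« treated as "block everything" and an empty
value as "block nothing". The function name and rule handling here are
illustrative assumptions, not taken from eomyidae:

```python
def is_disallowed(selector, rules):
    """Return True if any Disallow value blocks the given selector.

    rules: the Disallow values for the matching User-Agent record.
    Gopher selectors need not start with "/", so each rule is
    treated as a plain prefix match on the selector.
    """
    for rule in rules:
        if rule == "":
            continue              # "Disallow:" with no value blocks nothing
        if rule in ("*", "/"):
            return True           # block the whole hole
        if selector.startswith(rule):
            return True           # plain prefix match on the selector
    return False

# Sean's black-hole case: block selectors starting with "BlackHole:"
print(is_disallowed("BlackHole:0", ["BlackHole:"]))  # True
print(is_disallowed("Phlog/2019", ["BlackHole:"]))   # False
```

Under these assumptions, »Disallow: BlackHole:« would indeed keep a
conforming crawler out of the black hole while leaving the rest of the
hole crawlable.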
Sincerely,
Christoph Lohmann
💻 https://r-36.net
💻 gopher://r-36.net
☺ https://r-36.net/about
🔐 1C3B 7E6F 9805 E5C8 C0BD 1F7F EA21 7AAC 09A9 CB55
🔐 http://r-36.net/about/20h.asc
📧 20h@r-36.net