slackbuilds/perl/perl-www-robotrules
Zachary Storer 177af84a22 perl/perl-www-robotrules: Fix source URLs. 2014-07-22 16:22:59 +07:00
..
README perl/perl-uri-escape: Fixed dep information 2012-08-25 18:36:51 +02:00
perl-www-robotrules.SlackBuild various: Update find command to match template. 2013-11-22 02:37:19 -05:00
perl-www-robotrules.info perl/perl-www-robotrules: Fix source URLs. 2014-07-22 16:22:59 +07:00
slack-desc various: Fix slack-desc formatting and comment nit picks. 2013-11-22 02:29:22 -05:00

README

This module parses /robots.txt files as specified in "A Standard for
Robot Exclusion", at <http://www.robotstxt.org/wc/norobots.html>
Webmasters can use the /robots.txt file to forbid conforming robots
from accessing parts of their web site.
The parsed files are kept in a WWW::RobotRules object, and this
object provides methods to check if access to a given URL is
prohibited. The same WWW::RobotRules object can be used for one
or more parsed /robots.txt files on any number of hosts.