Information for RPM perl-WWW-RobotRules-6.02-1.ru.src.rpm
Field | Value
---|---
ID | 26485
Name | perl-WWW-RobotRules
Version | 6.02
Release | 1.ru
Epoch | 
Arch | src
Summary | WWW-RobotRules - database of robots.txt-derived permissions
Description | This module parses /robots.txt files as specified in "A Standard for Robot Exclusion" (<http://www.robotstxt.org/wc/norobots.html>). Webmasters can use the /robots.txt file to forbid conforming robots from accessing parts of their web site. The parsed files are kept in a WWW::RobotRules object, which provides methods to check whether access to a given URL is prohibited. The same WWW::RobotRules object can be used for one or more parsed /robots.txt files on any number of hosts. The following methods are provided: `$rules = WWW::RobotRules->new($robot_name)`, the constructor (the first argument to new() is the name of the robot); `$rules->parse($robot_txt_url, $content, $fresh_until)`, which takes as arguments the URL that was used to retrieve the /robots.txt file and the contents of the file; `$rules->allowed($uri)`, which returns TRUE if this robot is allowed to retrieve this URL; and `$rules->agent([$name])`, which gets/sets the agent name (note: changing the agent name will clear the robots.txt rules and expire times out of the cache).
Build Time | 2015-04-06 16:54:53 GMT
Size | 14730
Payload Hash | 315ff7f962f3b5bdc52220eea56c711b
Buildroot | centos5-rutgers-staging-build-4596-6905
Provides | No Provides
Requires | 
Obsoletes | No Obsoletes
Conflicts | No Conflicts
Files | 
Component of | No Buildroots
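
The description above lists the module's public methods (new, parse, allowed, agent). A minimal usage sketch follows; the robot name and URLs are placeholder values, and LWP::Simple is assumed to be available for fetching the robots.txt file:

```perl
use strict;
use warnings;
use WWW::RobotRules;
use LWP::Simple qw(get);

# Name the robot; this is matched against User-agent lines in robots.txt.
my $rules = WWW::RobotRules->new('ExampleBot/1.0');

# Fetch and parse a site's robots.txt (placeholder host).
my $robots_url = 'http://www.example.com/robots.txt';
my $robots_txt = get($robots_url);
$rules->parse($robots_url, $robots_txt) if defined $robots_txt;

# Check whether a URL on that host may be retrieved by this robot.
my $page_url = 'http://www.example.com/some/page.html';
if ($rules->allowed($page_url)) {
    my $content = get($page_url);
    # ... process $content ...
} else {
    print "Fetching $page_url is disallowed by robots.txt\n";
}
```

The same $rules object can be reused to parse and check robots.txt files from any number of hosts, as noted in the description.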