PHP Classes

Classes of Andy Pieters

Recommend this page to a friend!
  All class groups  >  All authors  >  Classes of Andy Pieters (1)  >  Mission progress status  >  Reputation  
Picture of Andy Pieters
Name: Andy Pieters <contact>
Classes: 1
Country: United Kingdom
Age: 46
All time rank: 196385 in United Kingdom
Week rank: 216 Up13 in United Kingdom Up
All time users: 1360
Week users: 0
Innovation award
Innovation award
Nominee: 1x
 
  A   B   C   D   E   F   G   H   I   J   K   L   M   N   O   P   Q   R   S   T   U   V   W   X   Y   Z  
  Files folder image Robots_txt  
PHP Programming Innovation award nominee
January 2008
Number 8
Test if a URL may be crawled looking at robots.txt
This class can be used to check whether a page may be crawled by looking at the robots.txt file of its site.

It takes the URL of a page and retrieves the robots.txt file of the same site.

The class parses the robots.txt file and looks up for the rules defined in that file to see if the site allows crawling the intended page.

The class also stores the time when a page is crawled to check whether next time another page of the same site is being crawled it is honoring the intended crawl delay and request rate limits.
RatingsUtility Consistency Documentation Examples Tests Videos Overall Rank
All time: 50% 60% 50% - - - 36% 4092
Month: Not yet rated by the users


  A   B   C   D   E   F   G   H   I   J   K   L   M   N   O   P   Q   R   S   T   U   V   W   X   Y   Z