|Liam Delahunty: Home||Tips||Web||Contact|
Think I deserve a present? See my Amazon Wish List
Identifing the Googlebot
gethostbyname & gethostbyaddr
Seeing the post How to verify Googlebot post on the Official Google Webmaster Central Blog I'm looking into using gethostbyname & gethostbyaddr to help verify if the bot claiming to be Googlebot actually is.
Firstly, the Googlebot will be listed as the User Agent so we can test all requests to the page using a simple if statement.
So, from their post we know
Test result using known google IP of 18.104.22.168
The word googlebot is found in the $name returned by the getHostByAddr(IP). (crawl-66-249-66-1.googlebot.com)
The $host IP address getHostByName($name) for the getHostByAddr(IP) equals that given and has passed. (22.214.171.124)
Your User Agent, IP Address, host by address and host by name
$_SERVER['HTTP_USER_AGENT'] : CCBot/2.0 (https://commoncrawl.org/faq/)
$name = gethostbyaddr($ip): ec2-54-167-15-6.compute-1.amazonaws.com
$host = gethostbyname($name): 126.96.36.199
On my Online Sales site, I published a script that will email you when the Googlebot visits.