translate.googleusercontent.com


What is translate.googleusercontent.com

Many site managers have asked what this traffic referrer is. The answer is simple. This is Google’s Translation Service.

Translate by GoogleSurprisingly the answer was not too easy to find considering this is GOOGLE! Many of us know the Google Translation Service: want to read a webpage in a language we do not know, Google Translate provides a very good translation to and from a range of international languages.Lately this other strange address has been seen in traffic statistics, as well as generating error messages for sites carrying Google AdSense advertising.

The website or URL we are more likely to be familiar with is translate.google.com; Fairly recently Google changed some of the service to this alternative domain.

From what I can find, translate.googleusercontent.com is the URL seen when cached translated pages are served, and the cached page pulls data from the live site, e.g. images.

A thread found on the Google’s AdSense support forum states:
Seeing translate.googleusercontent.com in traffic data “means that someone has viewed your page using the Google Translate service, you will probably also see Google Cache in your list which as noted above means someone has viewed a cached version of your page”

Identity confirmed on Google Support Forum

The confirmation of the domain was finally found hidden in a post on Google Groups by Josh (a Google Employee) who answers some questions about the errors seen by AdSense users.

Quote:

“We correctly block robots.txt from crawling our translations of webpages.  We don’t want web crawlers to try to crawl complete translations of the web through our service.”

The translate.googleusercontent.com site blocks robots/spiders from crawling the indexed content.Josh also describes how to fix the error messages from AdSense by allowing the AdSense spider to crawl the affected site…

Apparently the translated pages are only partly cached, and there is a good reason for that. If the translated page was fully cached, then clicks from the cached page would result in loss of income to the site owner – who will the click-through fee go to – the cache?

Fair enough, it looks like Google is trying to combine the best of both worlds here, by caching the translated page content, but calling the page from the live website so any click-through on AdSense links gets credited to the site owners account. Rather clever in fact.

This explanation also answers another question.

IP look-up for translate.googleusercontent.com fails

Having identified a number of IPs used by translate.googleusercontent.com (a number of Google regional services have different IP addresses), those of us who have tried discovering what this referrer is have found no answers from IP look-up services. Josh’s answer to the AdSense question provides the clue – these services spiders are also blocked…

Not a hacker, spambot or botnet

This traffic source (translate.googleusercontent.com) is not a hacker, spambot or any other source of malign activity! Nor is it a bandwidth thief. It is simply Google doing it’s thing.

The real problem is the lack of information (once again) from Google. Considering this is their translation service they really should have a claim of ownership posted on their main website, or even Webmaster Tools… I wonder how may site managers have blocked the IP’s this service uses not knowing what it was. I did for a while!

I came across a large number of 404 page ‘page not found’ errors caused by traffic accredited to translate.googleusercontent.com after hotlink protecting images and downloads on my domains at Graphicline (It seems allowing free download and use of these items is not enough for some people – they also want free hosting and bandwidth too).

Considering the several hundred links from translate…  resulting in 404′s I added the IP’s to the blocked list… The result of course was incomplete pages viewed from the translated and cached pages. I know it is not Google’s wish to serve partial content, nor is it mine to prevent the service from delivering full-page views – lack of information… After looking through the first 50 results from a Google search for the term translate… and getting only pages of more questions without adequate answers from a reliable source, the IP’s were blocked.

Now, having found the facts, I can add the domain to the list of those allowed to hot-link, and remove the IP’s from the block.

Yes, it is safe to allow translate.googleusercontent.com to link to your site, as well as your AdSense content!

me on google plus+Mike Otgaar

About these ads

About Mike

Web Developer and Techno-geek Saltwater fishing nut Blogger

Posted on December 22, 2011, in Google, Internet, TECHNOLOGY and tagged , , , , , , . Bookmark the permalink. 4 Comments.

  1. unauthorized site -> translate.googleusercontent.com
    I just authorized the adsense account but it appears like that a few hours later. Is it dangerous if I authorize this site in my account?

    • In My Opinion it’s safe to authorize translate.googleusercontent.com. I am not aware of any script injection or other vulnerabilities in the genuine google translate sites.
      However, Google translate also users regional sites e.g. translate.googleusercontent.in and translate.googleusercontent.dk among many others. Each will need authorizing if regular traffic is seen from these sub-domains. Also, it’s always a good idea to check the site is a genuine Google site (use reverse Ip/Domain lookup tools like whois.domaintools.com
      Another thing you could do is embed Google Translate “on the fly translation” in your site. For WordPress GTranlate information see this article

  2. thanks for the great share, i was considering this referral site as a spam or a bot.

Leave a Reply

Please log in using one of these methods to post your comment:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

Follow

Get every new post delivered to your Inbox.

Join 2,267 other followers

%d bloggers like this: