Google bandwidth problem

Started by worzelb, October 20, 2006, 12:16:27 PM

worzelb

Hi,
  My problem is that Google is hammering my site's bandwidth: nearly a gigabyte so far this month.

  I know how to block Google's access to the gallery, but I don't really want to do that, as the
gallery contains a lot of keywords that help my site's ranking. What I would like to find out is
whether I can allow Google to spider the text in the gallery but stop it from calling displayimage.php
to display images.

  I've stopped hotlinking to the pictures, but Google displays the pictures the same way someone
browsing the site would, so the pictures are served from the same domain.

I have CPG 1.4.9 on an Apache 2 server.

So, is this possible, or can I not have it both ways and have to ban Google from the gallery entirely?

Sami

#1
- Use robots.txt:
http://webtools.live2support.com/se_robots.php

- Or add a rel attribute to the image links (rel="nofollow")

- Or restrict access in .htaccess

worzelb

Thanks Sami, but two of your solutions would ban Google from the entire gallery (text as well).

The nofollow won't help either: if I place it in cpg's index.php it will have the same effect
as above, since it won't allow the subdirectory names to be indexed (which is what I would like).

I have this kind of setup on my gallery:

Main page
  Events 2006
    event 1
    event 2
  Events 2005
    event 1
    event 2
  Events 2004
    event 1
    etc.

I get a lot of hits from Google because of the names 'event 1', 'event 2', etc., which is why I want Google to still spider the text.
Also, I've found that denying Google access to the 'albums' folder in htaccess doesn't work because of the way displayimage.php works.
Because the folder 'albums' is not in the URL of the picture, Google doesn't know it's actually going into that folder via displayimage.php.

Nibbler

Use robots.txt as suggested. Either deny Google access to displayimage.php or to the image files themselves.
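
For anyone who wants to restrict only Google rather than every crawler, the rule can go in a group addressed to Googlebot. The following is a minimal Python sketch that uses the standard library's urllib.robotparser to check how such a file would be read. The /gallery path, the sample URLs and the "SomeOtherBot" name are assumptions made for illustration; they are not taken from the poster's site.

```python
from urllib.robotparser import RobotFileParser

# Assumption: the gallery lives at /gallery; adjust to your own path.
GOOGLE_ONLY_RULES = """\
User-agent: Googlebot
Disallow: /gallery/displayimage.php
"""

# Parse the rules from memory, so no network request is made.
google_only = RobotFileParser()
google_only.parse(GOOGLE_ONLY_RULES.splitlines())

# Hypothetical URLs used only to exercise the rules.
image_page = "/gallery/displayimage.php?album=5&pos=0"
index_page = "/gallery/index.php"

print("Googlebot, image page:", google_only.can_fetch("Googlebot", image_page))
print("Other bot, image page:", google_only.can_fetch("SomeOtherBot", image_page))
print("Googlebot, index page:", google_only.can_fetch("Googlebot", index_page))
```

With these rules, only the Googlebot request for displayimage.php should be refused. Keep in mind that robots.txt is advisory: it only helps with crawlers that choose to honour it.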

Abbas Ali

To make it very clear for you...

To stop any bot from crawling the images:


User-agent: *
Disallow: /path_to_your_gallery/albums/


To disallow access to displayimage.php:


User-agent: *
Disallow: /path_to_your_gallery/displayimage.php


Or you can combine both.
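
Before uploading a combined robots.txt, it can be worth checking that it blocks the image pages while leaving the text pages crawlable. Here is a rough Python sketch using the standard library's urllib.robotparser. The GALLERY path and the sample album and file names are assumptions for illustration only.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical gallery location; replace with the real path to your gallery.
GALLERY = "/gallery"

# Both rules from the post above, combined in one group.
ROBOTS_TXT = f"""\
User-agent: *
Disallow: {GALLERY}/albums/
Disallow: {GALLERY}/displayimage.php
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text held in memory (no network access)."""
    rp = RobotFileParser()
    rp.parse(robots_txt.splitlines())
    return rp


parser = build_parser(ROBOTS_TXT)

# Example URLs; the album and file names are made up.
urls = [
    f"{GALLERY}/index.php?cat=2",
    f"{GALLERY}/thumbnails.php?album=5",
    f"{GALLERY}/displayimage.php?album=5&pos=0",
    f"{GALLERY}/albums/events2006/photo1.jpg",
]

for url in urls:
    verdict = "allowed" if parser.can_fetch("Googlebot", url) else "blocked"
    print(f"{verdict:8} {url}")
```

Disallow values are matched as path prefixes, which is why a rule naming a single file also covers that file with any query string appended, and why the index and thumbnail pages stay crawlable.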

worzelb

Quote from: Abbas Ali on October 20, 2006, 01:54:50 PM
To disallow access to displayimage.php:


User-agent: *
Disallow: /path_to_your_gallery/displayimage.php



Thank you! That's just what I was after. I never knew you could disallow access to a specific file; I always thought it was only folders.  :-[

Nibbler

You should have read the link Sami gave you; it explains how to set access to individual files.