I Hate Google

February 9th, 2007

I know I should be grateful that they’re indexing my sites frequently. That means more exposure and more people visiting the site.

But should their indexing take up so much bandwidth?! For one subdomain alone, we used 14 GB of bandwidth within 7 days. Isn’t that insane? Shouldn’t our bandwidth be reserved for people who actually have an interest in our site, rather than robots that only report back to Google about our pages?
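For anyone who wants to check how much of their traffic really goes to Googlebot, here is a minimal Python sketch that sums response bytes per user agent. It assumes the server writes Apache's "combined" log format; the sample lines, IP addresses, and the `bytes_by_agent` helper are made up for illustration and are not taken from our actual logs.

```python
import re
from collections import Counter

# Tail of an Apache "combined" log line:
#   "request" status bytes "referer" "user-agent"
LOG_LINE = re.compile(
    r'"[^"]*" (?P<status>\d{3}) (?P<bytes>\d+|-) "[^"]*" "(?P<agent>[^"]*)"'
)

def bytes_by_agent(lines):
    """Sum response bytes for 'Googlebot' versus everything else."""
    totals = Counter()
    for line in lines:
        match = LOG_LINE.search(line)
        if match is None:
            continue  # skip lines that are not in combined format
        size = match.group("bytes")
        family = "Googlebot" if "Googlebot" in match.group("agent") else "other"
        # Apache logs "-" when no body was sent (for example a 304 response)
        totals[family] += 0 if size == "-" else int(size)
    return totals

# Hypothetical sample lines; in practice, pass an open access log file instead.
sample = [
    '66.249.66.1 - - [09/Feb/2007:10:00:00 +0000] "GET /a.html HTTP/1.1" 200 5120 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"',
    '192.0.2.7 - - [09/Feb/2007:10:00:05 +0000] "GET /a.html HTTP/1.1" 200 5120 "http://example.com/" "Mozilla/5.0 (Windows; U)"',
    '66.249.66.1 - - [09/Feb/2007:10:00:09 +0000] "GET /b.html HTTP/1.1" 304 - "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"',
]
totals = bytes_by_agent(sample)
print(totals)
```

Matching on the user-agent string is only a rough cut, since anything can claim to be Googlebot, but it is usually enough to see whether the crawler or real visitors are eating the quota.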

Bandwidth doesn’t come for free; in fact, it’s rather expensive once you exceed what your hosting plan includes. How expensive? Just imagine that a few months ago, this blog racked up 100 GB of bandwidth, and yes, just for this little blog alone.

Now our hosting package only allows for 20 GB per month, and that’s more than enough for us. Unfortunately, we didn’t expect Googlebot to ransack it.

How about using the robots.txt file? People say it works.

It doesn’t. I tried it for several months; it did stabilize my bandwidth towards the middle, but in the end my bandwidth usage is still skyrocketing.
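One thing worth ruling out is a robots.txt that doesn't say what you think it says. Below is a small Python sketch using the standard library's `urllib.robotparser` to test a set of rules before uploading them. The rules and the `/gallery/` and `/downloads/` directories are hypothetical examples, not the file this site actually used.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules: keep Googlebot out of two heavy directories only,
# and leave everything else open to all crawlers.
ROBOTS_TXT = """\
User-agent: Googlebot
Disallow: /gallery/
Disallow: /downloads/

User-agent: *
Disallow:
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

def allowed(path, agent="Googlebot"):
    """Return True if the rules above let `agent` fetch `path`."""
    return parser.can_fetch(agent, path)

print(allowed("/gallery/page1.html"))    # blocked by the Googlebot rules
print(allowed("/2007/02/09/post.html"))  # ordinary pages stay crawlable
```

Narrow rules like these leave the rest of the site indexable, which matters if the goal is to cut bandwidth without dropping out of the search results altogether.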

Now if only I knew how to stop Googlebot from sucking up our bandwidth without losing our page ranking in Google. T_T


2 Comments

michan on February 12th, 2007 at 11:17 pm
    I just did some random reading and found out that spiders/crawlers these days don’t even honor the robots.txt exclusion rules anymore. However, now that you’re on a Linux server, you should be able to use .htaccess freely without many restrictions. This might be useful for Kachou-sama.. ^^

    [dive into mark] How to block spambots, ban spybots, and tell unwanted robots to go to hell

    Ganbatte. ^^

Licorne on February 13th, 2007 at 3:54 pm
    Thanks, Michan. I love the title of that article.

    We’ll stick it out first with the robots.txt to see if it works. If it doesn’t, we’ll see about banning Googlebot from the entire site. Hopefully not, since we don’t want to lose our page ranking and our spidered pages.
