Stop Google putting a website into the search engine

This is a discussion on "Stop Google putting a website into the search engine" within the Starting Out section. This forum, and the thread "Stop Google putting a website into the search engine are both part of the Design Your Website category.



 Subscribe in a reader

Go Back   Webforumz.com > Main Forums > Design Your Website > Starting Out

Notices


Reply
 
LinkBack Thread Tools
  #1  
Old Jun 19th, 2008, 22:57
Stormraven's Avatar
SuperMember

SuperMember
Join Date: Feb 2006
Location: $_home
Age: 19
Posts: 155
Thanks: 3
Thanked 1 Time in 1 Post
Stop Google putting a website into the search engine

I hope this is the right place to post this, but how do i stop google putting a website into google? it used cpanel. just wondering how to go about doing this
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit! Wong this Post!
Reply With Quote

  #2  
Old Jun 20th, 2008, 02:45
Junior Member
Join Date: Apr 2008
Location: Killeen,Texas
Age: 14
Posts: 18
Blog Entries: 1
Thanks: 0
Thanked 0 Times in 0 Posts
Re: Stop Google putting a website into the search engine

Hmm I dont really understand why you wouldnt... If im not mistaken there isnt anyway to bypass it to my knowledge but i could be wrong
Last Blog Entry: My Dreamweaver series (Apr 24th, 2008)
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit! Wong this Post!
Reply With Quote
  #3  
Old Jun 20th, 2008, 10:23
Aso's Avatar
Aso Aso is offline
Moderator

SuperMember
Join Date: Oct 2007
Location: UK
Posts: 1,324
Blog Entries: 2
Thanks: 9
Thanked 48 Times in 45 Posts
Re: Stop Google putting a website into the search engine

Create a robots.txt file at the root of your site, and copy and paste this snippet;
Code: Select all
User-agent: Google
Disallow: /
This instructs the Google agent to ignore all content from your root downwards (i.e. everything), but will not affect other spiders and SE's.

However, if your site is already in Google's index, it will take a while to propagate the changes. You may even have to user Google's Webmaster Tools to instruct Google to manually remove your site from it's index.
Last Blog Entry: The Google Misconception (Feb 3rd, 2008)
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit! Wong this Post!
Reply With Quote
  #4  
Old Jun 20th, 2008, 14:47
Stormraven's Avatar
SuperMember

SuperMember
Join Date: Feb 2006
Location: $_home
Age: 19
Posts: 155
Thanks: 3
Thanked 1 Time in 1 Post
Re: Stop Google putting a website into the search engine

Quote:
Originally Posted by aso View Post
Create a robots.txt file at the root of your site, and copy and paste this snippet;
Code: Select all
User-agent: Google
Disallow: /
This instructs the Google agent to ignore all content from your root downwards (i.e. everything), but will not affect other spiders and SE's.

However, if your site is already in Google's index, it will take a while to propagate the changes. You may even have to user Google's Webmaster Tools to instruct Google to manually remove your site from it's index.
only a .txt file? thought it would have been a .html or something like that. no my website is not in the search engine yet, the the hosting still needs 24 hours, what do you mean by the root? i have no idea where to put it lol >.<

Oh i forgot to say, i would like it not to appear in any search engine lol, sorry.

Last edited by Stormraven; Jun 20th, 2008 at 14:49.
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit! Wong this Post!
Reply With Quote
  #5  
Old Jun 20th, 2008, 16:05
Aso's Avatar
Aso Aso is offline
Moderator

SuperMember
Join Date: Oct 2007
Location: UK
Posts: 1,324
Blog Entries: 2
Thanks: 9
Thanked 48 Times in 45 Posts
Re: Stop Google putting a website into the search engine

In which case, use;
Code: Select all
User-agent: *
Disallow: /
The asterisk (*) is a 'wildcard' that applies to all user agents.

The robots.txt file is a web standard for instructing spiders which areas of your site you wish to be included / excluded in their crawl (more information).

Note that all the major players (Google, Yahoo, MSN etc.) obey the robots.txt convention, but there are spiders out there that do not (namely insignificant spiders and spambots).
Quote:
what do you mean by the root? i have no idea where to put it
The root is the 'top level' of your website (i.e. www.example.com, not www.example.com/folder/)
Last Blog Entry: The Google Misconception (Feb 3rd, 2008)
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit! Wong this Post!
Reply With Quote
  #6  
Old Jun 20th, 2008, 16:30
Rob's Avatar
Rob Rob is offline
Webforumz Founder
Join Date: Jul 2003
Location: Southern UK
Age: 34
Posts: 3,159
Blog Entries: 7
Thanks: 27
Thanked 19 Times in 16 Posts
Re: Stop Google putting a website into the search engine

As Aso has said, a robots.txt file is a file that all search engines request before spidering any content - it tells them what they CANT have on your site....

The content of the file Aso has posted above basically tells all robots to go away as they cant have anything.

Upload the file to the root of your website so it is accessible via yourdomain.com/robots.txt
__________________
Click the 'Thanks!' button if this post has helped you

Rob - Webforumz Founder
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit! Wong this Post!
Reply With Quote
  #7  
Old Jun 20th, 2008, 22:17
Stormraven's Avatar
SuperMember

SuperMember
Join Date: Feb 2006
Location: $_home
Age: 19
Posts: 155
Thanks: 3
Thanked 1 Time in 1 Post
Re: Stop Google putting a website into the search engine

But im waiting for the hosting to go through now, but the second it gets up and ready google is going to be swarming it with spiders and stuff. or will it not do that until i have content on it? lol. I have never used a cpanel before so ya know
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit! Wong this Post!
Reply With Quote
  #8  
Old Jun 20th, 2008, 23:29
Aso's Avatar
Aso Aso is offline
Moderator

SuperMember
Join Date: Oct 2007
Location: UK
Posts: 1,324
Blog Entries: 2
Thanks: 9
Thanked 48 Times in 45 Posts
Re: Stop Google putting a website into the search engine

cPanel's got nothing to do with Google, so don't worry!

cPanel is simply one of many popular systems for managing common (and complex) server-related tasks.

Google can find out about your site pretty much no matter what, but so long as that robots.txt file is hosted in your main website folder, that'll be the first thing Google checks. And as soon as it realises it's not wanted, it'll leave!

Plus until your hosting is set up, Google won't be able to index anything anyway!
Last Blog Entry: The Google Misconception (Feb 3rd, 2008)
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit! Wong this Post!
Reply With Quote
  #9  
Old Jun 21st, 2008, 09:02
Jack Franklin's Avatar
Moderator

SuperMember
Join Date: May 2007
Location: Cornwall, England
Posts: 1,402
Blog Entries: 8
Thanks: 18
Thanked 14 Times in 14 Posts
Re: Stop Google putting a website into the search engine

The robots.txt file is like the 'Keep of the Grass' sign at the local park. The kids/robots get to the edge/domain and see/read the sign/file and are nice good kids/robots and obey the sign/file.

__________________
Jack Franklin - Webforumz Moderator
(x)HTML | CSS | PHP | MySQL | JQuery (Javascript)
Contact: My Blog | Twitter | Delicious
Want Lessons? PM me.
If you think I've helped, please press the 'Thanks' Button.
Last Blog Entry: A Week with VBulletin (Aug 28th, 2008)
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit! Wong this Post!
Reply With Quote
  #10  
Old Jun 21st, 2008, 10:54
Junior Member
Join Date: Jun 2008
Location: Newcastle, England
Age: 19
Posts: 39
Thanks: 4
Thanked 1 Time in 1 Post
Re: Stop Google putting a website into the search engine

a little off topic but as its he has a solution...

why? lol
i thought everyone was tryin to be found :S
sorry, i just wondered what this would be useful for?

cheers
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit! Wong this Post!
Reply With Quote
The Following User Says Thank You to mickhartley2000 For This Useful Post:
  #11  
Old Jun 21st, 2008, 10:56
Rob's Avatar
Rob Rob is offline
Webforumz Founder
Join Date: Jul 2003
Location: Southern UK
Age: 34
Posts: 3,159
Blog Entries: 7
Thanks: 27
Thanked 19 Times in 16 Posts
Re: Stop Google putting a website into the search engine

Mick, sometimes during the development phases of a site when you would NOT want people arriving at the site then it's useful - personally, I always password protect the site but others use this among other methods to keep people and bots out.
__________________
Click the 'Thanks!' button if this post has helped you

Rob - Webforumz Founder
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit! Wong this Post!
Reply With Quote
  #12  
Old Jun 21st, 2008, 12:02
Aso's Avatar
Aso Aso is offline
Moderator

SuperMember
Join Date: Oct 2007
Location: UK
Posts: 1,324
Blog Entries: 2
Thanks: 9
Thanked 48 Times in 45 Posts
Re: Stop Google putting a website into the search engine

Actually, while we're on the subject - if you block Google (and other bots) with a robots.txt file, I'm guessing it (they) will still keep coming back every now and then to check if it's still wanted or not?
Last Blog Entry: The Google Misconception (Feb 3rd, 2008)
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit! Wong this Post!
Reply With Quote
  #13  
Old Jun 21st, 2008, 12:23
Junior Member
Join Date: Jun 2008
Location: Newcastle, England
Age: 19
Posts: 39
Thanks: 4
Thanked 1 Time in 1 Post
Re: Stop Google putting a website into the search engine

ah thats quite a nifty trick then, didnt think of that.

cheers
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit! Wong this Post!
Reply With Quote
  #14  
Old Jun 21st, 2008, 13:28
Jack Franklin's Avatar
Moderator

SuperMember
Join Date: May 2007
Location: Cornwall, England
Posts: 1,402
Blog Entries: 8
Thanks: 18
Thanked 14 Times in 14 Posts
Re: Stop Google putting a website into the search engine

Quote:
Originally Posted by Aso View Post
Actually, while we're on the subject - if you block Google (and other bots) with a robots.txt file, I'm guessing it (they) will still keep coming back every now and then to check if it's still wanted or not?
You would think so, logically.
__________________
Jack Franklin - Webforumz Moderator
(x)HTML | CSS | PHP | MySQL | JQuery (Javascript)
Contact: My Blog | Twitter | Delicious
Want Lessons? PM me.
If you think I've helped, please press the 'Thanks' Button.
Last Blog Entry: A Week with VBulletin (Aug 28th, 2008)
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit! Wong this Post!
Reply With Quote
  #15  
Old Jun 21st, 2008, 13:55
Marc's Avatar
Staff Manager

SuperMember
Join Date: Apr 2007
Location: Scotland, UK
Posts: 1,761
Thanks: 0
Thanked 14 Times in 14 Posts
Re: Stop Google putting a website into the search engine

Quote:
Originally Posted by Aso View Post
Actually, while we're on the subject - if you block Google (and other bots) with a robots.txt file, I'm guessing it (they) will still keep coming back every now and then to check if it's still wanted or not?
I reckon that they will come back but not so often, however I don't know this for fact.
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit! Wong this Post!
Reply With Quote
  #16  
Old Jun 26th, 2008, 21:39
Stormraven's Avatar
SuperMember

SuperMember
Join Date: Feb 2006
Location: $_home
Age: 19
Posts: 155
Thanks: 3
Thanked 1 Time in 1 Post
Re: Stop Google putting a website into the search engine

Quote:
Originally Posted by Rob View Post
Mick, sometimes during the development phases of a site when you would NOT want people arriving at the site then it's useful - personally, I always password protect the site but others use this among other methods to keep people and bots out.
Tryed that password thing but failed

So this robots file has no script tags or anything? just that text?
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit! Wong this Post!
Reply With Quote
  #17  
Old Jun 26th, 2008, 21:58
Aso's Avatar
Aso Aso is offline
Moderator

SuperMember
Join Date: Oct 2007
Location: UK
Posts: 1,324
Blog Entries: 2
Thanks: 9
Thanked 48 Times in 45 Posts
Re: Stop Google putting a website into the search engine

Quote:
So this robots file has no script tags or anything? just that text?
Yup. That's it.

How did you try password protection? If you're running Apache (90% chance you are), you can use htaccess