MyBB Community Forums

Full Version: preventing google from archiving my website
You're currently viewing a stripped down version of our content. View the full version with proper formatting.
Hello,

As a webmaster I know that there are websites that enable google to save my website appearance and content in the past. One of these websites:

https://archive.org/web/

According to that,I wouldnt like google to do that since my website includes paid membership options that could change and I dont want to reveal the price changes of the past website. Also I dont want Google to save the website that was free back then and could turn into pay-per-view material.

What I understood is that in the head section I should insert the following code - is that correct?

<META NAME=”GOOGLEBOT” CONTENT=”NOARCHIVE”>

Is this going to disable google bot from creating screenshots and archives on my website WHILE STILL INDEXING IT LIKE IT SHOULD DO? 

Thanks in advance
Ramireo
Archive.org has nothing to do with Google and also ignors robots.txt: https://blog.archive.org/2017/04/17/robo...-archives/
Thank you for your both answers!
1. So, this means that archive.org will create screenshots of my website regardless of whether I want or not, there is no way to disable them from doing that or should I edit the htaccess file and put e.g. this code 
(according to https://www.inmotionhosting.com/support/...g-htaccess
RewriteEngine On
RewriteCond %{HTTP_REFERER} archive\.org [NC]
RewriteRule .* - [F]


2. I really didnt hear about robots.txt file so far.. but it probably due to my super little experience. I am a regular web developer, just willing to promote my website without enabling google to archive it - should I create the robots.txt file in 2018 (it seems kind of "old" but I am not sure) or just put the:

Quote:<META NAME=”GOOGLEBOT” CONTENT=”NOARCHIVE”>

 in the head of my index.php file?
help ?
(2018-07-12, 03:09 PM)Ramireo Wrote: [ -> ]Thank you for your both answers!
1. So, this means that archive.org will create screenshots of my website regardless of whether I want or not, there is no way to disable them from doing that or should I edit the htaccess file and put e.g. this code 
(according to https://www.inmotionhosting.com/support/...g-htaccess
RewriteEngine On
RewriteCond %{HTTP_REFERER} archive\.org [NC]
RewriteRule .* - [F]

True. That should keep them out.

(2018-07-12, 03:09 PM)Ramireo Wrote: [ -> ]2. I really didnt hear about robots.txt file so far.. but it probably due to my super little experience. I am a regular web developer, just willing to promote my website without enabling google to archive it - should I create the robots.txt file in 2018 (it seems kind of "old" but I am not sure) or just put the:
Old or not - it is effective.

(2018-07-12, 03:09 PM)Ramireo Wrote: [ -> ]
<META NAME=”GOOGLEBOT” CONTENT=”NOARCHIVE”>
 in the head of my index.php file?
No, that should go in the "headerinclude" template. together with the other meta tags.