Current time: 09-03-2010, 03:27 PM Hello There, Guest! (LoginRegister)


Thread Closed 
Robots.txt
07-19-2010, 12:57 PM
Post: #1
Robots.txt
Robots.txt : http://newsforums.biz/robots.txt

As you can see, it says disallow newreply.php.

But Google has indexed it.

http://www.google.co.uk/search?q=site:ne...4&filter=0

What am I doing wrong?

http://mfgaming.com/

[Image: ad.php]
Find all posts by this user
07-19-2010, 01:04 PM
Post: #2
RE: Robots.txt
It may take a while for Google to index it. You could always use the URL removal tool.

Thanks, Polarbear541
MyBB Support Team


[Image: ad.php]
Become an ANYwebcam Affiliate and earn $30 per Join!
Visit this user's website Find all posts by this user
07-19-2010, 01:07 PM
Post: #3
RE: Robots.txt
http://www.google.co.uk/search?q=site:ne...4&filter=0
lake i see here google dosen't indexed it
take a look on the image:
http://img836.imageshack.us/img836/8991/google.jpg
Visit this user's website Find all posts by this user
07-19-2010, 01:15 PM
Post: #4
RE: Robots.txt
@haytoch - See image:

http://img843.imageshack.us/img843/3508/74373823.jpg

http://mfgaming.com/

[Image: ad.php]
Find all posts by this user
07-19-2010, 01:18 PM
Post: #5
RE: Robots.txt
newreply.php isn't indexed by Google here either.

No idea why this is happening. Maybe because you're the owner?

[Image: mwhbirthday.gif]
Find all posts by this user
07-19-2010, 01:19 PM
Post: #6
RE: Robots.txt
Cache? Toungue

Thanks, Polarbear541
MyBB Support Team


[Image: ad.php]
Become an ANYwebcam Affiliate and earn $30 per Join!
Visit this user's website Find all posts by this user
07-19-2010, 01:44 PM
Post: #7
RE: Robots.txt
@Polar - I have already tried that. I can only see it on google if I click on :

Code:
In order to show you the most relevant results, we have omitted some entries very similar to the 8 already displayed.
If you like, you can repeat the search with the omitted results included.

But it still means that it's indexed, which I don't want.

http://mfgaming.com/

[Image: ad.php]
Find all posts by this user
07-19-2010, 02:08 PM
Post: #8
RE: Robots.txt
As I said you can get a URL removal service in Google Webmaster tools if you're really that desperate. If not then it will just disappear in time.

Thanks, Polarbear541
MyBB Support Team


[Image: ad.php]
Become an ANYwebcam Affiliate and earn $30 per Join!
Visit this user's website Find all posts by this user
07-19-2010, 03:20 PM (This post was last modified: 07-19-2010 03:22 PM by frostschutz.)
Post: #9
RE: Robots.txt
You can go to Google Webmaster Tools, and check with their utility if your robots.txt actually does disallow newreply.php URLs. It's blocked only if that says it's blocked.

If it's indeed blocked, you can have use the URL removal utility with yoursite/newreply.php to get all newreply URLs removed. This will only work if it's either blocked or 404. Sites that are not blocked and give 200 OK will stay indexed.

EDIT:

Your robots.txt seems to be interpreted as HTML since it contains this code at the end:

Code:
<!-- www.000webhost.com Analytics Code -->
<script type="text/javascript" src="http://analytics.hosting24.com/count.php"></script>
<noscript><a href="http://www.hosting24.com/"><img src="http://analytics.hosting24.com/count.php" alt="web hosting" /></a></noscript>
<!-- End Of Analytics Code -->

This could break your entire robots.txt. Verify with the Google Webmaster Tools whether it's actually accepted or not.
Visit this user's website Find all posts by this user
07-19-2010, 04:56 PM
Post: #10
RE: Robots.txt
I just had the Analytics Code removed.

@Polar - I can't find the URL removal service. Could you provide a link?

http://mfgaming.com/

[Image: ad.php]
Find all posts by this user
Thread Closed 


Forum Jump:


User(s) browsing this thread: 1 Guest(s)