MyBB Community Forums

Full Version: What's wrong with robots.txt file
I am running MyBB on a subdomain, http://forum.mytabletguru.com/,
and I am using the following in my robots.txt file (http://forum.mytabletguru.com/robots.txt):

User-Agent: *
Disallow: captcha.php
Disallow: editpost.php
Disallow: misc.php
Disallow: modcp.php
Disallow: moderation.php
Disallow: newreply.php
Disallow: newthread.php
Disallow: online.php
Disallow: printthread.php
Disallow: private.php
Disallow: ratethread.php
Disallow: report.php
Disallow: reputation.php
Disallow: search.php
Disallow: sendthread.php
Disallow: task.php
Disallow: usercp.php
Disallow: usercp2.php
Disallow: calendar.php
Disallow: *action=emailuser*
Disallow: *action=nextnewest*
Disallow: *action=nextoldest*
Disallow: *year=*
Disallow: *action=weekview*
Disallow: *action=nextnewest*
Disallow: *action=nextoldest*
Disallow: *sort=*
Disallow: *order=*
Disallow: *mode=*
Disallow: *datecut=*
Allow: /

But when I search Google for site:http://forum.mytabletguru.com/

I find that calendar.php, action.php and other pages are also indexed.
Please help me solve this issue.
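
One way to sanity-check rules like these is to feed them to a parser and ask whether a given URL may be fetched. The sketch below uses Python's standard urllib.robotparser, which does plain prefix matching and does not implement the * wildcards, so it only approximates how Google reads the file. The two-rule excerpts and the helper name is_allowed are chosen for illustration.

```python
from urllib.robotparser import RobotFileParser

def is_allowed(robots_lines, url, agent="*"):
    """Return True if `agent` may fetch `url` under the given robots.txt lines."""
    parser = RobotFileParser()
    parser.parse(robots_lines)
    return parser.can_fetch(agent, url)

# Excerpt of the rules as posted: the path has no leading "/".
posted = ["User-Agent: *", "Disallow: calendar.php", "Allow: /"]

# The same excerpt with the path written from the site root.
rooted = ["User-Agent: *", "Disallow: /calendar.php", "Allow: /"]

url = "http://forum.mytabletguru.com/calendar.php"
print(is_allowed(posted, url))  # the rule without "/" never matches the URL path
print(is_allowed(rooted, url))  # the rooted rule blocks the URL
```

In this parser the URL path always starts with "/", so a rule written as "calendar.php" cannot be a prefix of it and the request falls through to "Allow: /".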
(2013-11-07, 06:07 PM)sidhartha Wrote: I am using MyBB on a subdomain: http://forum.mytabletguru.com/

I had the same problem. In your case, the paths in your robots.txt are
missing the leading "/".
To fix your problem:
Step 1: Replace your robots.txt with this:

User-Agent: *
Disallow: /captcha.php
Disallow: /editpost.php
Disallow: /member.php
Disallow: /misc.php
Disallow: /modcp.php
Disallow: /moderation.php
Disallow: /newreply.php
Disallow: /newthread.php
Disallow: /online.php
Disallow: /printthread.php
Disallow: /private.php
Disallow: /ratethread.php
Disallow: /report.php
Disallow: /reputation.php
Disallow: /search.php
Disallow: /sendthread.php
Disallow: /task.php
Disallow: /usercp.php
Disallow: /usercp2.php
Disallow: /calendar.php
Disallow: /*action=emailuser*
Disallow: /*action=nextnewest*
Disallow: /*action=nextoldest*
Disallow: /*year=*
Disallow: /*action=weekview*
Disallow: /*action=nextnewest*
Disallow: /*action=nextoldest*
Disallow: /*sort=*
Disallow: /*order=*
Disallow: /*mode=*
Disallow: /*datecut=*
Allow: /
Step 2: Open Google Webmaster Tools every day, select the Crawl Errors option and choose Mark as fixed until the number of crawl errors reaches 0.

Good luck!
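
For the wildcard lines, here is a minimal sketch of how such patterns are commonly interpreted: "*" matches any run of characters, a trailing "$" anchors the end of the URL, and matching starts at the beginning of the path. This is my own small matcher, not Google's code, and the URLs are hypothetical MyBB-style examples.

```python
import re

def rule_to_regex(pattern):
    """Compile a robots.txt path pattern with "*" and trailing "$" support."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and let each "*" match any run of characters.
    pieces = [re.escape(piece) for piece in pattern.split("*")]
    return re.compile(".*".join(pieces) + ("$" if anchored else ""))

def rule_matches(pattern, path_and_query):
    """Return True if the pattern matches from the start of the URL path."""
    return rule_to_regex(pattern).match(path_and_query) is not None

# Hypothetical URLs, used only for illustration.
print(rule_matches("/*action=emailuser*", "/member.php?action=emailuser&uid=1"))
print(rule_matches("/*action=emailuser", "/member.php?action=emailuser&uid=1"))
print(rule_matches("/calendar.php", "/calendar.php?action=weekview"))
print(rule_matches("calendar.php", "/calendar.php"))
```

Under this reading the trailing "*" is redundant, because a rule already matches as a prefix, and only the last call fails to match, since the pattern does not begin at the root of the path.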
I have tried the steps mentioned above, but all the pages are still indexed. :(

Google: site:http://forum.mytabletguru.com/
^ The robots.txt file should be located in the root folder of your domain. See replies here.

If your MyBB forum is located in a sub-folder, then you have to include the folder path in the robots.txt rules,
e.g. Disallow: /forums/captcha.php
The forum is not in a sub-folder but in a completely separate folder.
The main site is in the /Mytableguru folder.
The forum is in /Forummytabletguru.
^ The robots.txt file should be in the public_html, www or htdocs folder. The file paths should be written out in full in robots.txt,
e.g. Disallow: /forummytabletguru/captcha.php
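
As far as I understand the Robots Exclusion Protocol, crawlers fetch robots.txt from the root of each host name and compare the rules with the path part of the URL, not with the folder layout on disk. Here is a small sketch of that mapping using only urllib.parse. The function name is my own, and example.com/forums/ is a made-up sub-folder install for comparison.

```python
from urllib.parse import urlsplit, urlunsplit

def robots_location_and_rule_path(page_url):
    """Return (robots.txt URL for the page's host, path a rule must match)."""
    parts = urlsplit(page_url)
    robots_url = urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))
    return robots_url, parts.path or "/"

# Forum served from its own subdomain, as in this thread.
print(robots_location_and_rule_path("http://forum.mytabletguru.com/captcha.php"))

# Hypothetical forum installed in a sub-folder of the main site.
print(robots_location_and_rule_path("http://example.com/forums/captcha.php"))
```

On that reading, a forum reached at http://forum.mytabletguru.com/ would keep Disallow: /captcha.php, and a prefix such as /forummytabletguru/ would only be needed if that folder name actually appears in the forum's URLs.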