MyBB Community Forums

Full Version: Google SEO 1.6.8 [EOL]
You're currently viewing a stripped down version of our content. View the full version with proper formatting.
Oh sorry, I guess using the same search results to try it was a bad idea. PMing you the debug output with gzip off.
hello;

I have problem with this plugin.
I used CDN and also this plugin, when 404 is enable spider/bots can not fetch my site as a specified usergroups (return error 404) and only can fetch as guest usergroup.
for example at google webmaster when I enter site map return me error 404.
when I disable 404 plugin, bots can fetch forum but in google webmaster return me new error regarding to site map:
	
Sitemap is HTML
Your Sitemap appears to be an HTML page. Please use a supported sitemap format instead.
Hi, is it possible to convert this file (sitemap-index.xml) to .gz? How? Thanks!
The webserver should be gzipping it actually. If not, try adding this to your .htaccess: (text/xml covers the sitemap)

<IfModule mod_deflate.c>
    AddOutputFilterByType DEFLATE text/html text/plain text/xml text/x-js text/css application/javascript
</IfModule>

I'm not sure if compression in PHP (based on the MyBB setting) is currently happening with the sitemap; if not I'll see if I can add it in the next version. But in general it's better left to the webserver to figure out whether to compress stuff or not.
I have found that sitemap generates wrong last modification time for threads with many pages.

For example thread costist of 12 pages (this is real example from http://www.japanisch-netzwerk.de/sitemap...xml?page=8)

Look at the time of each page, they are all the same and it's time of last post in thread:
http://www.japanisch-netzwerk.de/Thread-NHK-Sumo-Japanisch-im-Sauseschritt 2011-11-28T12:24Z
http://www.japanisch-netzwerk.de/Thread-NHK-Sumo-Japanisch-im-Sauseschritt?page=2 2011-11-28T12:24Z 0.2
http://www.japanisch-netzwerk.de/Thread-NHK-Sumo-Japanisch-im-Sauseschritt?page=3 2011-11-28T12:24Z 0.2
http://www.japanisch-netzwerk.de/Thread-NHK-Sumo-Japanisch-im-Sauseschritt?page=4 2011-11-28T12:24Z 0.2
http://www.japanisch-netzwerk.de/Thread-NHK-Sumo-Japanisch-im-Sauseschritt?page=5 2011-11-28T12:24Z 0.2
http://www.japanisch-netzwerk.de/Thread-NHK-Sumo-Japanisch-im-Sauseschritt?page=6 2011-11-28T12:24Z 0.2
http://www.japanisch-netzwerk.de/Thread-NHK-Sumo-Japanisch-im-Sauseschritt?page=7 2011-11-28T12:24Z 0.2
http://www.japanisch-netzwerk.de/Thread-NHK-Sumo-Japanisch-im-Sauseschritt?page=8 2011-11-28T12:24Z 0.2
http://www.japanisch-netzwerk.de/Thread-NHK-Sumo-Japanisch-im-Sauseschritt?page=9 2011-11-28T12:24Z 0.2
http://www.japanisch-netzwerk.de/Thread-NHK-Sumo-Japanisch-im-Sauseschritt?page=10 2011-11-28T12:24Z 0.2
http://www.japanisch-netzwerk.de/Thread-NHK-Sumo-Japanisch-im-Sauseschritt?page=11 2011-11-28T12:24Z 0.2
http://www.japanisch-netzwerk.de/Thread-NHK-Sumo-Japanisch-im-Sauseschritt?page=12 2011-11-28T12:24Z 0.2

Those first 11 pages were not modified since last google visit, but sitemap says to google to read them again and again (sitemap update their modification date) after each time user posts new messages on page 12. (And what if thread consists of 1000 pages?) I think it leads to 2 things:

1. sitemap does not reflect reality, it produces wrong modification date for pages, and it leads to
2. extra overload from search bots
Hi there,

i updated mybb yesterday and i just noticed that Seo doesn's work any more.

If i go to the pluginsettings it says this:

1. URL is passive. Apply changes to core files to activate.
2. Redirect is enabled and URL is passive. Redirecting SEO URLs back to MyBB's default URLs.


Can someone help me with this issue?

Thank you.
@Maj: Yes, it's intentional though. It's a compromise.

It's not possible to produce the correct date. In order to do it you would have to query all posts of a thread (for each thread), filter out the invisible/moderated ones, bring them into the correct order, split em into the correct pagination, then find the largest create or edit time stamp for each page. Only then have you the real accurate value. Obtaining it however is quite simply way too expensive.

So instead of querying the entire posts table, the sitemap uses the timestamp given in the threads table, and there are only two timestamps to work with in there: the first post date (which is useless as soon as someone edits or replies), and the last post date (which is what is currently being used).

Interpolating the data is not really possible either; any heuristic you introduce that "guesses" the timestamp of the older pages is bound to introduce more noise than what we currently have.

The current behaviour is not all that bad. After all, users can delete posts and cause all other posts to move up, and moderators can merge and split threads which does the same, etc. So there are several cases where there are lots of changes happening even on earlier pages.

Also, threads with thousands of pages are the exception, not the rule. The vast majority of threads (of content) does not get replies anymore after a while, and for those threads the timestamps do not ever change. So the sitemap still serves its purpose that in the big picture of things, only new / changed content is highlighted.

The longest thread in this forum is the Google SEO 1.4.1 thread and it's still below 300 pages. The runner up (Game section) is below 150 pages.

As for forum games such as me thread, last one to post wins thread, if you're worried about giving too much noise to search engines, you might want consider moving them into a subforum that isn't visible / not indexed by Google. Such threads usually have very little in the way of valuable content.

@checkdutch,
well, as it says, the core file changes have to be applied. there should be a "click here to apply changes" also, which will do just that as long as your inc/functions.php file is writable.
(2011-12-07, 11:06 AM)frostschutz Wrote: [ -> ]So instead of querying the entire posts table, the sitemap uses the timestamp given in the threads table, and there are only two timestamps to work with in there: the first post date (which is useless as soon as someone edits or replies), and the last post date (which is what is currently being used).

I think it's good idea to set last modification date of page to date of last post in that page (ignoring hidden, and etc. posts). Anyway google reloads and checks all pages by itself (ignore sitemaps) regularly. And if there some modifications on earlier pages, google will reload them within two weeks.

Quote:As for forum games such as me thread, last one to post wins thread, if you're worried about giving too much noise to search engines, you might want consider moving them into a subforum that isn't visible / not indexed by Google.

Your plugin doesn't generate links in sitemaps for hidden forums?
(2011-12-07, 02:38 PM)Maj Wrote: [ -> ]I think it's good idea to set last modification date of page to date of last post in that page (ignoring hidden, and etc. posts).

would if I could without making wasteful queries

Quote:Your plugin doesn't generate links in sitemaps for hidden forums?

It doesn't, or at least it shouldn't. There's also an option to exclude specific forums from search engines, and these also should not be listed in the sitemap.
I've installed the Google Seo 1.6.2 plugin activated it and also added
RewriteBase /
RewriteRule ^sitemap-([^./]+)\.xml$ misc.php?google_seo_sitemap=$1 [L,QSA,NC]

to my .htaccess file

When i go /sitemap-index.xml everything seems alright but when i try to go /sitemap-forums.xml?page=1
/sitemap-threads.xml?page=1
/sitemap-users.xml?page=1
/sitemap-index.xml?page=1

all displays the same error "Sitemap page invalid."