2011-11-28, 12:30 AM
Just finished the first prototype of a plugin I'm working on called Post Relevance.
It attempts to compute how relevant a post is to the thread title using tf-idf weightings in a ranked retrieval system. The output of which is a little message on each post giving a score for how relevant the post is. The higher the score, the more relevant.
The plugin pulls in the thread title and splits it up into keywords, removing words like 'the' etc. Then it fetches synonyms for the keywords using http://words.bighugelabs.com/api.php , it removes all words like 'the' etc from the synonyms list.
Once it has a complete set of keywords from the thread title and the synonyms, it computes the tf-idf (wiki it) weightings using the term-frequency and the inverse-document-frequency. This ranks each post in a thread according to how relevant the respective post is to the thread title.
I will put the protoype online this week once I've cleaned it up a bit and added some ACP configuration settings.
For now, how useful do you think this would be as a plugin? Worth me releasing and maintaining it?
It attempts to compute how relevant a post is to the thread title using tf-idf weightings in a ranked retrieval system. The output of which is a little message on each post giving a score for how relevant the post is. The higher the score, the more relevant.
The plugin pulls in the thread title and splits it up into keywords, removing words like 'the' etc. Then it fetches synonyms for the keywords using http://words.bighugelabs.com/api.php , it removes all words like 'the' etc from the synonyms list.
Once it has a complete set of keywords from the thread title and the synonyms, it computes the tf-idf (wiki it) weightings using the term-frequency and the inverse-document-frequency. This ranks each post in a thread according to how relevant the respective post is to the thread title.
I will put the protoype online this week once I've cleaned it up a bit and added some ACP configuration settings.
For now, how useful do you think this would be as a plugin? Worth me releasing and maintaining it?