You are browsing the archive for WordPress.

by admin

WordPress plugins: “More like this” from across the blogosphere

2:00 pm in Aggregation, Last.fm, Lucene, Political Blogging, Technical, WordPress by admin

Here’s a first look at Poblish‘s first WordPress plugin.

It looks at the content of the current blog post, and automatically identifies related content from across all the content hosted at Poblish – currently 216,296 articles from 1,698 working feeds – returning you a list of the most closely matching articles in under a second.

You can click the name of any blogger or blog to see their live feed (pictured) in a Facebook-style popup frame.

In fact, forget about the screenshot, because you can see the plugin installed on this very blog – just look at the foot of this post, and scroll forward and back through our other posts.

The plugin is stable, but needs to be packaged-up a little so it fits seamlessly into the WordPress world. If you’re impatient to try it out, though, drop me a line and I’ll let you know the two or three steps you need to follow.

Let me know if you have any ideas of your own for developing the plugin. Some of mine are:

  • Ignoring matches from your own blog.
  • Restricting matches by date.
  • Restricting to matches with the same set of tags as the current post – somewhat influenced by Last.fm Radio.

by admin

Taking ‘Possibly Related Posts’ to the next level

9:30 am in Natural Language Processing, Policy-making, Political Blogging, Semantic Web, Stemming, Stopwords, WordPress by admin

Many WordPress bloggers use plugins like these to help people who read their posts find other, related posts.

That’s all well and good if you only want to help readers find your own articles, but perhaps other political bloggers have made your own point better than you have? Let’s now turn that round: perhaps you’ve made another blogger’s point better than he has? Wouldn’t it be great if there was a Related Posts service that let people follow links to similar content from one blog to another, irrespective of who wrote it and where you started reading?

Poblish offers just this. Simply open a blog post from one of the 1500 feeds we monitor, and you’ll see a list of similar posts, ranked by similarity, from across all of those feeds.

If that wasn’t cool enough, the list of related posts updates automatically. So, if you create a new, collaborative article with us, you can watch the list update as your work progresses – literally as you type. That’s very useful – perhaps you make a particular point, then some articles appear that strongly refute that point. You might then reconsider, delete your last paragraph, and move on in a different vein.

I believe that tools like this are an essential part of making the political blogosphere a knowledge base, that can not only improve political blogging, but also improve policy-making.

(I should add, as an aside, that all these services essentially use Inverse Document Frequency algorithms. Here’s a worked example. They can work very well – Poblish’s especially, I hope, as our algorithm uses stemming and stopwords – but there’s no attempt by the computer to understand the text, or the context, so there will inevitably be howlers. These are not semantic solutions of the type I mentioned yesterday, but don’t worry: we have big plans in that area.)