Bayes for blog comment moderation?

This is really a call for arms to any Perl hackers with serious amounts of hybris out there: I recently installed the latest “POPFile”:http://popfile.sourceforge.net/ from CVS[Concurrent Versioning System] in order to help in auto-scoring messages on Usenet. With great results, I might add. It already seems near perfect at anticipating what I’ll find interesting to read or not, and.
Mark Pilgrim is writing about a “troll in Sam Ruby’s comment system”:http://diveintomark.org/archives/2003/06/22/why_im_unsubscribing_from_your_blog.html, and I’m just wondering: couldn’t Naive Bayesian filtering be used to automoderate weblog comments?
*Update:* Apparently, “Ben Hammersley”:http://www.benhammersley.com/archives/004850.html and “Jim Hughes”:http://www.cix.co.uk/~jimh/weblog/2003_06_01_archive.html#95940608 are thinking along the same lines as I am.

Previous Post
Next Post

3 Comments

  1. How do you use POPfile for nntp? I thought it was POP3 only.

  2. I am using the latest development version, the one that eventually will be named 0.20.0.
    If you are unfamiliar with CVS, there are some “basic instructions here.”:http://sourceforge.net/cvs/?group_id=63137 If you are a windows user, and would like a client, you can either download and install “Cygwin”:http://sources.redhat.com/cygwin/download.html – or you can install “TortoiseCVS”:http://tortoisecvs.sourceforge.net – which integrates into the windows shell. TortoiseCVS also has instructions that makes CVS easier to understand.
    If you’re unable to download it from CVS, ask me nicely, and I might offer a zip-file for download 🙂

  3. Ah, thanks. I didn’t notice the CVS part.