Mailing List Archive: 49091 messages
  • Home
  • Script library
  • AltME Archive
  • Mailing list
  • Articles Index
  • Site search

[REBOL] Re: fighting spam paper & links / naive bayes / anybody ?

From: brett:codeconscious at: 26-Aug-2002 19:32

> << I ran it on a small set of spam and good emails - and it worked > beautifully until I realised that my logic was different to Paul's. :^) > Then I fixed it and it didn't work so good :^( >> > > I hope the broken one, that worked better, is still available for > comparison. :)
The broken program calculated the "interesting" words to be the words with the *highest* 15 probabilities of being spam - instead of calculating the highest variance from 0.5. So with small sample sizes it looked like it was working really well, but I doubt that it would work in all cases. Probably worth testing against a good set of emails. Regards, Brett.