Mozilla Junk filter questions
(Tuesday 6th May 2003)

I can't really find answers to a couple of questions about Mozilla's Junk filter.

Background on this: I now have an example corpus of 1300+ Spam/Junk messages, and I have noticed a degradation in detection accuracy. At first I had a large corpus of Junk mail that I trained Moz on, and found it was overzealous. Having now marked a lot of messages as 'not junk', I see the exact opposite: it doesn't detect some obvious Junk at all...

So is it better to just have a rather small (200+) corpus of example Junk or what?

And then, is there a difference between messages not marked at all and messages marked 'not junk'?
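To make the questions above concrete, here is a minimal sketch of how a Graham-style Bayesian filter might score a single word. This is not Mozilla's actual code: the function name, the simplified formula, and the example counts are all assumptions for illustration. It also assumes that only explicitly marked messages feed the counts, which is exactly the thing I can't confirm for Mozilla.

```python
def token_junk_probability(junk_hits, good_hits, n_junk, n_good):
    """Simplified Graham-style estimate of P(junk | token).

    junk_hits / good_hits: trained messages of each kind containing the token.
    n_junk / n_good: total trained messages of each kind.
    Assumption: unmarked messages contribute to neither corpus.
    """
    junk_rate = min(1.0, junk_hits / n_junk) if n_junk else 0.0
    good_rate = min(1.0, good_hits / n_good) if n_good else 0.0
    if junk_rate + good_rate == 0:
        return 0.5  # token never seen in training: treat as neutral
    p = junk_rate / (junk_rate + good_rate)
    # Clamp so a single token can never be fully decisive.
    return max(0.01, min(0.99, p))


# Hypothetical word found in 130 of 1300 junk messages.
# With only 50 'not junk' examples, 5 hits make it look neutral:
small_good = token_junk_probability(130, 5, 1300, 50)
# With 500 'not junk' examples, the same 5 hits make it look junky:
large_good = token_junk_probability(130, 5, 1300, 500)
print(small_good, large_good)
```

In this sketch the score depends on the hit rate within each corpus, so the relative size and content of the junk and 'not junk' corpora both shift the result, and a message that was never marked changes nothing.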

[ by Martin ]

Martin Spernau
© 1994-2003




