Am looking for a dictionary of swear words

P

peteZ

I have completed my project to develop an anti-spam plugin to Outlook 2003.

It uses VS2005 and VSTO and SQL Express (written in C#) and I must say I am
thrilled with the result. I have been testing on users (like me) who get >
400 spams per day (thus the fake email addy)) and have reduced it to either
1, 2 or 3 or, more frequently no spam each day - it's detected and moved to
configured spam folders. As it gets smarter, I get less spam.

I am using a few new tricks (not Bayesian either) and am getting very
accurate results apart from a few false positves (about 0.5%) which I am not
happy about.

I have developed a word / sentence analysis algorithm that gets smarter with
each email I mark as spam (or the domain as a spam domain) but need to
crystalise a list of "bad words" ie. swear or porno words to better the
algorithm.

Does anyone now of a set of these word types that are freeware ? I can
create my own but it might be a bit of a job to build, but it's quite
possible.

I'd prefer a list that could be loaded into a SQL Server database

thanks if you can help,

- peteZ
 
B

Brian Tillman

peteZ said:
I have developed a word / sentence analysis algorithm that gets
smarter with each email I mark as spam (or the domain as a spam
domain) but need to crystalise a list of "bad words" ie. swear or
porno words to better the algorithm.

Does anyone now of a set of these word types that are freeware ?

The alt.comp.freeware newsgroup may be a jumping off point for that.
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Top