Trinity College Dublin, Department of Computer Science
Delany, Sarah Jane; Cunningham, Pádraig. 'ECUE: A Spam Filter that Uses Machine Learning to Track Concept Drift'. - Dublin, Trinity College Dublin, Department of Computer Science, TCD-CS-2006-05, 2006, pp5
Computer Science Technical Report TCD-CS-2006-05
While text classification has been identified for some time as a
promising application area for Artificial Intelligence, so far few deployed
applications have been described. In this paper we present
a spam filtering system that uses example-based machine learning
techniques to train a classifier from examples of spam and legitimate
email. This approach has the advantage that it can personalise to the
specifics of the user’s filtering preferences. This classifier can also
automatically adjust over time to account for the changing nature
of spam (and indeed changes in the profile of legitimate email). A
significant software engineering challenge in developing this system
was to ensure that it could interoperate with existing email systems
to allow easy managment of the training data over time. This system
has been deployed and evaluated over an extended period and the
results of this evaluation are presented here.
Please note: There is a known bug in some browsers that causes an
error when a user tries to view large pdf file within the browser window.
If you receive the message "The file is damaged and could not be
repaired", please try one of the solutions linked below based on the
browser you are using.
Items in TARA are protected by copyright, with all rights reserved, unless otherwise indicated.