dc.contributor.author | Delany, Sarah Jane | |
dc.contributor.author | Cunningham, Padraig | |
dc.contributor.author | Tsymbal, Alexey | |
dc.date.accessioned | 2008-01-28T10:39:54Z | |
dc.date.available | 2008-01-28T10:39:54Z | |
dc.date.issued | 2005 | en |
dc.identifier.citation | Delany, Sarah Jane; Cunningham, Padraig; Tsymbal, Alexey. 'A Comparison of Ensemble and Case-Base Maintenance Techniques for Handling Concept Drift in Spam Filtering'. - Dublin, Trinity College Dublin, Department of Computer Science, TCD-CS-2005-19, 2005, pp6 | en |
dc.identifier.other | TCD-CS-2005-19 | |
dc.identifier.uri | http://hdl.handle.net/2262/13439 | |
dc.description.abstract | The problem of concept drift has recently received
considerable attention in machine learning
research. One important practical problem where
concept drift needs to be addressed is spam filtering.
The literature on concept drift shows that
among the most promising approaches are ensembles
and a variety of techniques for ensemble construction
has been proposed. In this paper we consider
an alternative lazy learning approach to concept
drift whereby a single case-based classifier
for spam filtering keeps itself up-to-date through
a case-base maintenance protocol. We present an
evaluation that shows that the case-base maintenance
approach is more effective than a variety of
ensemble techniques. The evaluation is complicated
by the overriding importance of False Positives
(FPs) in spam filtering. The ensemble approaches
can have very good performance on FPs
because it is possible to bias an ensemble more
strongly away from FPs than it is to bias the single
classifer. However this comes at considerable
cost in overall accuracy. | en |
dc.description.sponsorship | This research was supported by funding from Enterprise Ireland
under grant no. CFTD/03/219 and funding from Science Foundation
Ireland under grant no. SFI-02IN.1I111 | en |
dc.format.extent | 93675 bytes | |
dc.format.mimetype | application/pdf | |
dc.language.iso | en | en |
dc.publisher | Trinity College Dublin, Department of Computer Science | en |
dc.relation.ispartofseries | Computer Science Technical Report | en |
dc.relation.ispartofseries | TCD-CS-2005-19 | en |
dc.relation.haspart | TCD-CS-[no.] | en |
dc.subject | Computer Science | en |
dc.title | A Comparison of Ensemble and Case-Base Maintenance Techniques for Handling Concept Drift in Spam Filtering | en |
dc.type | Technical Report | en |
dc.contributor.sponsor | Enterprise Ireland | |
dc.contributor.sponsor | Science Foundation Ireland | en |
dc.identifier.rssuri | https://www.cs.tcd.ie/publications/tech-reports/reports.05/TCD-CS-2005-19.pdf | |