dc.contributor.author: Delany, Sarah Jane
dc.contributor.author: Cunningham, Padraig
dc.contributor.author: Tsymbal, Alexey
dc.date.accessioned: 2008-01-28T10:39:54Z
dc.date.available: 2008-01-28T10:39:54Z
dc.date.issued: 2005 [en]
dc.identifier.citation: Delany, Sarah Jane; Cunningham, Padraig; Tsymbal, Alexey. 'A Comparison of Ensemble and Case-Base Maintenance Techniques for Handling Concept Drift in Spam Filtering'. - Dublin, Trinity College Dublin, Department of Computer Science, TCD-CS-2005-19, 2005, pp6 [en]
dc.identifier.other: TCD-CS-2005-19
dc.identifier.uri: http://hdl.handle.net/2262/13439
dc.description.abstract: The problem of concept drift has recently received considerable attention in machine learning research. One important practical problem where concept drift needs to be addressed is spam filtering. The literature on concept drift shows that among the most promising approaches are ensembles, and a variety of techniques for ensemble construction has been proposed. In this paper we consider an alternative lazy learning approach to concept drift whereby a single case-based classifier for spam filtering keeps itself up-to-date through a case-base maintenance protocol. We present an evaluation that shows that the case-base maintenance approach is more effective than a variety of ensemble techniques. The evaluation is complicated by the overriding importance of False Positives (FPs) in spam filtering. The ensemble approaches can have very good performance on FPs because it is possible to bias an ensemble more strongly away from FPs than it is to bias the single classifier. However, this comes at considerable cost in overall accuracy. [en]
dc.description.sponsorship: This research was supported by funding from Enterprise Ireland under grant no. CFTD/03/219 and funding from Science Foundation Ireland under grant no. SFI-02IN.1I111 [en]
dc.format.extent: 93675 bytes
dc.format.mimetype: application/pdf
dc.language.iso: en [en]
dc.publisher: Trinity College Dublin, Department of Computer Science [en]
dc.relation.ispartofseries: Computer Science Technical Report [en]
dc.relation.ispartofseries: TCD-CS-2005-19 [en]
dc.relation.haspart: TCD-CS-[no.] [en]
dc.subject: Computer Science [en]
dc.title: A Comparison of Ensemble and Case-Base Maintenance Techniques for Handling Concept Drift in Spam Filtering [en]
dc.type: Technical Report [en]
dc.contributor.sponsor: Enterprise Ireland
dc.contributor.sponsor: Science Foundation Ireland [en]
dc.identifier.rssuri: https://www.cs.tcd.ie/publications/tech-reports/reports.05/TCD-CS-2005-19.pdf

