Utilize este identificador para referenciar este registo:
https://hdl.handle.net/1822/10829
Título: | Spam email filtering using network-level properties |
Autor(es): | Cortez, Paulo Correia, André Sousa, Pedro Rocha, Miguel Rio, Miguel |
Palavras-chave: | Anti-Spam filtering Text Mining Naive Bayes Support Vector Machines |
Data: | 2010 |
Editora: | Springer |
Revista: | Lecture Notes in Computer Science |
Citação: | CORTEZ, Paulo [et al.] - Spam email filtering using network-level properties. In PERNER, Petra, ed. lit. – “Advances in Data Mining : applications and theoretical aspects : proceedings of the Industrial Conference on Data Mining (ICDM 2010), 10, Berlin, Germany, 2010” [Em linha]. Berlin : Springer, 2010. (Lecture Notes in Artificial Intelligence ; 6171) [Consult. 25 Ag. 2010]. p. 476-489. Disponível em: http://www.springerlink.com/content/e7u36014r04h0334. ISBN 978-3-642-14399-1. |
Resumo(s): | Spam is serious problem that affects email users (e.g. phishing attacks, viruses and time spent reading unwanted messages). We propose a novel spam email filtering approach based on network-level attributes (e.g. the IP sender geographic coordinates) that are more persistent in time when compared to message content. This approach was tested using two classifiers, Naive Bayes (NB) and Support Vector Machines (SVM), and compared against bag-of-words models and eight blacklists. Several experiments were held with recent collected legitimate (ham) and non legitimate (spam) messages, in order to simulate distinct user profiles from two countries (USA and Portugal). Overall, the network-level based SVM model achieved the best discriminatory performance. Moreover, preliminary results suggests that such method is more robust to phishing attacks. |
Tipo: | Artigo em ata de conferência |
URI: | https://hdl.handle.net/1822/10829 |
ISBN: | 978-3-642-14399-1 |
DOI: | 10.1007/978-3-642-14400-4_37 |
ISSN: | 0302-9743 |
Versão da editora: | © Springer. The original publication is available at: http://www.springerlink.com/content/e7u36014r04h0334 |
Arbitragem científica: | yes |
Acesso: | Acesso aberto |
Aparece nas coleções: | DI/CCTC - Artigos (papers) DSI - Engenharia da Programação e dos Sistemas Informáticos |
Ficheiros deste registo:
Ficheiro | Descrição | Tamanho | Formato | |
---|---|---|---|---|
2010-telescope.pdf | 1,44 MB | Adobe PDF | Ver/Abrir |