01823nas a2200265 4500000000100000000000100001008004100002260001200043653003900055653001800094653001600112653001600128100003200144700002100176700002600197700002000223700002500243700001800268245007300286856007900359300001000438490000600448520108900454022001401543 2023 d c12/202310aConvolutional Neural Network (CNN)10aDeep Learning10aSpam Filter10aText Mining1 aIñaki Vélez de Mendizabal1 aXabier Vidriales1 aVitor Basto-Fernandes1 aEnaitz Ezpeleta1 aJosé Ramón Méndez1 aUrko Zurutuza00aDeobfuscating Leetspeak With Deep Learning to Improve Spam Filtering uhttps://www.ijimai.org/journal/sites/default/files/2023-11/ijimai8_4_4.pdf a46-550 v83 aThe evolution of anti-spam filters has forced spammers to make greater efforts to bypass filters in order to distribute content over networks. The distribution of content encoded in images or the use of Leetspeak are concrete and clear examples of techniques currently used to bypass filters. Despite the importance of dealing with these problems, the number of studies to solve them is quite small, and the reported performance is very limited. This study reviews the work done so far (very rudimentary) for Leetspeak deobfuscation and proposes a new technique based on using neural networks for decoding purposes. In addition, we distribute an image database specifically created for training Leetspeak decoding models. We have also created and made available four different corpora to analyse the performance of Leetspeak decoding schemes. Using these corpora, we have experimentally evaluated our neural network approach for decoding Leetspeak. The results obtained have shown the usefulness of the proposed model for addressing the deobfuscation of Leetspeak character sequences. a1989-1660