FIELD: data processing.
SUBSTANCE: invention relates to systems and methods for detecting spam emails based on analysis of statistical data on received emails from clients. Result is achieved due to stages, at which: obtaining statistical data of messages of clients and result of classification of messages on the side of clients; excluding from further analysis data of messages, which relate to at least one of: messages, which are automatic replies; messages relating to internal correspondence of the organization; generating features based on statistical data for non-excluded messages; combining features into a single vector for each message; clustering of messages is carried out using a machine learning algorithm based on vectors of signs of messages; messages in each cluster are classified according to the following rules: if the cluster contains more than a predetermined threshold of messages classified on the client side as messages containing spam, classifying all messages in the cluster as messages containing spam; if the cluster contains not more than a predetermined threshold of messages classified as spam on the client side, the messages which were not classified as spam on the client side are classified using a machine learning model.
EFFECT: high accuracy of detecting spam emails.
17 cl, 4 dwg
Title | Year | Author | Number |
---|---|---|---|
METHOD OF CLASSIFYING EMAIL MESSAGES AND SYSTEM FOR IMPLEMENTING IT | 2024 |
|
RU2828610C1 |
METHOD FOR CLUSTERING SPAM EMAILS | 2021 |
|
RU2769633C1 |
SYSTEM AND METHOD OF GENERATING HEURISTIC RULES FOR DETECTING MESSAGES CONTAINING SPAM | 2019 |
|
RU2710739C1 |
METHOD OF DETECTING FRAUDULENT LETTER RELATING TO CATEGORY OF INTERNAL BEC ATTACKS | 2021 |
|
RU2766539C1 |
METHOD FOR GENERATING THE SIGNATURE OF AN UNWANTED ELECTRONIC MESSAGE | 2021 |
|
RU2776924C1 |
SPAM DISPOSAL SYSTEM | 2021 |
|
RU2787308C1 |
METHOD FOR DETERMINATION OF PHISHING ELECTRONIC MESSAGE | 2020 |
|
RU2790330C2 |
METHOD FOR RECOGNIZING A MESSAGE AS SPAM THROUGH ANTI-SPAM QUARANTINE | 2019 |
|
RU2750643C2 |
SYSTEM AND METHOD OF RATING ELECTRONIC MESSAGES TO CONTROL SPAM | 2013 |
|
RU2541123C1 |
SYSTEM AND METHOD FOR RESTRICTING RECEPTION OF ELECTRONIC MESSAGES FROM A MASS SPAM MAIL SENDER | 2021 |
|
RU2787303C1 |
Authors
Dates
2024-10-14—Published
2024-03-27—Filed