FIELD: computer engineering.
SUBSTANCE: a method of detecting confidential data in structured documents comprises the steps of: a) obtaining a structured document containing data in tabular form; b) processing the data using a machine learning model based on a neural network trained to identify confidential data, during which: confidential data are determined in each cell of the table of the structured document; a tag corresponding to the type of confidential information is determined for each cell of a column; columns are tagged based on the tags of the cells of that column; in each column, the content of each tag is compared with a threshold value set for that tag according to the type of confidential data; and a column tag is determined based on the comparison results; c) assigning a degree of criticality to the structured document; d) sending the classified document to the security system.
EFFECT: high accuracy of detecting confidential data in structured documents.
2 cl, 3 dwg, 1 tbl
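The column-tagging logic of steps b) and c) can be illustrated with a minimal sketch. It rests on several assumptions that are not in the abstract: a pair of regular expressions stands in for the trained neural network, the "content" of a tag is read as the share of cells in the column carrying that tag, and the tag names, threshold values, and criticality levels are hypothetical. Step d), sending the document to the security system, is not covered.

```python
import re
from collections import Counter

# Hypothetical stand-in for the trained neural network: regex rules per tag.
CELL_RULES = {
    "EMAIL": re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$"),
    "PHONE": re.compile(r"^\+?\d[\d\s()-]{8,}\d$"),
}

# Hypothetical per-tag thresholds: minimum share of cells in a column
# that must carry the tag before the column itself receives it.
THRESHOLDS = {"EMAIL": 0.5, "PHONE": 0.6}

# Hypothetical criticality level per tag (0 means nothing confidential found).
CRITICALITY = {"EMAIL": 1, "PHONE": 2}


def tag_cell(value):
    """Return the tag of the first rule matching the cell, or None."""
    text = str(value).strip()
    for tag, pattern in CELL_RULES.items():
        if pattern.match(text):
            return tag
    return None


def tag_column(cells):
    """Tag a column by comparing each tag's share of cells with its threshold."""
    if not cells:
        return None
    counts = Counter(t for t in map(tag_cell, cells) if t is not None)
    shares = {tag: n / len(cells) for tag, n in counts.items()}
    passing = {tag: s for tag, s in shares.items() if s >= THRESHOLDS[tag]}
    if not passing:
        return None
    # If several tags pass, keep the one with the largest share.
    return max(passing, key=passing.get)


def classify_table(columns):
    """Return (column tags, document criticality) for a {name: cells} table."""
    column_tags = {name: tag_column(cells) for name, cells in columns.items()}
    level = max((CRITICALITY[t] for t in column_tags.values() if t), default=0)
    return column_tags, level
```

Calling `classify_table` with a dictionary that maps column names to lists of cell values returns the tag chosen for each column and the highest criticality level among the tagged columns. A column in which a tag appears in too few cells is left untagged, which is the purpose of the per-tag threshold comparison.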
Title | Year | Author | Number |
---|---|---|---|
METHOD AND SYSTEM FOR CLASSIFYING DATA FOR IDENTIFYING CONFIDENTIAL INFORMATION | 2019 | | RU2759786C1 |
METHOD AND SYSTEM FOR IDENTIFYING DATA SUBJECT TO DEANONYMISATION IN IMPERSONAL DATA SET | 2024 | | RU2837785C1 |
METHOD AND SYSTEM FOR CLASSIFYING DATA FOR IDENTIFYING CONFIDENTIAL INFORMATION IN THE TEXT | 2019 | | RU2755606C2 |
METHOD AND SYSTEM FOR RECOGNIZING INFORMATION CONSTITUTING TRADE SECRET | 2024 | | RU2841161C1 |
INFORMATION LEAKAGE PREVENTION SYSTEM AND METHOD OF INFORMATION LEAKAGE PREVENTION | 2024 | | RU2830388C1 |
METHOD AND SYSTEM FOR DEPERSONALIZATION OF CONFIDENTIAL DATA | 2022 | | RU2804747C1 |
METHOD AND SYSTEM FOR DEPERSONALIZATION OF CONFIDENTIAL DATA | 2022 | | RU2802549C1 |
METHOD OF DETERMINING PROFILE OF MOBILE DEVICE USER ON MOBILE DEVICE ITSELF AND DEMOGRAPHIC PROFILING SYSTEM | 2016 | | RU2647661C1 |
SYSTEM AND METHOD FOR DETECTING PHISHING WEB PAGES | 2024 | | RU2836604C1 |
METHOD OF CLASSIFYING ELECTRONIC TEXT INFORMATION FOR AVAILABILITY OF CONFIDENTIAL DATA | 2024 | | RU2834318C1 |
Dates
Published: 2025-04-17
Filed: 2023-06-15