FIELD: computer equipment.
SUBSTANCE: method includes: extraction of metadata and informative part of document, conversion of document from storage format into text, conversion of words into word forms, discarding non-significant words, counting word weights, generating a set of classification features, wherein at the training step, a system of predicates for identifying the confidentiality mark of the document is generated based on the set of classified documents; at the document classification step, based on the characteristics, a decision is made on the relevance of the document of each of the confidentiality marks, at the training stage, based on the set of manually classified authorized users, forming a predicate identification system of their confidentiality mark, wherein on the basis of confidentiality marks of incoming documents and access rights of authorized users of system to these documents form a set of classification features.
EFFECT: automatic classification of formalized text documents and authorized users of electronic document management system according to confidentiality marks.
1 cl, 1 dwg, 1 tbl
| Title | Year | Author | Number | 
|---|---|---|---|
| METHOD OF AUTOMATIC CLASSIFICATION OF CONFIDENTIAL FORMALIZED DOCUMENTS IN ELECTRONIC DOCUMENT MANAGEMENT SYSTEM | 2015 | 
 | RU2647640C2 | 
| METHOD FOR AUTOMATIC CLASSIFICATION OF ELECTRONIC DOCUMENTS IN AN ELECTRONIC DOCUMENT MANAGEMENT SYSTEM WITH AUTOMATIC GENERATION OF ELECTRONIC CASES | 2019 | 
 | RU2726931C1 | 
| METHOD FOR AUTOMATIC CLASSIFICATION OF ELECTRONIC DOCUMENTS IN AN ELECTRONIC DOCUMENT MANAGEMENT SYSTEM WITH AUTOMATIC GENERATION OF RESOLUTION PROPS OF A MANAGER | 2018 | 
 | RU2692972C1 | 
| METHOD FOR AUTOMATIC CLASSIFICATION OF FORMALIZED ELECTRONIC GRAPHIC AND TEXT DOCUMENTS IN THE ELECTRONIC DOCUMENT CIRCULATION SYSTEM WITH AUTOMATIC FORMATION OF ELECTRONIC CASES | 2020 | 
 | RU2759887C1 | 
| METHOD OF AUTOMATED CLASSIFICATION OF FORMALISED DOCUMENTS IN ELECTRONIC DOCUMENT CIRCULATION SYSTEM | 2013 | 
 | RU2546555C1 | 
| METHOD FOR AUTOMATED CLASSIFICATION OF DOCUMENTS | 2003 | 
 | RU2254610C2 | 
| METHOD OF CLASSIFYING DOCUMENTS BY CATEGORIES | 2012 | 
 | RU2491622C1 | 
| METHOD OF CLASSIFYING ELECTRONIC TEXT INFORMATION FOR AVAILABILITY OF CONFIDENTIAL DATA | 2024 | 
 | RU2834318C1 | 
| CLASSIFICATION OF DOCUMENTS BY LEVELS OF CONFIDENTIALITY | 2019 | 
 | RU2732850C1 | 
| METHOD FOR STREAM PROCESSING OF TEXT MESSAGES | 2003 | 
 | RU2251148C1 | 
Authors
Dates
2019-06-19—Published
2017-12-18—Filed