FIELD: computer equipment.
SUBSTANCE: method includes: extraction of metadata and informative part of document, conversion of document from storage format into text, conversion of words into word forms, discarding non-significant words, counting word weights, generating a set of classification features, wherein at the training step, a system of predicates for identifying the confidentiality mark of the document is generated based on the set of classified documents; at the document classification step, based on the characteristics, a decision is made on the relevance of the document of each of the confidentiality marks, at the training stage, based on the set of manually classified authorized users, forming a predicate identification system of their confidentiality mark, wherein on the basis of confidentiality marks of incoming documents and access rights of authorized users of system to these documents form a set of classification features.
EFFECT: automatic classification of formalized text documents and authorized users of electronic document management system according to confidentiality marks.
1 cl, 1 dwg, 1 tbl
Title | Year | Author | Number |
---|---|---|---|
METHOD OF AUTOMATIC CLASSIFICATION OF CONFIDENTIAL FORMALIZED DOCUMENTS IN ELECTRONIC DOCUMENT MANAGEMENT SYSTEM | 2015 |
|
RU2647640C2 |
METHOD FOR AUTOMATIC CLASSIFICATION OF ELECTRONIC DOCUMENTS IN AN ELECTRONIC DOCUMENT MANAGEMENT SYSTEM WITH AUTOMATIC GENERATION OF ELECTRONIC CASES | 2019 |
|
RU2726931C1 |
METHOD FOR AUTOMATIC CLASSIFICATION OF ELECTRONIC DOCUMENTS IN AN ELECTRONIC DOCUMENT MANAGEMENT SYSTEM WITH AUTOMATIC GENERATION OF RESOLUTION PROPS OF A MANAGER | 2018 |
|
RU2692972C1 |
METHOD FOR AUTOMATIC CLASSIFICATION OF FORMALIZED ELECTRONIC GRAPHIC AND TEXT DOCUMENTS IN THE ELECTRONIC DOCUMENT CIRCULATION SYSTEM WITH AUTOMATIC FORMATION OF ELECTRONIC CASES | 2020 |
|
RU2759887C1 |
METHOD OF AUTOMATED CLASSIFICATION OF FORMALISED DOCUMENTS IN ELECTRONIC DOCUMENT CIRCULATION SYSTEM | 2013 |
|
RU2546555C1 |
METHOD FOR AUTOMATED CLASSIFICATION OF DOCUMENTS | 2003 |
|
RU2254610C2 |
METHOD OF CLASSIFYING DOCUMENTS BY CATEGORIES | 2012 |
|
RU2491622C1 |
METHOD OF CLASSIFYING ELECTRONIC TEXT INFORMATION FOR AVAILABILITY OF CONFIDENTIAL DATA | 2024 |
|
RU2834318C1 |
CLASSIFICATION OF DOCUMENTS BY LEVELS OF CONFIDENTIALITY | 2019 |
|
RU2732850C1 |
METHOD FOR STREAM PROCESSING OF TEXT MESSAGES | 2003 |
|
RU2251148C1 |
Authors
Dates
2019-06-19—Published
2017-12-18—Filed