FIELD: means of classifying documents based on confidentiality levels.
SUBSTANCE: obtaining, using a computer system, at least one electronic document which includes text in a natural language. Metadata are obtained which are associated with at least one electronic document. Text in natural language is extracted from said at least one electronic document. At least a part of text is analyzed in natural language to obtain at least one of its lexical, morphological, syntactic or semantic features. At least one information object or its attribute represented by text in natural language is extracted from natural language text. A level of confidentiality is calculated by applying a set of classification rules to the extracted information objects and metadata of at least one electronic document. Associating with at least one electronic document is a metadata element reflecting the calculated level of confidentiality.
EFFECT: technical result is higher confidentiality of documents.
20 cl, 14 dwg
Title | Year | Author | Number |
---|---|---|---|
EXTRACTION OF INFORMATION USING ALTERNATIVE VARIANTS OF SEMANTIC-SYNTACTIC ANALYSIS | 2016 |
|
RU2646386C1 |
RETRIEVAL OF INFORMATION OBJECTS USING A COMBINATION OF CLASSIFIERS ANALYZING LOCAL AND NON-LOCAL SIGNS | 2018 |
|
RU2686000C1 |
METHOD OF EXTRACTING FACTS FROM TEXTS ON NATURAL LANGUAGE | 2016 |
|
RU2637992C1 |
TRAINING CLASSIFIERS USED TO EXTRACT INFORMATION FROM NATURAL LANGUAGE TEXTS | 2018 |
|
RU2691855C1 |
CLASSIFIER TRAINING USED FOR EXTRACTING INFORMATION FROM TEXTS IN NATURAL LANGUAGE | 2018 |
|
RU2681356C1 |
EXTRACTION OF INFORMATION FROM SANITARY BLOCKS OF DOCUMENTS USING MICROMODELS ON BASIS OF ONTOLOGY | 2017 |
|
RU2662688C1 |
RECOVERY OF TEXT ANNOTATIONS RELATED TO INFORMATION OBJECTS | 2017 |
|
RU2665261C1 |
USING VERIFIED BY USER DATA FOR TRAINING MODELS OF CONFIDENCE | 2016 |
|
RU2646380C1 |
VERIFICATION OF INFORMATION OBJECT ATTRIBUTES | 2016 |
|
RU2640718C1 |
SYSTEM AND METHOD FOR AUTOMATIC CREATION OF TEMPLATES | 2018 |
|
RU2697647C1 |
Authors
Dates
2020-09-23—Published
2019-04-29—Filed