METHOD AND SYSTEM FOR CLASSIFYING DATA FOR IDENTIFYING CONFIDENTIAL INFORMATION IN THE TEXT Russian patent published in 2021 - IPC G06F21/60 G06F40/279 G06N20/00 

Abstract RU 2755606 C2

FIELD: computer technology.

SUBSTANCE: invention relates to the field of computer technology for identifying confidential information. A computer-implemented method contains stages in which: data presented in a text format is obtained; obtained data is processed using machine learning algorithms, during which each word in a text is assigned with a tag corresponding to a given type of confidential information, wherein a classification matrix is formed for each machine learning algorithm, based on which an F-measure is calculated for each data type; the classification of each word in the text is performed based on texts with tags received from each machine learning algorithm and the F-measure matrix corresponding to machine learning algorithms, and the final version of the text with tags is formed; the classification of the text with tags attached to each word by privacy classes is performed based on the comparison of the combination of available tags in the text with specified tags of confidential information.

EFFECT: increased accuracy of classification of confidential information.

4 cl, 8 dwg

Similar patents RU2755606C2

Title Year Author Number
METHOD AND SYSTEM FOR RETRIEVING NAMED ENTITIES 2020
  • Emelyanov Anton Aleksandrovich
RU2760637C1
METHOD AND SYSTEM FOR CLASSIFYING DATA FOR IDENTIFYING CONFIDENTIAL INFORMATION 2019
  • Terenin Aleksej Alekseevich
  • Smirnov Dmitrij Vladimirovich
  • Strukov Dmitrij Konstantinovich
  • Koryakovskij Denis Aleksandrovich
RU2759786C1
METHOD AND SYSTEM FOR DEPERSONALIZATION OF CONFIDENTIAL DATA 2022
  • Babak Nikita Grigorevich
  • Belorybkin Leonid Yurevich
  • Terenin Aleksej Alekseevich
  • Shabrova Anastasiya Igorevna
RU2804747C1
TRAINING NEURAL NETWORKS USING LOSS FUNCTIONS REFLECTING RELATIONSHIPS BETWEEN NEIGHBOURING TOKENS 2018
  • Eugene Indenbom
  • Daniil Anastasiev
RU2721190C1
METHOD AND SYSTEM FOR DEPERSONALIZATION OF CONFIDENTIAL DATA 2022
  • Babak Nikita Grigorevich
  • Belorybkin Leonid Yurevich
  • Terenin Aleksej Alekseevich
  • Shabrova Anastasiya Igorevna
RU2802549C1
METHOD OF TRAINED RECURRENT NEURAL NETWORK DEBUGGING 2019
  • Zharov Yaroslav Maksimovich
  • Korzhenkov Denis Mikhajlovich
RU2715024C1
METHOD OF CREATING MODEL FOR ANALYSING DIALOGUES BASED ON ARTIFICIAL INTELLIGENCE FOR PROCESSING USER REQUESTS AND SYSTEM USING SUCH MODEL 2019
  • Antyukhov Denis Olegovich
  • Pugachev Leonid Petrovich
RU2730449C2
METHOD FOR CONTROLLING A DIALOGUE AND NATURAL LANGUAGE RECOGNITION SYSTEM IN A PLATFORM OF VIRTUAL ASSISTANTS 2020
  • Ashmanov Stanislav Igorevich
  • Sukhachev Pavel Sergeevich
  • Zorkij Fedor Kirillovich
RU2759090C1
RECOGNITION OF EVENTS ON PHOTOGRAPHS WITH AUTOMATIC SELECTION OF ALBUMS 2020
  • Savchenko Andrey Vladimirovich
RU2742602C1
SYSTEM AND METHOD FOR AUGMENTATION OF THE TRAINING SAMPLE FOR MACHINE LEARNING ALGORITHMS 2020
  • Shavrina Tatyana Olegovna
RU2758683C2

RU 2 755 606 C2

Authors

Terenin Aleksej Alekseevich

Kotova Margarita Aleksandrovna

Dates

2021-09-17Published

2019-10-16Filed