TRAINING CLASSIFIERS USED TO EXTRACT INFORMATION FROM NATURAL LANGUAGE TEXTS Russian patent published in 2019 - IPC G06K9/66 G06F17/27 G06F17/28 

Abstract RU 2691855 C1

FIELD: data processing.

SUBSTANCE: invention relates to a system and methods of extracting information from natural language texts. Method of extracting information from natural language texts includes: training information extraction classifier to extract first plurality of information objects from text in natural language, wherein determination of information extraction classifier includes one or more hyperparameters; obtaining a list of extracted information objects by performing a conflict arbitration function with respect to a plurality of conflicting information objects; changing values of hyperparameters of information extraction classifier; and optimizing the information extraction quality factor for the list of extracted information objects by iterative repetition of the information extraction classifier training operations, performing the conflict arbitration function and changing the hyperparameter values.

EFFECT: high efficiency and quality of extracting information from natural language texts.

25 cl, 16 dwg

Similar patents RU2691855C1

Title Year Author Number
CLASSIFIER TRAINING USED FOR EXTRACTING INFORMATION FROM TEXTS IN NATURAL LANGUAGE 2018
  • Matskevich Stepan Evgenevich
  • Bulgakov Ilya Aleksandrovich
RU2681356C1
SELECTION OF TEXT CLASSIFIER PARAMETER BASED ON SEMANTIC CHARACTERISTICS 2016
  • Kolotienko Sergej Sergeevich
  • Anisimovich Konstantin Vladimirovich
RU2628431C1
CLASSIFICATION OF TEXTS ON NATURAL LANGUAGE BASED ON SEMANTIC SIGNS 2016
  • Kolotienko Sergej Sergeevich
  • Anisimovich Konstantin Vladimirovich
  • Myakutin Andrej Valerevich
  • Indenbom Evgenij Mikhajlovich
RU2628436C1
USING VERIFIED BY USER DATA FOR TRAINING MODELS OF CONFIDENCE 2016
  • Matskevich Stepan Evgenevich
  • Belov Andrej Aleksandrovich
RU2646380C1
RETRIEVAL OF INFORMATION OBJECTS USING A COMBINATION OF CLASSIFIERS ANALYZING LOCAL AND NON-LOCAL SIGNS 2018
  • Indenbom Evgenij Mikhajlovich
RU2686000C1
DEFINITION OF CONFIDENCE DEGREES RELATED TO ATTRIBUTE VALUES OF INFORMATION OBJECTS 2016
  • Belov Andrej Aleksandrovich
  • Matskevich Stepan Evgenevich
RU2640297C2
RECOVERY OF TEXT ANNOTATIONS RELATED TO INFORMATION OBJECTS 2017
  • Bulgakov Ilya Aleksandrovich
  • Indenbom Evgenij Mikhajlovich
RU2665261C1
VERIFICATION OF INFORMATION OBJECT ATTRIBUTES 2016
  • Pospelova Anna Alekseevna
  • Rakhmatulina Elmira Monirovna
RU2640718C1
EXTRACTING INFORMATION OBJECTS WITH THE HELP OF A CLASSIFIER COMBINATION 2017
  • Matskevich Stepan Evgenevich
  • Starostin Anatolij Sergeevich
  • Sukhodolov Dmitrij Andreevich
RU2679988C1
USE OF DEPTH SEMANTIC ANALYSIS OF TEXTS ON NATURAL LANGUAGE FOR CREATION OF TRAINING SAMPLES IN METHODS OF MACHINE TRAINING 2016
  • Anisimovich Konstantin Vladimirovich
  • Selegej Vladimir Pavlovich
  • Garashchuk Ruslan Vladimirovich
RU2636098C1

RU 2 691 855 C1

Authors

Matskevich Stepan Evgenevich

Bulgakov Ilya Aleksandrovich

Dates

2019-06-18Published

2018-03-23Filed