CLASSIFIER TRAINING USED FOR EXTRACTING INFORMATION FROM TEXTS IN NATURAL LANGUAGE Russian patent published in 2019 - IPC G06F17/27 

Abstract RU 2681356 C1

FIELD: examination of documents.

SUBSTANCE: invention relates to the extraction of facts from texts in natural languages. First set of information objects is extracted from the natural language text. Second set of information objects is extracted from the natural language text. Intermediate list of information objects is generated, including at least a subset of the first set of information objects and at least a subset of the second set of information objects. Set of conflicting information objects in the intermediate list of information objects is identified, where the first information object from the set of conflicting information objects belongs to the first set of information objects, and the second information object from the set of conflicting information objects belongs to the second set of information objects. Final list of information objects extracted from natural language text is generated, by means of applying the function of arbitration of the conflicting objects to the set of conflicting information objects, which performs at least one of the following actions: changing the first information object, deleting the first information object or merging two or more information objects from the set of conflicting information objects.

EFFECT: technical result consists in increasing the efficiency and quality of information extraction.

20 cl, 16 dwg

Similar patents RU2681356C1

Title Year Author Number
TRAINING CLASSIFIERS USED TO EXTRACT INFORMATION FROM NATURAL LANGUAGE TEXTS 2018
  • Matskevich Stepan Evgenevich
  • Bulgakov Ilya Aleksandrovich
RU2691855C1
SELECTION OF TEXT CLASSIFIER PARAMETER BASED ON SEMANTIC CHARACTERISTICS 2016
  • Kolotienko Sergej Sergeevich
  • Anisimovich Konstantin Vladimirovich
RU2628431C1
CLASSIFICATION OF TEXTS ON NATURAL LANGUAGE BASED ON SEMANTIC SIGNS 2016
  • Kolotienko Sergej Sergeevich
  • Anisimovich Konstantin Vladimirovich
  • Myakutin Andrej Valerevich
  • Indenbom Evgenij Mikhajlovich
RU2628436C1
USING VERIFIED BY USER DATA FOR TRAINING MODELS OF CONFIDENCE 2016
  • Matskevich Stepan Evgenevich
  • Belov Andrej Aleksandrovich
RU2646380C1
RETRIEVAL OF INFORMATION OBJECTS USING A COMBINATION OF CLASSIFIERS ANALYZING LOCAL AND NON-LOCAL SIGNS 2018
  • Indenbom Evgenij Mikhajlovich
RU2686000C1
DEFINITION OF CONFIDENCE DEGREES RELATED TO ATTRIBUTE VALUES OF INFORMATION OBJECTS 2016
  • Belov Andrej Aleksandrovich
  • Matskevich Stepan Evgenevich
RU2640297C2
RECOVERY OF TEXT ANNOTATIONS RELATED TO INFORMATION OBJECTS 2017
  • Bulgakov Ilya Aleksandrovich
  • Indenbom Evgenij Mikhajlovich
RU2665261C1
VERIFICATION OF INFORMATION OBJECT ATTRIBUTES 2016
  • Pospelova Anna Alekseevna
  • Rakhmatulina Elmira Monirovna
RU2640718C1
EXTRACTING INFORMATION OBJECTS WITH THE HELP OF A CLASSIFIER COMBINATION 2017
  • Matskevich Stepan Evgenevich
  • Starostin Anatolij Sergeevich
  • Sukhodolov Dmitrij Andreevich
RU2679988C1
USE OF DEPTH SEMANTIC ANALYSIS OF TEXTS ON NATURAL LANGUAGE FOR CREATION OF TRAINING SAMPLES IN METHODS OF MACHINE TRAINING 2016
  • Anisimovich Konstantin Vladimirovich
  • Selegej Vladimir Pavlovich
  • Garashchuk Ruslan Vladimirovich
RU2636098C1

RU 2 681 356 C1

Authors

Matskevich Stepan Evgenevich

Bulgakov Ilya Aleksandrovich

Dates

2019-03-06Published

2018-03-23Filed