FIELD: examination of documents.
SUBSTANCE: invention relates to the extraction of facts from texts in natural languages. First set of information objects is extracted from the natural language text. Second set of information objects is extracted from the natural language text. Intermediate list of information objects is generated, including at least a subset of the first set of information objects and at least a subset of the second set of information objects. Set of conflicting information objects in the intermediate list of information objects is identified, where the first information object from the set of conflicting information objects belongs to the first set of information objects, and the second information object from the set of conflicting information objects belongs to the second set of information objects. Final list of information objects extracted from natural language text is generated, by means of applying the function of arbitration of the conflicting objects to the set of conflicting information objects, which performs at least one of the following actions: changing the first information object, deleting the first information object or merging two or more information objects from the set of conflicting information objects.
EFFECT: technical result consists in increasing the efficiency and quality of information extraction.
20 cl, 16 dwg
Title | Year | Author | Number |
---|---|---|---|
TRAINING CLASSIFIERS USED TO EXTRACT INFORMATION FROM NATURAL LANGUAGE TEXTS | 2018 |
|
RU2691855C1 |
SELECTION OF TEXT CLASSIFIER PARAMETER BASED ON SEMANTIC CHARACTERISTICS | 2016 |
|
RU2628431C1 |
CLASSIFICATION OF TEXTS ON NATURAL LANGUAGE BASED ON SEMANTIC SIGNS | 2016 |
|
RU2628436C1 |
USING VERIFIED BY USER DATA FOR TRAINING MODELS OF CONFIDENCE | 2016 |
|
RU2646380C1 |
RETRIEVAL OF INFORMATION OBJECTS USING A COMBINATION OF CLASSIFIERS ANALYZING LOCAL AND NON-LOCAL SIGNS | 2018 |
|
RU2686000C1 |
DEFINITION OF CONFIDENCE DEGREES RELATED TO ATTRIBUTE VALUES OF INFORMATION OBJECTS | 2016 |
|
RU2640297C2 |
RECOVERY OF TEXT ANNOTATIONS RELATED TO INFORMATION OBJECTS | 2017 |
|
RU2665261C1 |
VERIFICATION OF INFORMATION OBJECT ATTRIBUTES | 2016 |
|
RU2640718C1 |
EXTRACTING INFORMATION OBJECTS WITH THE HELP OF A CLASSIFIER COMBINATION | 2017 |
|
RU2679988C1 |
USE OF DEPTH SEMANTIC ANALYSIS OF TEXTS ON NATURAL LANGUAGE FOR CREATION OF TRAINING SAMPLES IN METHODS OF MACHINE TRAINING | 2016 |
|
RU2636098C1 |
Authors
Dates
2019-03-06—Published
2018-03-23—Filed