METHOD FOR ATTRIBUTION OF PARTIALLY STRUCTURED TEXTS FOR FORMATION OF NORMATIVE-REFERENCE INFORMATION

Russian patent published in 2021 - IPC G06F40/20 G06F40/40 G06F16/31

Abstract RU 2750852 C1

FIELD: computer technology.

SUBSTANCE: the invention relates to computer technology. The method for attributing partially structured texts to generate normative-reference information includes: selecting a training set of partially structured natural-language texts; extracting an appropriate set of features for each category of named entities; training a classification model on the training set and the per-category feature sets, the training using attributes and yielding a model for each named entity; checking the attributes; extracting, by a processor, tokens from unmarked text; and generating, by the processor, a marked-up representation of at least part of the text based on at least one of the tokens classified by category.
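The steps listed above can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not name the classifier, the features, or the entity categories, so everything here is an assumption made for illustration. The categories (NAME, SIZE, STANDARD), the feature functions, the toy training tokens, and the smoothed log-odds scorer that stands in for the classification model are all hypothetical.

```python
import math
import re
from collections import Counter

def shape(tok):
    """Coarse character shape, e.g. 'M8x40' -> 'A9a9'."""
    s = re.sub(r"[A-Z]+", "A", tok)
    s = re.sub(r"[a-z]+", "a", s)
    return re.sub(r"[0-9]+", "9", s)

# Hypothetical categories, each with its own feature set.
FEATURES = {
    "NAME": lambda t: {"shape=" + shape(t), "lower=" + t.lower()},
    "SIZE": lambda t: {"shape=" + shape(t),
                       "has_digit=" + str(any(c.isdigit() for c in t))},
    "STANDARD": lambda t: {"shape=" + shape(t), "prefix=" + t[:4].upper()},
}

# Hypothetical labeled tokens standing in for the training set of texts.
TRAIN = [
    ("Bolt", "NAME"), ("M8x40", "SIZE"), ("GOST-7798", "STANDARD"),
    ("Nut", "NAME"), ("M10", "SIZE"), ("GOST-5915", "STANDARD"),
    ("Washer", "NAME"), ("M12x60", "SIZE"), ("GOST-11371", "STANDARD"),
]

class CategoryModel:
    """Binary scorer for one category: smoothed log-odds of its features."""

    def __init__(self, extract):
        self.extract = extract
        self.pos, self.neg = Counter(), Counter()
        self.n_pos = self.n_neg = 0

    def fit(self, tokens, labels):
        for tok, is_pos in zip(tokens, labels):
            (self.pos if is_pos else self.neg).update(self.extract(tok))
            if is_pos:
                self.n_pos += 1
            else:
                self.n_neg += 1
        return self

    def score(self, tok):
        total = 0.0
        for f in self.extract(tok):
            p = (self.pos[f] + 1) / (self.n_pos + 2)  # add-one smoothing
            q = (self.neg[f] + 1) / (self.n_neg + 2)
            total += math.log(p / q)
        return total

def train(pairs):
    """Train one model per named-entity category."""
    tokens = [t for t, _ in pairs]
    return {cat: CategoryModel(fx).fit(tokens, [c == cat for _, c in pairs])
            for cat, fx in FEATURES.items()}

def classify(models, tok):
    """Best-scoring category, or None if no model scores above zero."""
    cat = max(models, key=lambda c: models[c].score(tok))
    return cat if models[cat].score(tok) > 0 else None

def check(models, pairs):
    """Fraction of labeled tokens that the models classify correctly."""
    return sum(classify(models, t) == c for t, c in pairs) / len(pairs)

def mark_up(models, text):
    """Tokenize unmarked text and wrap each classified token in a tag."""
    out = []
    for tok in text.split():
        cat = classify(models, tok)
        out.append(f"<{cat}>{tok}</{cat}>" if cat else tok)
    return " ".join(out)

MODELS = train(TRAIN)
print(check(MODELS, TRAIN))
print(mark_up(MODELS, "Screw M6x20 GOST-1491"))
```

With this toy data, the unmarked string "Screw M6x20 GOST-1491" is marked up as <NAME>Screw</NAME> <SIZE>M6x20</SIZE> <STANDARD>GOST-1491</STANDARD>. Tokens for which no category model scores above zero are left unmarked, which matches the abstract's wording that only part of the text need be marked up.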

EFFECT: increased speed of data attribution processes.

1 cl, 2 dwg, 1 tbl

Similar patents RU2750852C1

Title Year Author Number
NAMED ENTITIES FROM THE TEXT AUTOMATIC EXTRACTION 2014
  • Nekhaj Ilya Vladimirovich
RU2665239C2
USE OF DEPTH SEMANTIC ANALYSIS OF TEXTS ON NATURAL LANGUAGE FOR CREATION OF TRAINING SAMPLES IN METHODS OF MACHINE TRAINING 2016
  • Anisimovich Konstantin Vladimirovich
  • Selegej Vladimir Pavlovich
  • Garashchuk Ruslan Vladimirovich
RU2636098C1
RECOVERY OF TEXT ANNOTATIONS RELATED TO INFORMATION OBJECTS 2017
  • Bulgakov Ilya Aleksandrovich
  • Indenbom Evgenij Mikhajlovich
RU2665261C1
RETRIEVAL OF INFORMATION OBJECTS USING A COMBINATION OF CLASSIFIERS ANALYZING LOCAL AND NON-LOCAL SIGNS 2018
  • Indenbom Evgenij Mikhajlovich
RU2686000C1
MULTI STAGE RECOGNITION OF THE REPRESENT ESSENTIALS IN TEXTS ON THE NATURAL LANGUAGE ON THE BASIS OF MORPHOLOGICAL AND SEMANTIC SIGNS 2016
  • Anisimovich Konstantin Vladimirovich
  • Indenbom Evgeny Mihaylovich
  • Novitskiy Valery Igorevich
RU2619193C1
METHOD FOR AUTOMATED EXTRACTION OF SEMANTIC COMPONENTS FROM COMPOUND SENTENCES OF NATURAL LANGUAGE TEXTS IN MACHINE TRANSLATION SYSTEMS AND DEVICE FOR IMPLEMENTATION THEREOF 2021
  • Karpov Anton Gennadevich
  • Khachukaev Eduard Magomedovich
  • Khachukaeva Elina Eduardovna
RU2766821C1
METHOD OF EXTRACTING FACTS FROM TEXTS ON NATURAL LANGUAGE 2016
  • Starostin Anatolij Sergeevich
  • Smurov Ivan Mikhajlovich
  • Dzhumaev Stanislav Sergeevich
RU2637992C1
AUTOMATED LEGAL ADVICE SYSTEM CONTROL METHOD 2019
  • Prikhodko Olga Viktorovna
  • Khyurri Ruslan Vladimirovich
RU2718978C1
USING VERIFIED BY USER DATA FOR TRAINING MODELS OF CONFIDENCE 2016
  • Matskevich Stepan Evgenevich
  • Belov Andrej Aleksandrovich
RU2646380C1
CLASSIFICATION OF DOCUMENTS BY LEVELS OF CONFIDENTIALITY 2019
  • Zyuzin Andrej Andreevich
  • Uskova Olesya Vladimirovna
RU2732850C1

RU 2 750 852 C1

Authors

Fedosin Sergei Alekseevich

Plotnikova Natalia Pavlovna

Martynov Vladislav Aleksandrovich

Ryskin Konstantin Eduardovich

Kuznetsov Dmitrii Aleksandrovich

Deniskin Aleksandr Vladimirovich

Vechkanova Iuliia Sergeevna

Fediushkin Nikolai Alekseevich

Tsilikov Nikita Sergeevich

Dates

2021-07-05 Published

2020-10-19 Filed