IDENTIFYING COLLOCATIONS IN THE TEXTS IN NATURAL LANGUAGE
Russian patent published in 2017 - IPC G06F9/00

Abstract RU 2618374 C1

FIELD: physics.

SUBSTANCE: a method of identifying collocations in natural language texts is carried out in stages. A computing device performs semantic-syntactic analysis of a natural language text to obtain a plurality of semantic structures. An initial list of word combinations is then formed on the basis of the relations defined by the semantic structures. A list of collocations is produced by applying a heuristic filter to the initial list of word combinations, where the heuristic filter uses a quality metric that is a function of the semantic classes and the frequencies of the relations between the words in a collocation. The list of collocations is then used to perform a natural language processing operation.

EFFECT: improved efficiency in solving natural language text processing tasks.

25 cl, 15 dwg
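
The stages named in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes the semantic-syntactic analysis has already been done and that each semantic structure is available as a list of head-dependent relations. The `Relation` record, the `CLASS_WEIGHTS` table, the semantic class names, the form of the `quality` function and the threshold value are all hypothetical; the abstract only states that the metric depends on semantic classes and relation frequencies.

```python
from collections import Counter
from typing import NamedTuple


class Relation(NamedTuple):
    """One head-dependent link from a semantic structure (hypothetical format)."""
    head: str
    head_class: str
    dependent: str
    dependent_class: str


# Hypothetical weights per pair of semantic classes; the abstract gives no values.
CLASS_WEIGHTS = {
    ("ACTION", "OBJECT"): 1.0,
    ("OBJECT", "ATTRIBUTE"): 0.8,
}
DEFAULT_WEIGHT = 0.1


def raw_combinations(structures):
    """Form the initial list: count every word pair linked by a relation."""
    counts = Counter()
    for structure in structures:
        for rel in structure:
            counts[rel] += 1
    return counts


def quality(rel, frequency, total):
    """Assumed metric: relative frequency scaled by a semantic-class weight."""
    weight = CLASS_WEIGHTS.get((rel.head_class, rel.dependent_class), DEFAULT_WEIGHT)
    return weight * frequency / total


def filter_collocations(counts, threshold):
    """Heuristic filter: keep combinations whose quality reaches the threshold."""
    total = sum(counts.values())
    if total == 0:
        return []
    kept = [rel for rel, freq in counts.items() if quality(rel, freq, total) >= threshold]
    return sorted(kept, key=lambda rel: (rel.head, rel.dependent))


# Toy input standing in for the output of semantic-syntactic analysis.
structures = [
    [Relation("make", "ACTION", "decision", "OBJECT"),
     Relation("decision", "OBJECT", "final", "ATTRIBUTE")],
    [Relation("make", "ACTION", "decision", "OBJECT")],
    [Relation("see", "PERCEPTION", "decision", "OBJECT")],
]
counts = raw_combinations(structures)
collocations = filter_collocations(counts, threshold=0.3)
for rel in collocations:
    print(rel.head, rel.dependent)
```

With this toy input only the pair "make decision" passes the filter, because it is both frequent and belongs to a highly weighted class pair. In a real system the weights and threshold would have to be tuned or learned rather than fixed by hand as they are here.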

Similar patents RU2618374C1

Title Year Author Number
TRAINING CLASSIFIERS USED TO EXTRACT INFORMATION FROM NATURAL LANGUAGE TEXTS 2018
  • Matskevich Stepan Evgenevich
  • Bulgakov Ilya Aleksandrovich
RU2691855C1
METHOD OF CLUSTERING OF SEARCH RESULTS DEPENDING ON SEMANTICS 2014
  • Andreev Sergey Gennadievich
RU2564629C1
CLASSIFIER TRAINING USED FOR EXTRACTING INFORMATION FROM TEXTS IN NATURAL LANGUAGE 2018
  • Matskevich Stepan Evgenevich
  • Bulgakov Ilya Aleksandrovich
RU2681356C1
SYSTEM FOR CREATING DOCUMENTS BASED ON TEXT ANALYSIS ON NATURAL LANGUAGE 2016
  • Danielyan Tatyana Vladimirovna
RU2639655C1
EXTRACTION OF ENTITIES FROM TEXTS IN NATURAL LANGUAGE 2015
  • Starostin Anatolij Sergeevich
  • Danielyan Tatyana Vladimirovna
  • Smurov Ivan Mikhajlovich
RU2626555C2
SENTIMENT ANALYSIS AT THE LEVEL OF ASPECTS USING METHODS OF MACHINE LEARNING 2016
  • Matskevich Stepan Evgenevich
  • Kuznetsova Ekaterina Sergeevna
  • Gusev Ilya Olegovich
RU2657173C2
EXPANDING OF INFORMATION SEARCH POSSIBILITY 2015
  • Danielyan Tatyana Vladimirovna
  • Indenbom Evgenij Mikhajlovich
RU2618375C2
METHOD OF EXTRACTING FACTS FROM TEXTS ON NATURAL LANGUAGE 2016
  • Starostin Anatolij Sergeevich
  • Smurov Ivan Mikhajlovich
  • Dzhumaev Stanislav Sergeevich
RU2637992C1
CREATION OF ONTOLOGIES BASED ON NATURAL LANGUAGE TEXTS ANALYSIS 2014
  • Danielyan Tatiana Vladimirovna
RU2606873C2
SENTIMENT ANALYSIS AT LEVEL OF ASPECTS AND CREATION OF REPORTS USING MACHINE LEARNING METHODS 2016
  • Mikhajlov Maksim Borisovich
  • Pasechnikov Konstantin Alekseevich
RU2635257C1

RU 2 618 374 C1

Authors

Novitskij Valerij Igorevich

Indenbom Evgenij Mikhajlovich

Dates

2017-05-03 Published

2015-11-05 Filed