FIELD: physics.
SUBSTANCE: method of classifying the text includes the initial creation of a semantic dictionary in the form of a repository of semantic characteristics of words. Then the spoken language is recognized, and the text is received. In the result of the spoken language recognition, each word is selected in the text. A plurality of semantic characteristics in the semantic dictionary is found for each selected word. On the basis of the identified plurality of the semantic characteristics, the semantic consistency of, at least, one word sequence is determined to obtain a phrase. A plurality of phrases is selected from the text with a comparison of their semantic characteristics and the selection of the results of comparison of the dominant semantic characteristics. A plurality of phrases is converted into a plurality of key phrases containing a dominant semantic characteristic. A class is formed from the first received key phrases and their semantic characteristics.
EFFECT: increasing the accuracy of classifying the text files obtained as a result of recognizing speech in the telephone communication.
4 dwg, 1 tbl
Title | Year | Author | Number |
---|---|---|---|
INTERACTIVE SPEECH SIMULATION SYSTEM | 2023 |
|
RU2807436C1 |
EXTRACTING INFORMATION OBJECTS WITH THE HELP OF A CLASSIFIER COMBINATION | 2017 |
|
RU2679988C1 |
METHOD FOR AUTOMATIC SEMANTIC CLASSIFICATION OF NATURAL LANGUAGE TEXTS | 2013 |
|
RU2538304C1 |
MACHINE TRAINING | 2005 |
|
RU2391791C2 |
EXPANDING OF INFORMATION SEARCH POSSIBILITY | 2015 |
|
RU2618375C2 |
NAMED ENTITIES FROM THE TEXT AUTOMATIC EXTRACTION | 2014 |
|
RU2665239C2 |
RETRIEVAL OF INFORMATION OBJECTS USING A COMBINATION OF CLASSIFIERS ANALYZING LOCAL AND NON-LOCAL SIGNS | 2018 |
|
RU2686000C1 |
COMPUTER EQUIPMENT FOR READING OF PRINTED TEXT | 1996 |
|
RU2113726C1 |
SYSTEM AND METHOD FOR SEMANTIC SEARCH | 2013 |
|
RU2563148C2 |
METHOD OF ANALYSING TEXT DATA TONALITY | 2014 |
|
RU2571373C2 |
Authors
Dates
2017-08-22—Published
2016-07-25—Filed