FIELD: data processing.
SUBSTANCE: the invention relates to automated identification of the language or linguistic group (for example, Romance, Germanic, Celtic, Slavic) to which the language of an analyzed text belongs. The method involves creating a set of identifying elements from the most common verbs of each language or linguistic group to be identified and saving that set on a data medium. The identifying elements are grammatical forms and semantically significant parts (roots or stems) of those verbs. Each identifying element of the set is compared with the elements of the analyzed text. When elements match, the language is identified as the one to which the matched elements belong in the set.
EFFECT: the technical result is the ability to work with multilingual texts and to accurately identify every language used in the analyzed text, provided the text contains verb forms from the set of identifying elements.
1 cl, 1 dwg
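The comparison step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function name `identify_languages`, the `IDENTIFYING_ELEMENTS` table, and the few verb forms in it are hypothetical placeholders chosen for illustration. Matching is done on whole words only, so the root and stem matching mentioned in the abstract is left out.

```python
import re

# Hypothetical identifying elements: a few forms of very common verbs
# per language. Illustrative only; these lists are not from the patent.
IDENTIFYING_ELEMENTS = {
    "English": {"is", "are", "have", "has"},
    "German": {"ist", "sind", "haben", "habe"},
    "French": {"est", "sont", "avons", "avez"},
}


def identify_languages(text, elements=IDENTIFYING_ELEMENTS):
    """Return {language: matched verb forms} for every language whose
    identifying elements occur among the words of the text."""
    # Split the analyzed text into lowercase word tokens.
    tokens = set(re.findall(r"\w+", text.lower()))
    matches = {}
    for language, forms in elements.items():
        hits = forms & tokens  # elements present in both set and text
        if hits:
            matches[language] = hits
    return matches


result = identify_languages("The cat is here. Der Hund ist hier.")
print(sorted(result))
```

Because every language in the set is checked independently, a text that mixes languages yields one entry per language found, which corresponds to the multilingual case named in the EFFECT paragraph. A text with no verb form from the set yields an empty result.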
Title | Year | Author | Number |
---|---|---|---|
METHOD FOR ORDERING DATA SUBMITTED IN ALPHANUMERIC INFORMATION BLOCKS | 2000 | | RU2210809C2 |
METHOD FOR AUTOMATED ANALYSIS OF TEXT AND SELECTION OF RELEVANT RECOMMENDATIONS TO IMPROVE READABILITY THEREOF | 2021 | | RU2769427C1 |
SYSTEM AND METHOD FOR AUTOMATIC CREATION OF TEMPLATES | 2018 | | RU2697647C1 |
COMPUTER SYSTEM AND METHOD FOR PREPARING TEXTS IN SOURCE LANGUAGE AND THEIR TRANSLATION INTO FOREIGN LANGUAGES | 1993 | | RU2136038C1 |
COMPREHENSIVE AUTOMATIC PROCESSING OF TEXT INFORMATION | 2014 | | RU2662699C2 |
METHOD FOR SYNTHESIS OF SELF-TEACHING SYSTEM FOR EXTRACTING KNOWLEDGE FROM TEXT DOCUMENTS FOR SEARCH ENGINES | 2002 | | RU2273879C2 |
METHOD FOR PRELIMINARY CONVERSION OF STRUCTURED DATA ARRAY | 2014 | | RU2571405C1 |
METHOD TO GENERATE MAP OF CONNECTIONS OF CONVERTED STRUCTURED DATA ARRAY COMPONENTS | 2014 | | RU2571407C1 |
METHOD OF DOUBLE-LEVEL SEARCH OF INFORMATION IN PREVIOUSLY CONVERTED STRUCTURED DATA ARRAY | 2014 | | RU2571406C1 |
METHOD OF SEARCHING FOR INFORMATION IN PRE-TRANSFORMED STRUCTURED DATA ARRAY | 2014 | | RU2572367C1 |
Authors
Dates
2017-01-11—Published
2015-07-08—Filed