METHOD FOR AUTOMATED IDENTIFICATION OF LANGUAGE OR LINGUISTIC GROUP OF TEXT Russian patent published in 2017 - IPC G06F17/27 

Abstract RU 2607989 C1

FIELD: data processing.

SUBSTANCE: invention relates to automated identification of a language or a linguistic group (for example, Roman, German, Celtic, Slavic, etc.), which the analyzed text language belongs to. Method for automated identification of a language or a linguistic group of a text involves creating a set of identifying elements from a group of the most common verbs of each identified language or linguistic group and saving it on a data medium. Herewith the identifying elements used are grammatical forms and semantically significant parts of the verbs (roots or stems) of each identified language. Each identifying element of the set is compared with elements of the analyzed text. If matches of elements are observed, the language is identified by belonging of the matched elements to a certain language of the set.

EFFECT: technical result is providing the possibility of operation with multilingual texts and accurate identification of all the languages used in the analyzed text in case it includes forms of verbs from a set of identifying elements.

1 cl, 1 dwg

Similar patents RU2607989C1

Title Year Author Number
METHOD FOR ORDERING DATA SUBMITTED IN ALPHANUMERIC INFORMATION BLOCKS 2000
  • Pripachkin Ju.I.
  • Smentsarev G.V.
RU2210809C2
METHOD FOR AUTOMATED ANALYSIS OF TEXT AND SELECTION OF RELEVANT RECOMMENDATIONS TO IMPROVE READABILITY THEREOF 2021
  • Burov Anatolii Vladimirovich
  • Iliakhov Maksim Olegovich
RU2769427C1
SYSTEM AND METHOD FOR AUTOMATIC CREATION OF TEMPLATES 2018
  • Anisimovich Konstantin Vladimirovich
  • Garashchuk Ruslan Vladimirovich
  • Matskevich Stepan Evgenevich
RU2697647C1
COMPUTER SYSTEM AND METHOD FOR PREPARING TEXTS IN SOURCE LANGUAGE AND THEIR TRANSLATION INTO FOREIGN LANGUAGES 1993
  • Dzhejm G.Karbonell
  • Sharlin L. Gehllap
  • Timoti Dzh.Kharris
  • Dzhejms V.Khigdon
  • Dennis A.Khill
  • Dehvid K.Khadson
  • Dehvid Nehsleti
  • Mervin L.Rennikh
  • Peggi M.Anderson
  • Majkl M.Bauer
  • Roj F.Basdiker
  • Filip Dzh. Khejs
  • Brjus M.Maklaren
  • Iren Nirenburg
  • Ehrik Kh.Ribling
  • Linda M.Shmandt
  • Dzhon F.Svit
  • Katrin L.Bejker
  • Nikolas D.Braunlou
  • Aleksandr M.Frants
  • Sjuzn E.Kholm
  • Dzhon Robert Rassel Livitt
  • Deril V.Lonsdejl
  • Teruko Mitamura
  • Ehrik Kh.Njuberg
RU2136038C1
COMPREHENSIVE AUTOMATIC PROCESSING OF TEXT INFORMATION 2014
  • Danielyan Tatyana Vladimirovna
  • Starostin Anatolij Sergeevich
  • Zuev Konstantin Alekseevich
  • Anisimovich Konstantin Vladimirovich
  • Selegej Vladimir Pavlovich
RU2662699C2
METHOD FOR SYNTHESIS OF SELF-TEACHING SYSTEM FOR EXTRACTING KNOWLEDGE FROM TEXT DOCUMENTS FOR SEARCH ENGINES 2002
  • Nasypnyj Vladimir Vladimirovich
  • Nasypnaja Galina Anatol'Evna
RU2273879C2
METHOD FOR PRELIMINARY CONVERSION OF STRUCTURED DATA ARRAY 2014
  • Rogachev Igor' Petrovich
RU2571405C1
METHOD TO GENERATE MAP OF CONNECTIONS OF CONVERTED STRUCTURED DATA ARRAY COMPONENTS 2014
  • Rogachev Igor' Petrovich
RU2571407C1
METHOD OF DOUBLE-LEVEL SEARCH OF INFORMATION IN PREVIOUSLY CONVERTED STRUCTURED DATA ARRAY 2014
  • Rogachev Igor' Petrovich
RU2571406C1
METHOD OF SEARCHING FOR INFORMATION IN PRE-TRANSFORMED STRUCTURED DATA ARRAY 2014
  • Rogachev Igor' Petrovich
RU2572367C1

RU 2 607 989 C1

Authors

Kalegin Sergej Nikolaevich

Dates

2017-01-11Published

2015-07-08Filed