FIELD: statistical language models, used in speech recognition systems.
SUBSTANCE: word indexes of bigrams are stored in form of common base with characteristic shifting. In one variant of realization, memory volume required for serial storage of bigram word indexes is compared to volume of memory, required for storage of indexes of bigram words in form of common base with characteristic shifting. Then indexes of bigram words are stored for minimization of size of data file of language model.
EFFECT: decreased memory volume needed for storing data structure of language model.
7 cl, 4 dwg
Title |
Year |
Author |
Number |
SPEECH RECOGNITION METHOD BASED ON TWO-LEVEL MORPHOPHONEMIC PREFIX GRAPH |
2015 |
- Ronzhin Andrej Leonidovich
- Karpov Aleksej Anatolevich
|
RU2597498C1 |
METHOD FOR RECOGNIZING WORDS IN CONTINUOUS SPEECH |
2005 |
- Agranovskij Aleksandr Vladimirovich
- Lednov Dmitrij Anatol'Evich
- Zulkarneev Mikhail Jur'Evich
- Arutjunjan Roman Ehrnstovich
|
RU2297676C2 |
DEVICES AND METHODS, WHICH BUILD THE HIERARCHIALLY ORDINARY DATA STRUCTURE, CONTAINING NONPARAMETERIZED SYMBOLS FOR DOCUMENTS IMAGES CONVERSION TO ELECTRONIC DOCUMENTS |
2013 |
- Chulinin Yurij Georgievich
|
RU2625533C1 |
RESOLUTION SEMANTIC AMBIGUITY BY STATISTICAL ANALYSIS |
2013 |
- Zuev Konstantin Alekseevich
- Bogdanova Daria Nikolaevna
|
RU2592395C2 |
METHOD AND SYSTEM FOR OBTAINING VECTOR REPRESENTATION OF ELECTRONIC TEXT DOCUMENT FOR CLASSIFICATION BY CATEGORIES OF CONFIDENTIAL INFORMATION |
2021 |
- Vyshegorodtsev Kirill Evgenevich
- Obolenskij Ivan Aleksandrovich
- Golovnya Maksim Sergeevich
|
RU2775358C1 |
RESOLUTION OF SEMANTIC AMBIGUITY USING LANGUAGE-INDEPENDENT SEMANTIC STRUCTURE |
2013 |
- Zuev Konstantin Alekseevich
- Bogdanova Daria Nikolaevna
|
RU2579699C2 |
METHODS AND DEVICES THAT CONVERT IMAGES OF DOCUMENTS TO ELECTRONIC DOCUMENTS USING TRIE-DATA STRUCTURES CONTAINING UNPARAMETERIZED SYMBOLS FOR DEFINITION OF WORD AND MORPHEMES ON DOCUMENT IMAGE |
2013 |
- Chulinin Yurij Georgievich
|
RU2631168C2 |
DEVICES AND METHODS USING A HIERARCHIALLY ORDERED DATA STRUCTURE CONTAINING UNPARAMETRIC SYMBOLS FOR CONVERTING DOCUMENT IMAGES TO ELECTRONIC DOCUMENTS |
2013 |
- Chulinin Yurij Georgievich
|
RU2643465C2 |
METHOD FOR PRELIMINARY PROCESSING OF TEXT |
2007 |
- Gusev Mikhail Nikolaevich
- Egorova Ol'Ga Borisovna
- Smirnov Valentin Aleksandrovich
|
RU2386178C2 |
CUSTOMIZED OUTPUT WHICH IS OPTIMIZED FOR USER PREFERENCES IN DISTRIBUTED SYSTEM |
2020 |
- Yoshioka, Takuya
- Stolcke, Andreas
- Chen, Zhuo
- Dimitriadis, Dimitrios, Basile
- Zeng, Nanshan
- Qin, Lijuan
- Hinthorn, William, Isaac
- Huang, Xuedong
|
RU2821283C2 |