FIELD: computer technology.
SUBSTANCE: invention relates to the field of computer technology for processing audio data by information processing systems, and in particular to methods for constructing speech recognition systems. The effect is achieved by determining the frequencies of the formants in the sections of the speech signal and phonemic recognition of each section of the speech signal by comparing its phonetic features with the available data bank separately for each speech sound, where sequences of speech signals are formed from the speech signal, which are separated from the original signal for the period of the analyzed frequencies, the differences of the generated signals with the original signal are calculated, the levels of the difference signals in the analyzed area are calculated, the frequencies corresponding to the minima of the calculated levels are selected, the selected frequencies are grouped in pairs, and the pairs are grouped from the selected frequencies differing by at least 90 chalk, after which a pair is selected that has a minimum distance in the frequency plane F1, F2, where F1, F2 are the frequency axes corresponding to the studied range of the speech signal, from base pairs placed in the database, with the assignment of the analyzed area to the phoneme value of the base phonological pair.
EFFECT: increasing the reliability of speaker-independent speech recognition.
5 cl, 5 dwg
Title | Year | Author | Number |
---|---|---|---|
METHOD OF IDENTIFYING SPEAKER FROM ARBITRARY SPEECH PHONOGRAMS BASED ON FORMANT EQUALISATION | 2009 |
|
RU2419890C1 |
SPEECH RECOGNITION METHOD BASED ON TWO-LEVEL MORPHOPHONEMIC PREFIX GRAPH | 2015 |
|
RU2597498C1 |
METHOD FOR RECOGNIZING SPOKEN WORDS | 2005 |
|
RU2296376C2 |
METHOD AND SYSTEM FOR LEXICAL INTERPRETATION OF FUSED SPEECH | 1997 |
|
RU2119196C1 |
METHOD FOR RECOGNITION OF WORDS IN CONTINUOUS SPEECH AND DEVICE WHICH IMPLEMENTS SAID METHOD | 1996 |
|
RU2101782C1 |
METHOD FOR RECOGNITION OF SPEECH PATTERNS AND DEVICE FOR REALIZATION OF METHOD | 2004 |
|
RU2268504C9 |
METHOD OF RECOGNITION OF SEPARATE WORDS OF SPEECH WITH ADAPTATION TO ANNOUNCER | 1994 |
|
RU2047912C1 |
METHOD FOR HYBRID GENERATIVE-DISCRIMINATIVE SEGMENTATION OF SPEAKERS IN AUDIO-FLOW | 2013 |
|
RU2530314C1 |
SYSTEM AND METHOD OF SPEECH RECOGNITION | 2011 |
|
RU2466468C1 |
SYSTEM AND METHOD OF CONVERTING VOICE SIGNAL INTO TRANSCRIPT PRESENTATION WITH METADATA | 2014 |
|
RU2589851C2 |
Authors
Dates
2021-12-27—Published
2021-07-06—Filed