FIELD: computer engineering.
SUBSTANCE: invention relates to computer engineering for processing audio data. The result is achieved by a method for extracting speech processing segments, based on receiving an input speech signal, performing analogue-to-digital conversion and extracting a speech signal fragment. The dispersion (variance) of the noise is measured on a speech signal fragment of a given duration and is used to calculate a threshold and an optimum accumulation interval. The dispersion of the input signal is determined over this accumulation interval and compared with the calculated threshold. If the dispersion stays below the threshold over an interval equal to half the accumulation interval, that interval is classified as a pause; otherwise it is classified as speech. Discrete samples belonging to pauses are deleted, and the remaining samples are combined to obtain a speech section without pauses.
EFFECT: high probability of correct segmentation of a speech signal in conditions of noise.
1 cl, 1 dwg
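The procedure described in the SUBSTANCE paragraph can be illustrated with a minimal Python sketch. The abstract does not give the formulas for the threshold or for the optimum accumulation interval, so both are assumptions here: the threshold is taken as the noise variance scaled by a hypothetical factor `k`, and the accumulation interval is passed in as a hypothetical `window` parameter rather than derived from the noise. The function name `remove_pauses` and the synthetic test signal are likewise illustrative and are not part of the patent.

```python
import random
from statistics import pvariance


def remove_pauses(samples, noise_fragment, window, k=3.0):
    """Drop low-variance (pause) blocks and return the remaining samples.

    Assumptions (not from the patent text): threshold = k * noise variance,
    and `window` (the accumulation interval, in samples) is supplied by the caller.
    """
    threshold = k * pvariance(noise_fragment)
    hop = max(1, window // 2)  # one decision per half accumulation interval
    kept = []
    for start in range(0, len(samples), hop):
        frame = samples[start:start + window]  # variance is accumulated here
        block = samples[start:start + hop]     # half-interval being classified
        if pvariance(frame) >= threshold:      # not below threshold: speech
            kept.extend(block)
        # otherwise the block is treated as a pause and its samples are dropped
    return kept


# Synthetic example: quiet noise, a louder "speech" burst, then noise again.
random.seed(0)
noise = [random.gauss(0, 0.01) for _ in range(400)]
speech = [random.gauss(0, 0.5) for _ in range(400)]
mixed = noise[:200] + speech + noise[200:]

out = remove_pauses(mixed, noise, window=40)
print(len(mixed), len(out))
```

Because each half-interval is classified from the variance of the full interval that starts with it, a block just before a speech onset may be kept as speech. This is one reading of the decision rule in the abstract, and other readings are possible.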
Title | Year | Author | Number |
---|---|---|---|
METHOD OF RECOGNITION OF SEPARATE WORDS OF SPEECH WITH ADAPTATION TO ANNOUNCER | 1994 | | RU2047912C1 |
METHOD OF SELECTING SPEECH PROCESSING SEGMENTS BASED ON ANALYSIS OF CORRELATION DEPENDENCIES IN SPEECH SIGNAL | 2010 | | RU2445718C1 |
METHOD FOR ANNOUNCER AUTHENTICATION BY VOICE | 2022 | | RU2789689C1 |
METHOD OF DIVIDING SPEECH AND PAUSES BY VALUES OF DISPERSIONS OF AMPLITUDES OF SPECTRAL COMPONENTS | 2019 | | RU2723301C1 |
METHOD AND DEVICE FOR CLASSIFYING NOISY VOICE SEGMENTS USING MULTISPECTRAL ANALYSIS | 2014 | | RU2606566C2 |
DEVICE FOR IDENTIFYING ISOLATED WORDS | 1998 | | RU2136059C1 |
METHOD FOR SEPARATING SPEECH AND PAUSES BY ANALYZING CHARACTERISTICS OF SPECTRAL COMPONENTS OF MIXTURE OF SIGNAL AND NOISE | 2023 | | RU2814115C1 |
METHOD FOR RECOGNIZING SPOKEN WORDS | 2005 | | RU2296376C2 |
METHOD FOR HYBRID GENERATIVE-DISCRIMINATIVE SEGMENTATION OF SPEAKERS IN AUDIO-FLOW | 2013 | | RU2530314C1 |
METHOD OF SEPARATING SPEECH AND SPEECH-LIKE NOISE BY ANALYZING VALUES OF ENERGY AND PHASES OF FREQUENCY COMPONENTS OF SIGNAL AND NOISE | 2019 | | RU2700189C1 |
Authors
Dates
2025-03-31—Published
2024-04-23—Filed