FIELD: computer equipment for processing audio data.
SUBSTANCE: technical result is achieved by determining a primary voice activity detection solution (VAD) based on voice activity; determining the final VAD decision based on whether the primary decision signal tails are added; determining a measure of short-term voice activity based on past primary decisions; determining a measure of long-term voice activity based on past final decisions or past primary decisions; and determining an alternative final solution for adjusting the addition of signal tails based on a measure of short-term voice activity and a measure of long-term voice activity.
EFFECT: elimination of audio data reproduction artefacts with cutting off the ends of the last speech segments, such as a speech fragment ending with a non-speech explosion.
14 cl, 9 dwg
Title | Year | Author | Number |
---|---|---|---|
METHOD AND DEVICE TO DETECT VOICE ACTIVITY | 2013 |
|
RU2670785C9 |
METHOD AND DEVICE TO DETECT VOICE ACTIVITY | 2013 |
|
RU2609133C2 |
ESTIMATION OF BACKGROUND NOISE IN AUDIO SIGNALS | 2014 |
|
RU2618940C1 |
METHOD FOR ESTIMATING BACKGROUND NOISE, A UNIT FOR ESTIMATING BACKGROUND NOISE AND A COMPUTER-READABLE MEDIUM | 2014 |
|
RU2720357C2 |
ESTIMATION OF BACKGROUND NOISE IN AUDIO SIGNALS | 2015 |
|
RU2665916C2 |
ESTIMATING BACKGROUND NOISE IN AUDIO SIGNALS | 2015 |
|
RU2713852C2 |
ESTIMATION OF BACKGROUND NOISE IN AUDIO SIGNALS | 2020 |
|
RU2760346C2 |
NOISE-ROBUST SPEECH CODING MODE CLASSIFICATION | 2012 |
|
RU2584461C2 |
METHOD OF PRODUCING SPEECH ACTIVITY MODIFICATION FRAMES, SPEED ACTIVITY DETECTION DEVICE AND METHOD | 2015 |
|
RU2684194C1 |
SPEECH ENHANCEMENT WITH VOICE CLARITY | 2008 |
|
RU2469423C2 |
Authors
Dates
2022-03-24—Published
2018-10-10—Filed