ADAPTIVE AUDIO ENHANCEMENT FOR MULTICHANNEL SPEECH RECOGNITION Russian patent published in 2019 - IPC G10L15/20 G10L15/16 

Abstract RU 2698153 C1

FIELD: information technology.

SUBSTANCE: the invention discloses means for neural network adaptive beamforming for multichannel speech recognition. A first channel of audio data corresponding to a speech fragment and a second channel of audio data corresponding to the same speech fragment are received. Using a trained recurrent neural network, a first set of filter parameters for a first filter and a second set of filter parameters for a second filter are generated, each based on both the first and the second channel of audio data. A single combined channel of audio data is generated by combining the first-channel audio data filtered with the first filter and the second-channel audio data filtered with the second filter. The audio data of the single combined channel are input to a neural network trained as an acoustic model.
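
The data flow described above can be illustrated with a minimal Python sketch. It is not the patented implementation: the recurrent network here is a plain tanh cell with random, untrained weights, and the per-frame filter prediction, the frame length, the number of filter taps, and all function names (`predict_filters`, `filter_and_sum`) are assumptions introduced only for illustration. The acoustic model is omitted.

```python
import numpy as np

def predict_filters(frames_c1, frames_c2, params, num_taps):
    """Toy recurrent network that emits one pair of FIR filters per frame.

    Stand-in for the trained recurrent neural network of the abstract;
    the weights passed in `params` are random here, not trained.
    """
    w_in, w_rec, w_out = params
    hidden = np.zeros(w_rec.shape[0])
    filters = []
    for x1, x2 in zip(frames_c1, frames_c2):
        # Both filter parameter sets depend on both channels.
        inputs = np.concatenate([x1, x2])
        hidden = np.tanh(w_in @ inputs + w_rec @ hidden)
        taps = w_out @ hidden  # shape: (2 * num_taps,)
        filters.append((taps[:num_taps], taps[num_taps:]))
    return filters

def filter_and_sum(frames_c1, frames_c2, filters):
    """Filter each channel with its own filter, then sum into one channel."""
    combined = []
    for x1, x2, (h1, h2) in zip(frames_c1, frames_c2, filters):
        y1 = np.convolve(x1, h1, mode="same")
        y2 = np.convolve(x2, h2, mode="same")
        combined.append(y1 + y2)
    return np.stack(combined)

# Hypothetical sizes, chosen only to make the example small.
rng = np.random.default_rng(0)
frame_len, num_frames, num_taps, hidden_size = 160, 4, 25, 32

channel_1 = rng.standard_normal((num_frames, frame_len))
channel_2 = rng.standard_normal((num_frames, frame_len))
params = (
    0.1 * rng.standard_normal((hidden_size, 2 * frame_len)),
    0.1 * rng.standard_normal((hidden_size, hidden_size)),
    0.1 * rng.standard_normal((2 * num_taps, hidden_size)),
)

filters = predict_filters(channel_1, channel_2, params, num_taps)
combined = filter_and_sum(channel_1, channel_2, filters)
print(combined.shape)  # (4, 160)
# `combined` is what would be passed on to the acoustic model.
```

In this sketch the filters change from frame to frame because the recurrent state is updated with every new pair of input frames, which is one way to read "adaptive" in the title; the abstract itself does not specify the update rate.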

EFFECT: high efficiency of speech recognition.

20 cl, 5 dwg

Similar patents RU2698153C1

Title Year Author Number
TECHNOLOGY FOR ANALYZING ACOUSTIC DATA FOR SIGNS OF COVID-19 DISEASE 2021
  • Samsonov Pavel Romanovich
  • Mikhajlov Dmitrij Mikhajlovich
  • Chumanskaya Vera Vasilevna
  • Dvoryankin Sergej Vladimirovich
RU2758649C1
METHOD FOR IMPROVING A SPEECH SIGNAL WITH A LOW DELAY, A COMPUTING DEVICE AND A COMPUTER-READABLE MEDIUM THAT IMPLEMENTS THE ABOVE METHOD 2023
  • Babaev Nicholas Andrew
  • Andreev Pavel Konstantinovich
  • Saginbaev Azat Rustamovich
  • Shchekotov Ivan Sergeevich
RU2802279C1
CUSTOMIZED OUTPUT WHICH IS OPTIMIZED FOR USER PREFERENCES IN DISTRIBUTED SYSTEM 2020
  • Yoshioka, Takuya
  • Stolcke, Andreas
  • Chen, Zhuo
  • Dimitriadis, Dimitrios, Basile
  • Zeng, Nanshan
  • Qin, Lijuan
  • Hinthorn, William, Isaac
  • Huang, Xuedong
RU2821283C2
METHOD FOR AUDIOVISUAL RECOGNITION OF PERSONAL PROTECTION EQUIPMENT ON HUMAN FACE 2022
  • Riumina Elena Vitalevna
  • Markitantov Maksim Viktorovich
  • Riumin Dmitrii Aleksandrovich
  • Karpov Aleksei Anatolevich
RU2791415C1
METHOD OF MULTIMODAL CONTACTLESS CONTROL OF MOBILE INFORMATION ROBOT 2020
  • Ryumin Dmitrij
  • Kipyatkova Irina Sergeevna
  • Kagirov Ildar Amirovich
  • Aksenov Aleksandr
  • Karpov Aleksej Anatolevich
RU2737231C1
DEVICE, METHOD, OR COMPUTER PROGRAM FOR GENERATING AN EXTENDED-BAND AUDIO SIGNAL USING A NEURAL NETWORK PROCESSOR 2018
  • Schmidt, Konstantin
  • Uhle, Christian
  • Edler, Bernd
RU2745298C1
SPEAKER VERIFICATION 2017
  • Moreno, Ignacio Lopez
  • Wan, Li
  • Wang, Quan
RU2697736C1
TRAINING NEURAL NETWORKS USING LOSS FUNCTIONS REFLECTING RELATIONSHIPS BETWEEN NEIGHBOURING TOKENS 2018
  • Eugene Indenbom
  • Daniil Anastasiev
RU2721190C1
RECOGNIZING OF MIXED SPEECH 2015
  • Yu, Dong
  • Weng, Chao
  • Seltzer, Michael L.
  • Droppo, James
RU2686589C2
SYSTEMS AND METHODS OF AUDIO SIGNAL PRODUCTION 2019
  • Zhou, Meilin
  • Liao, Fengyun
  • Qi, Xin
RU2804933C2

RU 2 698 153 C1

Authors

Li, Bo

Weiss, Ron J.

Bacchiani, Michiel A.U.

Sainath, Tara N.

Wilson, Kevin William

Dates

2019-08-22 Published

2016-12-28 Filed